In an increasingly serverless tech industry, graph databases like Neo4J still require infrastructure provisioning. When you start a Neo4J instance, you run the risk of starting a server you may not use, thereby paying for uptime on a server that you think will scale up, but have no guarantee that it will.
AWS Glue has "crawlers" that can schematize JSON, text, and CSV files, and store that data in a serverless database, called AWS Athena. The output of a Glue crawler is typically a Parquet file that is stored in S3 (regular cloud storage for files), which Athena reads as a table in its database. AWS Glue also allows for Spark jobs that allow you to relationalize the output of a Crawler, meaning you can turn any unstructured data into structured data that can be queried with SQL in Athena. The fact that it uses Parquet also means it enforces strong data typing that typical CSVs and JSON files don't allow. It also compresses regula