Skip to content

Instantly share code, notes, and snippets.

@vanleantking
Created April 1, 2023 07:07
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save vanleantking/fc6500a83849ba3d1f5c1754a55a1c97 to your computer and use it in GitHub Desktop.
Save vanleantking/fc6500a83849ba3d1f5c1754a55a1c97 to your computer and use it in GitHub Desktop.
mongodb analytics tools
But MongoDB is a general-purpose database, and not a data analysis tool, so a common pattern when analysing data that's stored within MongoDB is to extract the results of a query into a Numpy array, or Pandas dataframe, and to run complex and potentially long running analyses using the toolkit those frameworks provide. Until recently, the performance hit of converting large amounts of BSON data, as provided by MongoDB into these data structures, has been slower than we'd like.
Ref:
https://www.mongodb.com/developer/languages/python/pymongoarrow-and-data-analysis/
https://arrow.apache.org/use_cases/
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment