Skip to content

Instantly share code, notes, and snippets.

@dotsdl
Last active March 31, 2017 21:48
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
Star You must be signed in to star a gist
Save dotsdl/0e0fbd409e3e102d0458 to your computer and use it in GitHub Desktop.
MDAnalysis performance improvements under new topology model
Display the source blob
Display the rendered blob
Raw
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@jandom
Copy link

jandom commented Dec 22, 2015

Wow, this looks awesome! This benchmark is largely a mode where "read-in one, big system", how is this expected to perform in "read in many, small systems"?

@dotsdl
Copy link
Author

dotsdl commented Dec 28, 2015

@jandom sorry didn't see this until now. Since the Topology is a collection of numpy arrays instead of a list of Atom objects, it should perform better for many smaller systems, too, and each one also has a smaller memory footprint since we only have as many attributes as we need, no duplication of data, etc.. We already see that we get a decent speedup on parsing a GRO file with this new scheme, but we also omitted guessing from it, too, so perhaps it's not a fair comparison.

Does that kinda answer your question?

@orbeckst
Copy link

orbeckst commented Jul 5, 2016

The notebook says that the benchmark system are not available but we recently put them on figshare (as also mentioned in the updated README for the vesicle_library):

A set of large vesicle systems, ranging in size from 1.75 M to 10 M particles are made available under doi:10.6084/m9.figshare.3406708.

@orbeckst
Copy link

@dotsdl please fix the notebook as it holds up MDAnalysis/MDAnalysis.github.io#41 (see also MDAnalysis/MDAnalysis.github.io#41 (comment) )

  • fix availability of vesicle library
  • remove stupid json warnings

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment