grab-site 2.0 uses ludios/wpull for much faster HTML parsing using
html5-parser and implements faster ignore-matching using the
re2 module. These were the two major bottlenecks identified by pyflame and flamegraph.
grab-site processes should now reconnect reliably to
gs-server goes down and reappears. Please let me know if this is not the case.
grab-site 2.0 upgrade guide
Follow the new install instructions, which now require installing libxml2/libxslt/re2 dependencies and Python 3.7.x.
If you have custom ignore patterns, replace