Skip to content

Instantly share code, notes, and snippets.

@punchouty
Last active August 1, 2018 13:12
Show Gist options
  • Save punchouty/7c9c93c4816d336022aecbe0f9dc49c0 to your computer and use it in GitHub Desktop.
Save punchouty/7c9c93c4816d336022aecbe0f9dc49c0 to your computer and use it in GitHub Desktop.
Cross Ref
xz -d crossrefworks.json.xz
mongod --dbpath /Volumes/Seagate/data
mongoimport --db thomson --collection works --file /Volumes/Seagate/crossrefworks.json
mongoexport --db crossref --collection names --out /Volumes/Seagate/names.json
/Volumes/Seagate/elasticsearch-5.5.0/bin/elasticsearch
https://www.quora.com/What-is-a-good-algorithm-service-for-fuzzy-matching-of-peoples-names-for-de-duplication
http://ntz-develop.blogspot.com/2011/03/phonetic-algorithms.html
https://commons.apache.org/proper/commons-text/javadocs/api-release/index.html
https://www.ibm.com/support/knowledgecenter/en/SSEV5M_4.2.0/com.ibm.iis.gnm.install.doc/topics/gnr_tsk_installingconfiguring.html
https://commons.apache.org/proper/commons-codec/apidocs/index.html?org/apache/commons/codec/language/Soundex.html
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment