Skip to content

Instantly share code, notes, and snippets.

View dchud's full-sized avatar

Dan Chudnov dchud

  • @mitre (work) + CDC projects (work) + personal projects
  • Washington, DC, USA
View GitHub Profile
@dchud
dchud / README.txt
Last active November 9, 2018 12:29
Testing dbplus VM for analytics class
This describes installing a virtual machine configured for use in a data warehousing
for analytics course. Students will be working with Jupyter notebooks (Python, R, Spark),
the unix (ubuntu-14.04) command line, MySQL, Spyder, PostgreSQL, and a few other things
as they come up.
The box contains a lot of stuff, and is rather big compared to a standard Ubuntu ISO, say.
You will need at least 3Gb free on your host machine to download it, and probably at least
double that to run it. Because the download file is big, you will want to be on a network
with a fat pipe.
@dchud
dchud / config.json
Last active December 21, 2017 22:21
sample of deduped businesses
{
"field_names": ["estab_name", "site_address", "site_city", "site_state",
"site_zip", "nr_in_estab", "owner_type"],
"field_definitions": [{"field": "estab_name", "type": "String"},
{"field": "site_address", "type": "Address"},
{"field": "site_city", "type": "ShortString",
"Has Missing": true},
{"field": "site_state", "type": "ShortString",
"Has Missing": true},
{"field": "site_zip", "type": "ShortString",
@dchud
dchud / keybase.md
Created April 25, 2016 16:12
keybase.md

Keybase proof

I hereby claim:

  • I am dchud on github.
  • I am dchud (https://keybase.io/dchud) on keybase.
  • I have a public key ASCMTFzlMORcg1rB4eJb0Pb2JvL15aBDlpFvkJ10gHD8swo

To claim this, I am signing this object:

@dchud
dchud / followers.py
Created October 20, 2016 04:16
repeats fetching of follower lists from twitter for a small number of users
#!/usr/bin/env python
"""
Simple tool to fetch follower lists every n seconds and store them
with time-based filenames. Can be later merged, deduped, and fed to
users/lookup method to extract full user info.
"""
import argparse
import datetime
@dchud
dchud / ol-cover-identifiers.txt
Last active December 27, 2015 15:28
unique categories of identifiers found in open library cover identifier mapping file
1sbn
alecso
alexandriava.gov
alibris_id
almedina
amazon
amazon_asin#
amazon.ca_asin
amazon.co.uk_asin
amazon.de
@dchud
dchud / gist:6497887
Created September 9, 2013 16:11
archivesspace rc1 install log. ubuntu 12.04.
$ sudo apt-get update
$ sudo apt-get install default-jre
$ java -version
java version "1.6.0_27"
download archivesspace-1.0.0RC1.tar.gz from archivespace
$ gunzip archivesspace-1.0.0RC1.tar.gz
$ tar -xf archivesspace-1.0.0RC1.tar
@dchud
dchud / gist:5911546
Created July 2, 2013 17:57
diff to comment out all invoking of solr without apache2-level errors
dchud@gwdev-dchud12:~/public_html/ncsu-quicksearch (master *)$ git diff
diff --git a/bestbets/bestbets.php b/bestbets/bestbets.php
index ca547bd..703258a 100644
--- a/bestbets/bestbets.php
+++ b/bestbets/bestbets.php
@@ -45,7 +45,7 @@ class BestBet {
// instantiate new SolrPhpClient service with connection
// to best bets solr index
- $solr = new Apache_Solr_Service('HOST', PORT, 'SOLRULR'); // EDIT
@dchud
dchud / ercamp-examples.md
Last active December 18, 2015 19:09
API call examples for ERcamp 2013 at GW Libraries
@dchud
dchud / gist:5090701
Created March 5, 2013 14:35
I stand with Dale Askey.
http://mellenpress.com/mellenpress.cfm?pc=26
Hello, I am an academic librarian in the USA, and I am writing to urge you to drop all your legal proceedings against Dale Askey. I am not a specialist in the fields your authors cover, nor do I have any opinion about the scholarly quality about your published titles. As a research university librarian, however, I personally believe that your lawsuits against Mr. Askey and McMaster University are an affront to academic freedom and scholarship in general. You have many options for fora within which to disprove your critics that fall well within the sphere of scholarly communication and gracious professional discourse. But by slapping down lawsuits in reaction to what appears to be reasoned, polite criticism, you have crossed the boundary into unacceptable behavior.
I urge you to reconsider and to drop all of your legal actions against Mr. Askey immediately.
Thank you for your consideration, -Daniel Chudnov
@dchud
dchud / gist:4496191
Created January 9, 2013 19:43
settings to add to local_settings.py
DATA_DIR = '/home/dchud/projects/social-feed-manager/sfm/data'
TWITTER_DEFAULT_USERNAME = 'MY FANCY USERNAME'
TWITTER_CONSUMER_KEY = 'ITS CONSUMER KEY'
TWITTER_CONSUMER_SECRET = 'ITS CONSUMER SECRET'