This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
<!DOCTYPE html> | |
<html> | |
<head> | |
<title>Leaflet Example</title> | |
<meta name="viewport" content="width=device-width, initial-scale=1.0"> | |
<!-- Import Leaflet assets --> | |
<link rel="stylesheet" href="http://leafletjs.com/dist/leaflet.css" /> | |
<script src="http://leafletjs.com/dist/leaflet.js"></script> |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import random | |
class MinHasher(object): | |
def __init__(self, n, universe_size, seed=None): | |
if seed != None: random.seed(seed) | |
self.hash_functions = [self._create_random_hash_function(universe_size) for i in range(n)] | |
def _create_random_hash_function(self, universe_size): | |
a = random.randint(0, universe_size) | |
b = random.randint(0, universe_size) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import csv, os | |
# This chunk iterates through all of the csv files in a directory, turns them | |
# into 2-dimensional arrays (lists of lists), and puts all those arrays into | |
# a list called "tables" | |
tables = [] | |
# Loop over all files in the current directory (which is what "." means) | |
for f in os.listdir('.'): |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"took": 14, | |
"timed_out": false, | |
"_shards": { | |
"total": 6, | |
"successful": 6, | |
"failed": 0 | |
}, | |
"hits": { | |
"total": 1419, |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"took": 2, | |
"timed_out": false, | |
"_shards": { | |
"total": 6, | |
"successful": 6, | |
"failed": 0 | |
}, | |
"hits": { | |
"total": 1, |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
name|id | |
1911 United|C00508200 | |
50 State Strategy|C00502633 | |
9-9-9 FUND|C00504241 | |
Accountability 2010|C00489641 | |
AFL-CIO Workers' Voices PAC|C00484287 | |
Alaskans Standing Together|C00489385 | |
America for the People|C00497081 | |
America Get Up|C00494278 | |
America Votes Action Fund|C00492520 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
AMAZON_ACCESS_KEY = 'WHATEVER' | |
AMAZON_SECRET_KEY = 'SECRET_WHATEVER' | |
# I'm old-school, so I like the AWS-S3 gem. It's just a lightweight wrapper around Amazon's API. | |
# https://github.com/marcel/aws-s3 | |
require "aws/s3" | |
include AWS::S3 | |
def publish_json!(bucket='int.nyt.com', path='applications/represent-json/', filename='foo.json') |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{ | |
"discipline_id":"AS", | |
"discipline_name":"Alpine Skiing", | |
"results": | |
[ | |
{ | |
"id":"ASM010", | |
"name":"Men's Downhill", | |
"competitor_type":"ATH", | |
"results": |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
import numpy | |
def get_similar(vec, matrix, K=10): | |
# Set up the query vector and the whole dataset for K-nearest neighbors query | |
qvector = numpy.array([vec]).transpose() | |
alldata = numpy.array(matrix).transpose() | |
# You can't get more neighbors than there are entities | |
ndata = alldata.shape[1] | |
K = K if K < ndata else ndata |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
''' | |
compare.py | |
Quickly produces a pairwise similarity matrix of lawmakers' roll call votes, given | |
an input *.ord matrix file from Poole, McCarty and Lewis: http://www.voteview.com/dwnl.htm | |
''' | |
import numpy, string | |
from scipy.spatial.distance import cdist |
OlderNewer