Skip to content

Instantly share code, notes, and snippets.

View chrisk314's full-sized avatar

Chris Knight chrisk314

View GitHub Profile
@chrisk314
chrisk314 / archive_to_bigtable.py
Created November 24, 2016 10:30
Spark job script to load data messages from Moreover archive and insert them into BigTable.
from __future__ import print_function
import json
import sys
import os
os.environ["THEANO_FLAGS"] = "base_compiledir=/home/csk13/.theano"
import cPickle as pkl
import logging
import random
from time import time