Skip to content

Instantly share code, notes, and snippets.

View shreyaskarnik's full-sized avatar
🤖

Shreyas Karnik shreyaskarnik

🤖
View GitHub Profile
#This is my own interpatation of USA.gov PubSub feed with some tips and code from HarlanH from twitter.
#I am interested finding out links about which agency are shared from which part of US.
library(stringr)
library(plyr)
library(ggplot2)
library(scrapeR)
library(RJSONIO)
library(colorspace)
library(RColorBrewer)
library(maps)
@shreyaskarnik
shreyaskarnik / llda-learn.scala
Created August 3, 2011 14:36
Learning Labeled LDA Model using Stanford Topic Modeling Toolbox
// Stanford TMT Example 6 - Training a LabeledLDA model
// http://nlp.stanford.edu/software/tmt/0.3/
// tells Scala where to find the TMT classes
import scalanlp.io._;
import scalanlp.stage._;
import scalanlp.stage.text._;
import scalanlp.text.tokenize._;
import scalanlp.pipes.Pipes.global._;
@shreyaskarnik
shreyaskarnik / llda-infer.scala
Created August 3, 2011 14:38
Infer from Labeled LDA Model
// http://nlp.stanford.edu/software/tmt/0.3/
// tells Scala where to find the TMT classes
import scalanlp.io._;
import scalanlp.stage._;
import scalanlp.stage.text._;
import scalanlp.text.tokenize._;
import scalanlp.pipes.Pipes.global._;
import edu.stanford.nlp.tmt.stage._;
@shreyaskarnik
shreyaskarnik / example-highlight-extract.py
Created March 5, 2012 23:51
Python Code to Extract Highlighted Text from DOCX (Word 2007 and Up format)
#!usr/bin/python
# -*- coding: utf-8 -*-
from docx import *
document = opendocx(r'test.docx')
words = document.xpath('//w:r', namespaces=document.nsmap)
WPML_URI = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"
tag_rPr = WPML_URI + 'rPr'
tag_highlight = WPML_URI + 'highlight'
tag_val = WPML_URI + 'val'
tag_t = WPML_URI + 't'
@shreyaskarnik
shreyaskarnik / Dockerfile
Created October 3, 2013 19:56
Dockerfile for shreyask/mlnotebook
FROM lopter/raring-base
MAINTAINER Shreyas
# Basic Bootstrap
RUN echo "deb http://archive.ubuntu.com/ubuntu raring main universe" >> /etc/apt/sources.list
RUN apt-get update
RUN apt-get install -y libxml2-dev libxslt1.1 libxslt1-dev libevent-dev build-essential make python-dev python-dateutil libatlas-base-dev gfortran vim git wget libsqlite3-dev sqlite3 openssh-server libpq5 libpq-dev libpng-dev libjpeg8-dev libfreetype6-dev
RUN wget http://python-distribute.org/distribute_setup.py
RUN python distribute_setup.py
RUN wget --no-check-certificate http://raw.github.com/pypa/pip/master/contrib/get-pip.py
common: &common
issue: docker-registry server
loglevel: _env:LOGLEVEL:debug
storage_redirect: true
standalone: true
debug: _env:DEBUG:true
index_endpoint: _env:INDEX_ENDPOINT
disable_token_auth: _env:DISABLE_TOKEN_AUTH
privileged_key: _env:PRIVILEGED_KEY
search_backend: _env:SEARCH_BACKEND
panic: The chunksize parameter should be a number that is larger than 5*1024*1024
goroutine 1 [running]:
github.com/docker/distribution/registry/handlers.NewApp(0x7f3d6a6d3bd0, 0xc2080e8600, 0xc20802b378, 0x3, 0xc20802b3a0, 0x5, 0xc20803dfb0, 0x0, 0x0, 0x0, ...)
/go/src/github.com/docker/distribution/registry/handlers/app.go:90 +0x590
main.main()
/go/src/github.com/docker/distribution/cmd/registry/main.go:56 +0x590
goroutine 6 [syscall]:
os/signal.loop()
version: 0.1
loglevel: debug
storage:
s3:
accesskey: DOCKER
secretkey: DIST-TEST
region: us-west-2
bucket: mybucket
encrypt: true
secure: true
$> docker build -t distrubution .
Sending build context to Docker daemon 5.376 MB
Sending build context to Docker daemon
Step 0 : FROM golang:1.4
---> aac88aa7c9ab
Step 1 : ENV CONFIG_PATH /etc/docker/registry/config.yml
---> Using cache
---> c3623f914a24
Step 2 : ENV DISTRIBUTION_DIR /go/src/github.com/docker/distribution
---> Using cache
DEBU[0000] /sbin/iptables, [--wait -t nat -A OUTPUT -m addrtype --dst-type LOCAL ! --dst 127.0.0.0/8 -j DOCKER]
DEBU[0000] /sbin/iptables, [--wait -t filter -n -L DOCKER]
DEBU[0000] /sbin/iptables, [--wait -t filter -C FORWARD -o docker0 -j DOCKER]
INFO[0000] -job init_networkdriver() = OK (0)
WARN[0000] Your kernel does not support cgroup swap limit.
DEBU[0000] Restarting containers...
INFO[0000] docker daemon: 1.6.0 4749651; execdriver: native-0.2; graphdriver: aufs
INFO[0000] +job acceptconnections()
INFO[0000] -job acceptconnections() = OK (0)
INFO[0000] Daemon has completed initialization