Skip to content

Instantly share code, notes, and snippets.

View jaganadhg's full-sized avatar
🎯
Focusing

Jaganadh Gopinadhan jaganadhg

🎯
Focusing
View GitHub Profile
@jaganadhg
jaganadhg / Scipy_linier_regression
Created October 9, 2013 14:09
Scipy_linier_regression
{
"metadata": {
"name": "Scipy_linier_regression"
},
"nbformat": 3,
"nbformat_minor": 0,
"worksheets": [
{
"cells": [
{
@jaganadhg
jaganadhg / pdf_table_with Tesseract
Created October 9, 2013 14:14
Extract Data from PDF table using Python Image. Image Magick and tesseract
#Refer http://craiget.com/extracting-table-data-from-pdfs-with-ocr/
import Image, ImageOps
import subprocess, sys, os, glob
# minimum run of adjacent pixels to call something a line
H_THRESH = 300
V_THRESH = 300
def get_hlines(pix, w, h):
"""Get start/end pixels of lines containing horizontal runs of at least THRESH black pix"""
@jaganadhg
jaganadhg / Hadoop Machine Learning Links
Last active December 25, 2015 02:19
Hadoop Machine Learning Links
@jaganadhg
jaganadhg / NLTK NLp
Created October 10, 2013 09:40
NLTK NLP Links
http://stackoverflow.com/questions/10098533/implementing-bag-of-words-naive-bayes-classifier-in-nltk
http://stackoverflow.com/questions/10592605/save-naivebayes-classifier-to-disk-in-scikits-learn
https://github.com/ianozsvald/social_media_brand_disambiguator
http://techupd.blogspot.in/2012/03/nltk-python-module-for-natural-language.html
http://gavinmhackeling.com/blog/2013/02/named-entity-extraction-with-nltk-and-python/
@jaganadhg
jaganadhg / CTM
Created October 10, 2013 17:49
Clinical TM
http://www.techworld.com.au/article/451309/unsw_project_spotlights_text_mining_language_analysis/
http://text.mine.unsw.edu.au/resources?cate=Online%20Annotation%20Tools
http://bioportal.bioontology.org/resource_index
http://www.la-press.com/unblocking-blockbusters-using-boolean-text-mining-to-optimise-clinical-article-a1596-abstract
http://www.linguamatics.com/welcome/software/I2E.html
@jaganadhg
jaganadhg / 15102013
Created October 15, 2013 15:36
15102013
NER
http://stackoverflow.com/questions/10585864/ner-naive-algorithm
https://github.com/emory-libraries-disc/name-dropper
https://gist.github.com/shlomibabluki/6333174
https://www.googleapis.com/freebase/v1/search
http://pyke.sourceforge.net/
@jaganadhg
jaganadhg / 15102013A
Created October 15, 2013 15:46
Python Logic Programming
https://github.com/logpy/logpy
http://www.amazon.com/Design-Logic-Programming-Python-Hands/dp/0595408109
https://sites.google.com/site/pydatalog/Online-datalog-tutorial
http://pyke.sourceforge.net/
http://code.activestate.com/recipes/303057-pythologic-prolog-syntax-in-python/
@jaganadhg
jaganadhg / entropy-calculator
Created October 23, 2013 12:13
A fork and cleaned version of entropy-calculator http://code.google.com/p/entropy-calculator/
#!/usr/bin/env python
"""
Module to find keywords using the Montemurro and Zanette algorithm.
Created by Dr Peter J. Bleackley
"""
from math import log, exp, lgamma
import re
class EntropyCalculator(object):
@jaganadhg
jaganadhg / NPB
Created November 4, 2013 17:12
New Papers and Booka
1) http://www.intechopen.com/books/theory-and-applications-for-advanced-text-mining Text Mining
2) http://blog.scripted.com/staff/nlp-hacking-in-python/ -- NLP Hacking Python
3) https://github.com/barmalei/scalpel Sclapel NLP (NC)
4) http://www.esp.uem.es/jmgomez/tmweka/index.html Weka Text Mining
5) http://www.idilia.com/developer/sense-annotated-datasets/ SENSE ANNOTATED DATASET
6) http://www.cl.cam.ac.uk/teaching/1011/L100/introling.pdf LINGUISTICS
@jaganadhg
jaganadhg / Django_apache
Created November 17, 2013 07:11
Django Apache Deployment
1) http://thecodeship.com/deployment/deploy-django-apache-virtualenv-and-mod_wsgi/
2) http://blog.lipautz.org/managing-apache-virtualhosts-on-fedora-16/
3) http://docs.fedoraproject.org/en-US/Fedora/13/html/Deployment_Guide/s1-apache-virtualhosts.html
4) https://library.linode.com/frameworks/django-apache-mod-wsgi/centos-5
5) https://library.linode.com/frameworks/django-apache-mod-wsgi/centos-5