This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
{
 "metadata": {
  "name": "Scipy_linier_regression"
 },
 "nbformat": 3,
 "nbformat_minor": 0,
 "worksheets": [
  {
   "cells": [
    {
# Refer to http://craiget.com/extracting-table-data-from-pdfs-with-ocr/
from PIL import Image, ImageOps
import subprocess, sys, os, glob

# minimum run of adjacent pixels to call something a line
H_THRESH = 300
V_THRESH = 300

def get_hlines(pix, w, h):
    """Get start/end pixels of lines containing horizontal runs of at least H_THRESH black pixels"""
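The body of `get_hlines` is not shown in the preview above. As a rough illustration of what its docstring describes, here is a minimal sketch under stated assumptions: the function name `find_hline_bands`, the `thresh` and `black` parameters, and the convention that a black pixel has value 0 are all hypothetical, and `pix` is assumed to be anything indexable as `pix[x, y]` (a plain dict is used here so the example runs without an image library). It is not the original author's implementation.

```python
def find_hline_bands(pix, w, h, thresh=300, black=0):
    """Return (first_row, last_row) for each band of consecutive rows that
    contain a horizontal run of at least `thresh` black pixels.

    Hypothetical helper: `pix` only needs to support pix[x, y] lookups.
    """
    bands = []
    start = None  # first row of the band currently being tracked, if any
    for y in range(h):
        run = longest = 0
        for x in range(w):
            if pix[x, y] == black:
                run += 1
                longest = max(longest, run)
            else:
                run = 0  # a non-black pixel breaks the current run
        if longest >= thresh:
            if start is None:
                start = y  # a new band begins on this row
        elif start is not None:
            bands.append((start, y - 1))  # the band ended on the previous row
            start = None
    if start is not None:
        bands.append((start, h - 1))  # band runs to the bottom edge
    return bands


# Toy 6x5 "image": rows 1 and 2 are fully black, all other pixels white (255).
w, h = 6, 5
pix = {(x, y): (0 if y in (1, 2) else 255) for x in range(w) for y in range(h)}
print(find_hline_bands(pix, w, h, thresh=5))  # [(1, 2)]
```

Grouping adjacent qualifying rows into one band means a ruled line several pixels thick is reported once rather than once per row; whether the original code does the same is an assumption.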
1] Item Based Collaborative Filtering with Hadoop http://craiget.com/item-based-collaborative-filtering-with-hadoop/
2] A Little Job Scraper http://craiget.com/a-little-job-scraper/
3] Finding Anagrams with Python http://craiget.com/finding-anagrams-with-python/
4] Fetching Android Market Stats with Selenium RC http://craiget.com/fetching-android-market-stats-with-selenium-rc/
5] Pandas Big query http://nbviewer.ipython.org/6459195
http://stackoverflow.com/questions/10098533/implementing-bag-of-words-naive-bayes-classifier-in-nltk
http://stackoverflow.com/questions/10592605/save-naivebayes-classifier-to-disk-in-scikits-learn
https://github.com/ianozsvald/social_media_brand_disambiguator
http://techupd.blogspot.in/2012/03/nltk-python-module-for-natural-language.html
http://gavinmhackeling.com/blog/2013/02/named-entity-extraction-with-nltk-and-python/
http://www.techworld.com.au/article/451309/unsw_project_spotlights_text_mining_language_analysis/
http://text.mine.unsw.edu.au/resources?cate=Online%20Annotation%20Tools
http://bioportal.bioontology.org/resource_index
http://www.la-press.com/unblocking-blockbusters-using-boolean-text-mining-to-optimise-clinical-article-a1596-abstract
http://www.linguamatics.com/welcome/software/I2E.html
NER
http://stackoverflow.com/questions/10585864/ner-naive-algorithm
https://github.com/emory-libraries-disc/name-dropper
https://gist.github.com/shlomibabluki/6333174
https://www.googleapis.com/freebase/v1/search
http://pyke.sourceforge.net/
https://github.com/logpy/logpy
http://www.amazon.com/Design-Logic-Programming-Python-Hands/dp/0595408109
https://sites.google.com/site/pydatalog/Online-datalog-tutorial
http://pyke.sourceforge.net/
http://code.activestate.com/recipes/303057-pythologic-prolog-syntax-in-python/
#!/usr/bin/env python
"""
Module to find keywords using the Montemurro and Zanette algorithm.
Created by Dr Peter J. Bleackley
"""
from math import log, exp, lgamma
import re

class EntropyCalculator(object):
1) http://www.intechopen.com/books/theory-and-applications-for-advanced-text-mining Text Mining
2) http://blog.scripted.com/staff/nlp-hacking-in-python/ -- NLP Hacking in Python
3) https://github.com/barmalei/scalpel Scalpel NLP (NC)
4) http://www.esp.uem.es/jmgomez/tmweka/index.html Weka Text Mining
5) http://www.idilia.com/developer/sense-annotated-datasets/ Sense-Annotated Datasets
6) http://www.cl.cam.ac.uk/teaching/1011/L100/introling.pdf Linguistics
1) http://thecodeship.com/deployment/deploy-django-apache-virtualenv-and-mod_wsgi/
2) http://blog.lipautz.org/managing-apache-virtualhosts-on-fedora-16/
3) http://docs.fedoraproject.org/en-US/Fedora/13/html/Deployment_Guide/s1-apache-virtualhosts.html
4) https://library.linode.com/frameworks/django-apache-mod-wsgi/centos-5