Skip to content

Instantly share code, notes, and snippets.

View KhepryQuixote's full-sized avatar

Khepry Quixote KhepryQuixote

View GitHub Profile
@KhepryQuixote
KhepryQuixote / PyPubChemXtractor.py
Last active November 17, 2023 09:53
Python 3 script to extract Chemical Abstract Society Registry Numbers (CASRNs) from PubChem's CID Synonym file.
# -*- coding: utf-8 -*-
'''
Name: PyPubChemXtractor.py
Author: Khepry Quixote
Date: 11 Nov 2014
Language: Python 3.4
Narrative:
This Python 3.4 program will extract CASRN values from PubChem's
@KhepryQuixote
KhepryQuixote / PyTorStemPrivoxy.md
Last active September 9, 2023 20:36
Python script to connect to Tor via Stem and Privoxy, requesting a new connection (hence a new IP as well) as desired.

Crawling Anonymously with Tor in Python

adapted from the article "Crawling anonymously with Tor in Python" by S. Acharya, Nov 2, 2013.

The most common use-case is to be able to hide one's identity using TOR or being able to change identities programmatically, for example when you are crawling a website like Google and you don’t want to be rate-limited or blocked via IP address.

Tor

Install Tor.

@KhepryQuixote
KhepryQuixote / PyFrackETL.py
Last active May 4, 2021 14:12
Python scripts to download, extract, and output oil, gas, and/or fracking well information from various states in the United States of America.
# -*- coding: utf-8 -*-
'''
This Python script is oriented towards oil, gas, and fracking well
information download, extract, and output at the state-level within
the United States of America. Not all states are present as not all
states either allow fracking or have any sites available for download.
This Python script is dependent upon another Python script entitled
Src2Tgt.py, which is also available as a Gist under the same name.
@KhepryQuixote
KhepryQuixote / Linux-Tor-Install.md
Created November 10, 2014 20:58
Linux-Tor-Install.md
  • 7zip
    • p7zip
    • p7zip-full
    • sudo apt-get install p7zip p7zip-full
  • nautilus
    • nautilus-open-terminal
    • sudo apt-get install nautilus-open-terminal
  • browsers
    • firefox
    • chromium-browser
@KhepryQuixote
KhepryQuixote / Src2Tgt.py
Created September 30, 2014 20:00
Python scripts to download, extract, and output oil, gas, and/or fracking well information from various states in the United States of America.
# -*- coding: utf-8 -*-
'''
This Python script is oriented towards oil, gas, and fracking well
information download, extract, and output at the state-level within
the United States of America. Not all states are present as not all
states either allow fracking or have any sites available for download.
This Python script is depended upon by another Python script entitled
PyFrackETL.py, which is also available as a Gist under the same name.