Skip to content

Instantly share code, notes, and snippets.

View waleking's full-sized avatar
🏠
Working from home

weijing waleking

🏠
Working from home
View GitHub Profile
@waleking
waleking / ExpandEdinburghFSDCorpus.md
Created June 17, 2016 15:17 — forked from emaadmanzoor/ExpandEdinburghFSDCorpus.md
Expand the Edinburgh Twitter FSD corpus

Expand The Edinburgh Twitter FSD Corpus

The Python scripts attached here take care of the following tedious work, and should help one quickly get started with some real work on the corpus:

  • Respect the Twitter API rate limits and throttle API hits.
  • Don't hit the API for already expanded tweet ID's, so you can resume tweet expansion after stopping midway.
  • Parse the API response and dump it into the correct column in the sqlite3 database.
  • Gracefully handle exceptions while acquiring tweets from the API.
  • Wrap version 1.1 of the Twitter API.
  • Start from a specified tweet ID, assuming the input file is sorted in increasing order of tweet ID.
@waleking
waleking / 0_reuse_code.js
Created November 25, 2016 09:39
Here are some things you can do with Gists in GistBox.
// Use Gists to store code you would like to remember later on
console.log(window); // log the "window" object to the console