Skip to content

Instantly share code, notes, and snippets.

@zhiyzuo
Last active September 16, 2017 05:40
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save zhiyzuo/c643ddea5f4ad57ed576998e62477fae to your computer and use it in GitHub Desktop.
Save zhiyzuo/c643ddea5f4ad57ed576998e62477fae to your computer and use it in GitHub Desktop.
Data resource

This gist will be a place for some interesting data source I found but have no time to collect/think about yet.

__author__ = "Zhiya Zuo"
__email__ = "zhiyazuo@gmail.com"

Toy/Experimental datasets

This section lists datasets to play with or experiment with (e.g., test/evaluate/validate/learn new algorithms.)

A list of classic network datasets compiled by Prof. Mark Newman at the University of Michigan.

Specifically for machine learning and data mining purposes. Has some very classic dataset to play with.


Search engines for datasets


Business

Yelp has a lot of datasets for text mining, network analysis, etc.

Alpha Vantage APIs are grouped into three categories: (1) Time Series Data, (2) Technical Indicators, and (3) Sector Performances. All APIs are realtime: the latest data points are derived from the current trading day. Examples in this documentation are for demo purposes.

-Value Viz Free for 10 requests per hour.


Research

Genealogy

Academic genealogy dataset. Probabily the one with the best coverage is the one of [neuroscience] (https://neurotree.org/neurotree/). Can I scrape it or just ask for the data?

This one is specifically for Math.


Social Network

I have been scraping the beer ratings but have not checked out their social network. Seems to be an interesting dataset.


Education

This one has many interesting dataset (including salary) but needs subscription (UIowa does have it).

Faculty Salary

This one can be utilized to find Iowa public univerity faculty salaries (or maybe local government) with appropriate queries in the HTTP request.

Public universities in Illinois. Actually the website has many other data including enrollment and degree program but it is harder to scrape.

The 2017 iteration of the survey includes information on salary and benefits for more than 380,000 faculty members from 1,022 institutions in all 50 states and Puerto Rico.


Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment