datasourceName,about,link,tool,categoryName,vintage
1000 Genomes,1000 Genomes,http://www.1000genomes.org/data,,Biology,NA
CCLE,Broad Cancer Cell Line Encyclopedia (CCLE),http://www.broadinstitute.org/ccle/home,,Biology,NA
BBBC,Broad Bioimage Benchmark Collection (BBBC),https://www.broadinstitute.org/bbbc,,Biology,NA
Cell Image,Cell Image Library,http://www.cellimagelibrary.org,,Biology,NA
Complete Genomics,Complete Genomics Public Data,http://www.completegenomics.com/public-data/69-genomes/,,Biology,NA
EBI ArrayExpress,EBI ArrayExpress,http://www.ebi.ac.uk/arrayexpress/,,Biology,NA
EBI Protein,EBI Protein Data Bank in Europe,http://www.ebi.ac.uk/pdbe/emdb/index.html/,,Biology,NA
EMPIAR,Electron Microscopy Pilot Image Archive (EMPIAR),http://www.ebi.ac.uk/pdbe/emdb/empiar/,,Biology,NA
ENCODE project,ENCODE project,https://www.encodeproject.org,,Biology,NA
datasetName,about,link,categoryName,cloud,vintage
Microbiome Project,American Gut (Microbiome Project),https://github.com/biocore/American-Gut,Biology,GitHub,NA
GloBI,Global Biotic Interactions (GloBI),https://github.com/jhpoelen/eol-globi-data/wiki#accessing-species-interaction-data,Biology,GitHub,NA
Global Climate,Global Climate Data Since 1929,http://en.tutiempo.net/climate,Climate/Weather,,1929
CommonCrawl 2012,3.5B Web Pages from CommonCrawl 2012,http://www.bigdatanews.com/profiles/blogs/big-data-set-3-5-billion-web-pages-made-available-for-all-of-us,Computer Networks,,2012
Indiana Webclicks,53.5B Web clicks of 100K users in Indiana Univ.,http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset/,Computer Networks,,NA
Criteo click-through,Criteo click-through data,http://labs.criteo.com/2015/03/criteo-releases-its-new-dataset/,Computer Networks,,NA
ICWSM 2009,ICWSM Data Challenge (since 2009),http://icwsm.cs.umbc.edu/,Data Challenges,,2009
KDD Cup,KDD Cup by Tencent 2012,http://www.kddcup2012.org/,Data
categoryName
Agriculture
Biology
Climate/Weather
Complex Networks
Computer Networks
Contextual Data
Data Challenges
Earth Science
Economics
{ "city" : "AGAWAM", "loc" : [ -72.622739, 42.070206 ], "pop" : 15338, "state" : "MA", "_id" : "01001" }
{ "city" : "CUSHMAN", "loc" : [ -72.51564999999999, 42.377017 ], "pop" : 36963, "state" : "MA", "_id" : "01002" }
{ "city" : "BARRE", "loc" : [ -72.10835400000001, 42.409698 ], "pop" : 4546, "state" : "MA", "_id" : "01005" }
{ "city" : "BELCHERTOWN", "loc" : [ -72.41095300000001, 42.275103 ], "pop" : 10579, "state" : "MA", "_id" : "01007" }
{ "city" : "BLANDFORD", "loc" : [ -72.936114, 42.182949 ], "pop" : 1240, "state" : "MA", "_id" : "01008" }
{ "city" : "BRIMFIELD", "loc" : [ -72.188455, 42.116543 ], "pop" : 3706, "state" : "MA", "_id" : "01010" }
{ "city" : "CHESTER", "loc" : [ -72.988761, 42.279421 ], "pop" : 1688, "state" : "MA", "_id" : "01011" }
{ "city" : "CHESTERFIELD", "loc" : [ -72.833309, 42.38167 ], "pop" : 177, "state" : "MA", "_id" : "01012" }
{ "city" : "CHICOPEE", "loc" : [ -72.607962, 42.162046 ], "pop" : 23396, "state" : "MA", "_id" : "01013" }
{ "city" : "CHICOPEE", "loc" : [ -72.57614