Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Star 1 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save ReadmeCritic/263e1b7146386abc7dda to your computer and use it in GitHub Desktop.
Save ReadmeCritic/263e1b7146386abc7dda to your computer and use it in GitHub Desktop.
Finding default branch for caesar0301/awesome-public-datasets
Found: master for caesar0301/awesome-public-datasets — An awesome list of high-quality open datasets in public domains (on-going). — 6089⭐️ — last updated 10 days ago
🔎 Checking 355 links
⚪ https://travis-ci.org/caesar0301/awesome-public-datasets.svg
⚪ https://groups.google.com/forum/#!forum/awesomepublicdatasets
✅ https://github.com/caesar0301/awesome-public-datasets
✅ https://github.com/bayandin/awesome-awesomeness
✅ https://travis-ci.org/caesar0301/awesome-public-datasets
✅ https://github.com/sindresorhus/awesome
✅ https://github.com/sindresorhus/awesome
✅ https://github.com/biocore/American-Gut
✅ http://crcns.org/data-sets
✅ https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg
✅ http://geneontology.org/page/download-annotations
✅ https://github.com/jhpoelen/eol-globi-data/wiki#accessing-species-interaction-data
✅ https://www.encodeproject.org
✅ http://www.plants.usda.gov/dl_all.html
🔶 301 http://www.infobiotic.net/PSPbenchmarks/
✅ http://www.hmpdacc.org/reference_genomes/reference_genomes.php
✅ http://www.ncbi.nlm.nih.gov/Traces/sra/
🔴 404 http://bit.do/VVW6
✅ http://www.ncbi.nlm.nih.gov/geo/
🔶 301 http://pdb.org/
✅ http://www.pathguide.org/
✅ https://opensnp.org/
✅ https://pubchem.ncbi.nlm.nih.gov/
✅ http://smd.stanford.edu/
✅ http://www.pubgene.org/
✅ http://www.personalgenomes.org/
🔴 400 http://www.ebi.ac.uk/arrayexpress/
✅ http://www.ncbi.nlm.nih.gov/unigene
✅ http://hgdownload.soe.ucsc.edu/downloads.html
✅ https://weather.gc.ca/grib/index_e.html
✅ http://www.catalogueoflife.org/content/annual-checklist-archive
✅ http://www.bom.gov.au/climate/dwo/
✅ http://www.beringclimate.noaa.gov/
🔶 301 http://ncdc.noaa.gov/data-access/quick-links
🔶 302 https://wiki.earthdata.nasa.gov/display/GIBS
✅ http://www.ncdc.noaa.gov/data-access/model-data/model-datasets/numerical-weather-prediction
✅ http://sinda.crn2.inpe.br/PCD/SITE/novo/site/
✅ http://www.cru.uea.ac.uk/cru/data/temperature/#datter
✅ http://www.1000genomes.org/data
✅ http://data.worldbank.org/developers/climate-data-api
✅ http://nber.org/patents/
🔶 301 http://www.tutiempo.net/en/Climate
✅ http://www.wunderground.com/history/index.html
✅ https://kdl.cs.umass.edu/display/public/DBLP
✅ http://www.broadinstitute.org/cgi-bin/cancer/datasets.cgi
🔶 301 http://ogirardot.wordpress.com/2013/01/31/sharing-pypimaven-dependency-data/
✅ http://www.cru.uea.ac.uk/data
✅ http://www-personal.umich.edu/~mejn/netdata/
✅ http://snap.stanford.edu/data/
✅ https://archive.org/details/doi-urls
✅ http://math.nist.gov/~RPozo/complex_datasets.html
✅ http://vlado.fmf.uni-lj.si/pub/networks/data/bio/Yeast/Yeast.htm
✅ http://konect.uni-koblenz.de/
✅ http://www.cise.ufl.edu/research/sparse/matrices/
✅ http://www.eecs.wsu.edu/mgd/gdb.html
✅ http://www3.cs.stonybrook.edu/~algorith/implement/graphbase/implement.shtml
🔶 301 http://www.elsevier.com/online-tools/scopus
✅ http://law.di.unimi.it/datasets.php
🔶 301 http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset
✅ http://nexus.igraph.org/
✅ http://commoncrawl.org/the-data/get-started/
✅ http://lemurproject.org/clueweb09/
✅ http://lemurproject.org/clueweb12/
✅ http://www.caida.org/data/overview/
✅ http://www.bigdatanews.com/profiles/blogs/big-data-set-3-5-billion-web-pages-made-available-for-all-of-us
⚪ https://github.com/irecsys/CARSKit/tree/master/context-aware_data_sets
✅ http://www.caida.org/projects/network_telescope/
✅ http://networkdata.ics.uci.edu/resources.php
✅ http://students.depaul.edu/~yzheng8/DataSets.html#Data
✅ https://networkdata.ics.uci.edu/resources.php
🔶 301 http://labs.criteo.com/2015/03/criteo-releses-its-new-dataset/
🔶 302 https://console.developers.google.com/storage/openmobiledata_public/
✅ http://icwsm.cs.umbc.edu/
✅ http://www.chalearn.org/
🔶 301 http://www.kaggle.com/
😡 Error: Getting link https://www.kddcup2012.org/ hostname "www.kddcup2012.org" does not match the server certificate
✅ https://github.com/localytics/data-viz-challenge
🔶 302 https://www.crowdanalytix.com/datax
✅ http://www.netflixprize.com/leaderboard
🔶 301 http://crawdad.cs.dartmouth.edu/
✅ http://www.drivendata.org/
🔶 302 https://www.spaceappschallenge.org
✅ http://inforumweb.umd.edu/econdata/econdata.html
🔶 301 http://www.d4d.orange.com/en/home
🔶 302 http://www.aeaweb.org/RFE/toc.php?show=complete
🔶 302 http://ampds.org/
✅ http://www.yelp.com/dataset_challenge
⚪ http://combed.github.io/
✅ http://www.upcdatabase.com/
✅ http://nilm.cmubi.org/
⚪ http://hfed.github.io/
⚪ http://iawe.github.io/
✅ https://dandelion.eu/datamine/open-big-data/
✅ http://plaidplug.com/
✅ https://dataport.pecanstreet.org/
✅ http://redd.csail.mit.edu/
✅ https://www.google.com/finance
😡 Error: Getting link https://data.nasdaq.com/ SSL_connect SYSCALL returned=5 errno=0 state=SSLv2/v3 read server hello A
🔶 302 http://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0
✅ http://www.eia.gov/electricity/data/eia923/
✅ http://cfe.cboe.com/Data/
✅ http://fisher.osu.edu/fin/fdf/osudata.htm
✅ http://www.vs.inf.ethz.ch/res/show.html?what=eco-data
🔶 301 http://research.stlouisfed.org/fred2/
✅ http://www.oanda.com/
🔶 301 http://www.quandl.com/
✅ http://www.doc.ic.ac.uk/~dk3810/data/
⚪ http://cambridgegis.github.io/gisdata.html
🔶 302 http://www.volcano.si.edu
✅ http://earthquake.usgs.gov/earthquakes/search/
✅ http://finance.yahoo.com/
😡 No redirect found for http://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0
🔶 301 http://www.factual.com/
✅ https://aws.amazon.com/public-data-sets/landsat/
✅ http://www.geonames.org/
✅ https://github.com/foursquare/twofishes
✅ http://geodacenter.asu.edu/datalist/
✅ http://wiki.openstreetmap.org/wiki/Downloading_data
✅ http://efele.net/maps/tz/world/
✅ http://www.naturalearthdata.com/
✅ http://www.census.gov/geo/maps-data/data/tiger-line.html
✅ https://github.com/mledoze/countries
✅ https://github.com/umpirsky/country-list
✅ http://openaddresses.io/
✅ http://www.gadm.org/
✅ http://www.bodc.ac.uk/data/where_to_find_data/
✅ http://www.abs.gov.au/AUSSTATS/abs@.nsf/DetailsPage/3301.02009?OpenDocument
✅ http://sedac.ciesin.columbia.edu/data/sets/browse
✅ http://data.gov.be/nl/datasets
✅ https://data.austintexas.gov/
🔶 301 http://www.data.gc.ca/default.asp?lang=En&n=5BCD274E-1
✅ https://data.cambridgema.gov/
✅ https://data.cityofchicago.org/
✅ http://opendata.antwerpen.be/datasets
✅ http://data.denvergov.org//
😡 Error: Getting link http://www.fedstats.gov/cgi-bin/A2Z.cgi getaddrinfo: nodename nor servname provided, or not known
✅ https://www.dallasopendata.com/
✅ https://data.gov.au/
✅ https://www.data.gv.at/
✅ http://ec.europa.eu/eurostat/data/database
✅ http://lginform.local.gov.uk/
✅ https://opendurham.nc.gov/explore/
✅ https://www-genesis.destatis.de/genesis/online
🔶 301 http://www.guardian.co.uk/world-government-data
✅ https://my.pgp-hms.org/public_genetic_data
✅ http://dados.gov.br/dataset
🔶 301 http://data.glasgow.gov.uk/
✅ https://www.data.gouv.fr/en/datasets/
✅ https://www.opendata.fi/en
✅ http://data.ohouston.org
✅ https://data.stad.gent/datasets
🔶 302 http://www.data.gov.in
✅ http://data.go.id/
✅ http://betanyc.us/
✅ http://www.mass.gov/anf/research-and-tech/it-serv-and-support/application-serv/office-of-geographic-information-massgis/
🔶 301 http://nycplatform.socrata.com/
✅ http://www.stats.govt.nz/browse_for_stats.aspx
✅ https://data.lacity.org/
🔴 404 http://data.london.gov.uk/dataset
🔶 302 http://www.data.gov.in/
✅ https://data.overheid.nl/
✅ http://catalogo.datos.gob.mx/dataset
🔶 307 http://www.portlandoregon.gov/28130/
✅ https://data.ok.gov/
✅ https://data.oregon.gov/
✅ http://www.oecd.org/document/0
🔶 302 http://datasf.org/
✅ http://data.gov.ro/
✅ https://data.seattle.gov/
🔶 302 http://wdronline.worldbank.org/
🔶 301 http://data.gov.uk/data
🔶 301 http://www.data.gov.sg/
✅ http://www.census.gov/acs/www/data_documentation/data_release_info/
✅ https://data.texas.gov/
✅ http://www.census.gov/data.html
✅ https://data.pr.gov//
🔶 301 http://data.rio.rj.gov.br/
✅ http://www.opendata.admin.ch/
🔶 301 http://www.huduser.org/portal/datasets/pdrdatas.html
✅ http://www.cdc.gov/nchs/data_access/ftp_data.htm
✅ http://nces.ed.gov/
🔶 301 http://www.data.gov/metric
✅ http://catalog.data.gov/dataset
✅ http://beta2.statssa.gov.za/
✅ http://www.data.gov/open-gov/
✅ http://www.alex-singleton.com/r/2013/02/05/2011-census-open-atlas-project/
✅ http://data.vancouver.ca/datacatalogue/
✅ https://open.fda.gov/index.html
✅ http://data.un.org/
🔶 301 http://go.cms.gov/19xxPN4
✅ http://www.gapminder.org/data/
✅ http://www.ehdp.com/vitalnet/datasets.htm
😡 Error: Getting link http://137.189.35.203/WebUI/CatDatabase/catData.html Connection refused - connect(2)
🔶 301 http://www.cms.gov/medicare-coverage-database/
✅ http://wilmabainbridge.com/facememorability2.html
✅ https://catalogodatos.gub.uy/
✅ https://data.medicare.gov/
✅ http://vision.stanford.edu/aditya86/ImageNetDogs/
✅ https://www.nlm.nih.gov/mesh/filelist.html
✅ http://www.imageemotion.org/
✅ http://cvcl.mit.edu/MM/stimuli.html
✅ https://web.archive.org/web/20150520175645/http://137.189.35.203/WebUI/CatDatabase/catData.html
✅ http://www.image-net.org/
✅ http://www.robots.ox.ac.uk/~vgg/data/pets/
😡 Error: Getting link http://www.discogs.com/data/ wrong status line: "403 Forbidden"
✅ http://groups.csail.mit.edu/vision/SUN/hierarchy.html
✅ http://csea.phhp.ufl.edu/media/iapsmessage.html
✅ http://attributes.kyb.tuebingen.mpg.de/
✅ http://web.mit.edu/torralba/www/indoor.html
✅ http://www.cs.toronto.edu/~delve/data/datasets.html
✅ http://www.face-rec.org/databases/
✅ http://www.modelingonlineauctions.com/datasets
🔴 405 http://www.imdb.com/interfaces
✅ https://data.hdx.rwlabs.org/dataset/ebola-cases-2014
✅ http://sci2s.ugr.es/keel/datasets.php
✅ http://mldata.org/
✅ http://grouplens.org/datasets/movielens/
✅ https://github.com/cooperhewitt/collection
✅ https://www.lendingclub.com/info/download-data.action
🔴 404 http://www.analyticbridge.com/profiles/blogs/registered-meteorites-that-has-impacted-on-earth-visualized
✅ http://labrosa.ee.columbia.edu/millionsong/
✅ http://labrosa.ee.columbia.edu/millionsong/pages/additional-datasets
✅ http://www.rdatamining.com/data
✅ https://github.com/artsmia/collection
✅ https://github.com/tategallery/collection
✅ http://webscope.sandbox.yahoo.com/catalog.php?datatype=r
✅ http://vocab.getty.edu
✅ http://missionlocal.org/san-francisco-restaurant-health-inspections/
🔶 301 http://aws.amazon.com/datasets/8172056142375670
✅ http://www.isi.edu/~lerman/downloads/flickr/flickr_taxonomies.html
✅ http://lemurproject.org/clueweb12/FACC1/
✅ http://lemurproject.org/clueweb09/FACC1/
✅ http://data.nhm.ac.uk/
✅ http://www.isi.edu/natural-language/download/hansard/
✅ https://www.rijksmuseum.nl/en/api
✅ https://github.com/ParallelMazen/SaudiNewsNet
✅ http://statmt.org/wmt11/translation-task.html#download
✅ https://www.wikidata.org/wiki/Wikidata:Database_download
✅ http://archive.ics.uci.edu/ml/
✅ http://www.psych.ualberta.ca/~westburylab/downloads/usenetcorpus.download.html
✅ https://catalog.ldc.upenn.edu/LDC2006T13
✅ http://nssdc.gsfc.nasa.gov/nssdc/obtaining_data.html
✅ http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/
✅ http://exoplanetarchive.ipac.caltech.edu/
✅ http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm
🔶 301 http://aws.amazon.com/datasets
✅ https://code.google.com/p/wiki-links/downloads/list
✅ http://www.gutenberg.org/wiki/Gutenberg:Offline_Catalogs
✅ http://wordnet.princeton.edu/wordnet/download/
✅ http://opendata.cern.ch/
✅ http://www.google.com/publicdata/directory
✅ http://lib.stat.cmu.edu/datasets/
✅ http://lib.stat.cmu.edu/jasadata/
✅ http://www.sdss.org/
⚪ 503 http://datamob.org/datasets
🔶 301 http://www.reddit.com/r/datasets
✅ http://www.kdnuggets.com/datasets/index.html
🔴 404 http://datamarket.azure.com/browse/data?price=free
✅ http://www.infochimps.com/
✅ http://www.stats4stem.org/data-sets.html
🔶 301 http://www.revolutionanalytics.com/subscriptions/datasets/
😡 Error: Getting link http://numbrary.com/ getaddrinfo: nodename nor servname provided, or not known
✅ http://www.washingtonpost.com/wp-srv/metro/data/datapost.html
✅ http://stat.ethz.ch/R-manual/R-patched/library/datasets/html/00Index.html
✅ http://www.data360.org/index.aspx
🔶 302 http://webscope.sandbox.yahoo.com/catalog.php
✅ http://academictorrents.com/
✅ http://wiki.stat.ucla.edu/socr/index.php/SOCR_Data
🔶 302 http://datahub.io/dataset
✅ http://www.nuforc.org/webreports.html
🔶 302 http://911.wikileaks.org/files/index.html
✅ https://archive.org/details/datasets
✅ https://www.archive-it.org/explore?show=Collections
✅ http://www.statsci.org/datasets.html
🔶 302 http://thedata.harvard.edu/dvn/
✅ http://www.statista.com/
✅ http://www.icpsr.umich.edu/icpsrweb/ICPSR/index.jsp
✅ http://www.cs.tau.ac.il/~wolf/ytfaces/
🔶 302 http://www.archive.org/details/twitter_cikm_2010
🔶 302 http://www.archive.org/details/2011-05-calufa-twitter-sql
🔶 302 https://certificates.theodi.org/datasets
✅ http://waxy.org/random/misc/gamergate_tweets.csv
✅ http://snap.stanford.edu/data/higgs-twitter.html
✅ http://snap.stanford.edu/data/egonets-Twitter.html
✅ http://help.sentiment140.com/for-students/
✅ http://wiki.dbpedia.org/Datasets
✅ http://www.cs.cmu.edu/~enron/
🔶 301 https://aws.amazon.com/datasets/917205
✅ http://www.cs.cmu.edu/~jelsas/data/ancestry.com/
⚪ https://github.com/emorisse/FBI-Hate-Crime-Statistics/tree/master/2013
✅ https://datamarket.com/data/list/?q=all
🔴 404 http://www.public.asu.edu/~hgao16/dataset.html
⚪ http://bit.ly/1aL8XS0
✅ http://www3.norc.org/GSS+Website/
⚪ 503 http://www.freebase.com/
✅ https://kdl.cs.umass.edu/display/public/Mobile+Social+Networks
🔶 301 http://www.githubarchive.org/
✅ http://law.di.unimi.it/datasets.php
✅ https://archive.org/details/201309_foursquare_dataset_umn
⚪ https://github.com/caesar0301/awesome-public-datasets/tree/master/Datasets
✅ http://realitycommons.media.mit.edu/realitymining.html
✅ http://www.tdcj.state.tx.us/death_row/dr_executed_offenders.html
✅ https://archive.org/details/oxford-2005-facebook-matrix
✅ http://data.stackexchange.com/help
✅ http://an.kaist.ac.kr/traces/WWW2010.html
✅ http://dataarchives.ss.ucla.edu/Home.DataPortals.htm
✅ http://ucdata.berkeley.edu/
✅ http://www.pewinternet.org/datasets/pages/2/
✅ http://law.di.unimi.it/datasets.php
🔶 301 http://www.nd.edu/~oss/Data/data.html
✅ http://webscope.sandbox.yahoo.com/catalog.php?datatype=g
✅ http://www3.cs.stonybrook.edu/~leman/data/gscholar.db
✅ http://www3.cs.stonybrook.edu/~leman/data/14-icwsm-political-polarity-data.zip
✅ http://univ.cc/
✅ https://www.reddit.com/r/datasets/comments/3bxlg7/i_have_every_publicly_available_reddit_comment/
✅ http://cricsheet.org/
✅ http://netsg.cs.sfu.ca/youtubedata/
✅ http://gdeltproject.org/data.html
✅ https://github.com/quankiquanki/skytrax-reviews-dataset
✅ http://www.retrosheet.org/game.htm
✅ http://data.betfair.com/
🔴 404 http://www.upjohn.org/erdc/erdc.html
✅ http://www.jokecamp.com/blog/guide-to-football-and-soccer-data-and-apis/
✅ https://www.backblaze.com/hard-drive-test-data.html
✅ http://stat-computing.org/dataexpo/2009/the-data.html
✅ http://ergast.com/mrd/db
✅ http://www.cs.ucr.edu/~eamonn/time_series_data/
✅ http://www.seanlahman.com/baseball-archive/statistics/
✅ http://hubwaydatachallenge.org/trip-history-data/
✅ https://github.com/BetaNYC/Bike-Share-Data-Best-Practices/wiki/Bike-Share-Data-Systems
✅ http://research.microsoft.com/en-us/downloads/b16d359d-d164-469e-9fd4-daa38f2b2e13/
✅ https://datamarket.com/data/list/?q=provider:tsdl
🔶 302 http://www.bayareabikeshare.com/datachallenge
✅ http://openflights.org/data.html
✅ http://ecg.mit.edu/time-series/
✅ http://www.transtats.bts.gov/Tables.asp?DB_ID=120
✅ http://www.transtats.bts.gov/DataIndex.asp
✅ http://www.planecrashinfo.com/database.htm
✅ http://academictorrents.com/details/a2ccf94bbb4af222bf8e69dad60a68a29f310d9a
🔶 301 https://www.marinetraffic.com/de/p/api-services
✅ http://www.nyc.gov/html/tlc/html/about/trip_record_data.shtml
✅ http://ops.fhwa.dot.gov/freight/freight_analysis/faf/index.htm
🔶 302 http://www.tfl.gov.uk/info-for/open-data-users/our-feeds
✅ http://www.cmap.illinois.gov/data/transportation/travel-tracker-survey
✅ https://github.com/fivethirtyeight/uber-tlc-foil-response
✅ http://www.inside-r.org/howto/finding-data-internet
🔴 404 http://www.datawrangling.com/some-datasets-available-on-the-web
🔶 302 http://opendatanetwork.com
✅ http://www.rita.dot.gov/bts/
🔴 403 http://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
🔶 302 http://opendatamonitor.eu
✅ http://xiaming.me/posts/2014/10/23/leveraging-open-data-to-understand-urban-lives/
🔶 301 http://rs.io/2014/05/29/list-of-data-sets.html
✅ https://archive.org/details/nycTaxiTripData2013
✅ https://zenodo.org/collection/datasets
😡 Error: Getting link http://www.cmr.osu.edu/browse/datasets Operation timed out - connect(2)

📋 frankenstein results: 77 issues (21%)
(77 of 355 links
["🔶 301 http://www.infobiotic.net/PSPbenchmarks/", "🔴 404 http://bit.do/VVW6", "🔶 301 http://pdb.org/", "🔴 400 http://www.ebi.ac.uk/arrayexpress/", "🔶 301 http://ncdc.noaa.gov/data-access/quick-links", "🔶 302 https://wiki.earthdata.nasa.gov/display/GIBS", "🔶 301 http://www.tutiempo.net/en/Climate", "🔶 301 http://ogirardot.wordpress.com/2013/01/31/sharing-pypimaven-dependency-data/", "🔶 301 http://www.elsevier.com/online-tools/scopus", "🔶 301 http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset", "🔶 301 http://labs.criteo.com/2015/03/criteo-releses-its-new-dataset/", "🔶 302 https://console.developers.google.com/storage/openmobiledata_public/", "🔶 301 http://www.kaggle.com/", "🔴 hostname \"www.kddcup2012.org\" does not match the server certificate https://www.kddcup2012.org/", "🔶 302 https://www.crowdanalytix.com/datax", "🔶 301 http://crawdad.cs.dartmouth.edu/", "🔶 302 https://www.spaceappschallenge.org", "🔶 302 http://www.aeaweb.org/RFE/toc.php?show=complete", "🔶 301 http://www.d4d.orange.com/en/home", "🔶 302 http://ampds.org/", "🔴 SSL_connect SYSCALL returned=5 errno=0 state=SSLv2/v3 read server hello A https://data.nasdaq.com/", "🔶 302 http://www.google.com/trends?q=google&ctab=0&geo=all&date=all&sort=0", "🔶 301 http://research.stlouisfed.org/fred2/", "🔶 301 http://www.quandl.com/", "🔶 302 http://www.volcano.si.edu", "🔶 301 http://www.factual.com/", "🔶 301 http://www.data.gc.ca/default.asp?lang=En&n=5BCD274E-1", "🔴 getaddrinfo: nodename nor servname provided, or not known http://www.fedstats.gov/cgi-bin/A2Z.cgi", "🔶 301 http://www.guardian.co.uk/world-government-data", "🔶 301 http://data.glasgow.gov.uk/", "🔶 302 http://www.data.gov.in", "🔶 301 http://nycplatform.socrata.com/", "🔴 404 http://data.london.gov.uk/dataset", "🔶 302 http://www.data.gov.in/", "🔶 307 http://www.portlandoregon.gov/28130/", "🔶 302 http://datasf.org/", "🔶 302 http://wdronline.worldbank.org/", "🔶 301 http://data.gov.uk/data", "🔶 301 http://www.data.gov.sg/", "🔶 301 http://data.rio.rj.gov.br/", "🔶 301 http://www.huduser.org/portal/datasets/pdrdatas.html", "🔶 301 http://www.data.gov/metric", "🔶 301 http://go.cms.gov/19xxPN4", "🔴 Connection refused - connect(2) http://137.189.35.203/WebUI/CatDatabase/catData.html", "🔶 301 http://www.cms.gov/medicare-coverage-database/", "🔴 wrong status line: \"403 Forbidden\" http://www.discogs.com/data/", "🔴 405 http://www.imdb.com/interfaces", "🔴 404 http://www.analyticbridge.com/profiles/blogs/registered-meteorites-that-has-impacted-on-earth-visualized", "🔶 301 http://aws.amazon.com/datasets/8172056142375670", "🔶 301 http://aws.amazon.com/datasets", "⚪ 503 http://datamob.org/datasets", "🔶 301 http://www.reddit.com/r/datasets", "🔴 404 http://datamarket.azure.com/browse/data?price=free", "🔶 301 http://www.revolutionanalytics.com/subscriptions/datasets/", "🔴 getaddrinfo: nodename nor servname provided, or not known http://numbrary.com/", "🔶 302 http://webscope.sandbox.yahoo.com/catalog.php", "🔶 302 http://datahub.io/dataset", "🔶 302 http://911.wikileaks.org/files/index.html", "🔶 302 http://thedata.harvard.edu/dvn/", "🔶 302 http://www.archive.org/details/twitter_cikm_2010", "🔶 302 http://www.archive.org/details/2011-05-calufa-twitter-sql", "🔶 302 https://certificates.theodi.org/datasets", "🔶 301 https://aws.amazon.com/datasets/917205", "🔴 404 http://www.public.asu.edu/~hgao16/dataset.html", "⚪ 503 http://www.freebase.com/", "🔶 301 http://www.githubarchive.org/", "🔶 301 http://www.nd.edu/~oss/Data/data.html", "🔴 404 http://www.upjohn.org/erdc/erdc.html", "🔶 302 http://www.bayareabikeshare.com/datachallenge", "🔶 301 https://www.marinetraffic.com/de/p/api-services", "🔶 302 http://www.tfl.gov.uk/info-for/open-data-users/our-feeds", "🔴 404 http://www.datawrangling.com/some-datasets-available-on-the-web", "🔶 302 http://opendatanetwork.com", "🔴 403 http://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public", "🔶 302 http://opendatamonitor.eu", "🔶 301 http://rs.io/2014/05/29/list-of-data-sets.html", "🔴 Operation timed out - connect(2) http://www.cmr.osu.edu/browse/datasets"]

2 misc items

🔶 57 redirects
http://pdb.org/ 5 redirects to
http://www.rcsb.org/
http://www.infobiotic.net/PSPbenchmarks/ 4 redirects to
http://ico2s.org/datasets/psp_benchmark.html
http://ncdc.noaa.gov/data-access/quick-links 4 redirects to
http://www.ncdc.noaa.gov/data-access/quick-links
https://wiki.earthdata.nasa.gov/display/GIBS 38 redirects to
https://wiki.earthdata.nasa.gov/display/GIBS/Global+Imagery+Browse+Services+-+GIBS
http://ogirardot.wordpress.com/2013/01/31/sharing-pypimaven-dependency-data/ 1 redirects to
https://ogirardot.wordpress.com/2013/01/31/sharing-pypimaven-dependency-data/
http://www.tutiempo.net/en/Climate -4 redirects to
http://en.tutiempo.net/climate
http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset 1 redirects to
http://cnets.indiana.edu/groups/nan/webtraffic/click-dataset/
http://www.elsevier.com/online-tools/scopus -2 redirects to
https://www.elsevier.com/solutions/scopus
https://console.developers.google.com/storage/openmobiledata_public/ 70 redirects to
https://accounts.google.com/ServiceLogin?osid=1&passive=true&continue=https://console.developers.google.com/storage/openmobiledata_public/
http://labs.criteo.com/2015/03/criteo-releses-its-new-dataset/ 1 redirects to
http://labs.criteo.com/2015/03/criteo-releases-its-new-dataset/
http://www.kaggle.com/ 1 redirects to
https://www.kaggle.com/
http://crawdad.cs.dartmouth.edu/ 1 redirects to
https://crawdad.cs.dartmouth.edu/
http://ampds.org/ 0 redirects to
http://ampds.org/
https://www.crowdanalytix.com/datax -6 redirects to
http://data.crowdanalytix.com
http://www.aeaweb.org/RFE/toc.php?show=complete 1 redirects to
https://www.aeaweb.org/RFE/toc.php?show=complete
https://www.spaceappschallenge.org 1 redirects to
https://2015.spaceappschallenge.org
http://research.stlouisfed.org/fred2/ 1 redirects to
https://research.stlouisfed.org/fred2/
http://www.quandl.com/ 1 redirects to
https://www.quandl.com/
http://www.d4d.orange.com/en/home 11 redirects to
http://www.d4d.orange.com/en/Accueil/home-V2
http://www.factual.com/ 1 redirects to
https://www.factual.com/
http://www.volcano.si.edu -3 redirects to
http://volcano.si.edu/
http://www.data.gc.ca/default.asp?lang=En&n=5BCD274E-1 -9 redirects to
http://open.canada.ca/en?lang=En&n=5BCD274E-1
http://www.guardian.co.uk/world-government-data 1 redirects to
http://www.theguardian.com/world-government-data
http://nycplatform.socrata.com/ 1 redirects to
https://nycplatform.socrata.com/
http://data.glasgow.gov.uk/ 1 redirects to
https://data.glasgow.gov.uk/
http://datasf.org/ 0 redirects to
http://datasf.org/
http://www.portlandoregon.gov/28130/ 0 redirects to
http://www.portlandonline.com/28130/
http://www.data.gov.in -2 redirects to
https://data.gov.in/
http://www.data.gov.in/ -3 redirects to
https://data.gov.in/
http://www.data.gov/metric 1 redirects to
http://www.data.gov/metrics
http://www.huduser.org/portal/datasets/pdrdatas.html 0 redirects to
http://www.huduser.gov/portal/datasets/pdrdatas.html
http://wdronline.worldbank.org/ 22 redirects to
https://openknowledge.worldbank.org/handle/10986/2124
http://data.rio.rj.gov.br/ -9 redirects to
http://data.rio//
http://data.gov.uk/data 8 redirects to
https://data.gov.uk/data/search
http://www.cms.gov/medicare-coverage-database/ 1 redirects to
https://www.cms.gov/medicare-coverage-database/
http://go.cms.gov/19xxPN4 100 redirects to
https://www.cms.gov/Research-Statistics-Data-and-Systems/Statistics-Trends-and-Reports/BSAPUFS/Downloads/2010_Carrier_PUF.zip
http://www.data.gov.sg/ -3 redirects to
https://data.gov.sg/
http://aws.amazon.com/datasets/8172056142375670 5 redirects to
https://aws.amazon.com/datasets/google-books-ngrams/
http://aws.amazon.com/datasets 1 redirects to
http://aws.amazon.com/datasets/
http://www.reddit.com/r/datasets 1 redirects to
https://www.reddit.com/r/datasets
http://www.revolutionanalytics.com/subscriptions/datasets/ -9 redirects to
http://packages.revolutionanalytics.com/datasets/
http://webscope.sandbox.yahoo.com/catalog.php -12 redirects to
http://webscope.sandbox.yahoo.com
http://datahub.io/dataset 1 redirects to
https://datahub.io/dataset
http://www.archive.org/details/2011-05-calufa-twitter-sql -4 redirects to
http://archive.org/details/2011-05-calufa-twitter-sql
http://911.wikileaks.org/files/index.html 1 redirects to
https://911.wikileaks.org/files/index.html
https://aws.amazon.com/datasets/917205 11 redirects to
https://aws.amazon.com/datasets/enron-email-data/
http://www.archive.org/details/twitter_cikm_2010 -3 redirects to
https://archive.org/details/twitter_cikm_2010
http://www.githubarchive.org/ 1 redirects to
https://www.githubarchive.org/
http://thedata.harvard.edu/dvn/ -1 redirects to
https://dataverse.harvard.edu/
http://www.nd.edu/~oss/Data/data.html 1 redirects to
http://www3.nd.edu/~oss/Data/data.html
http://www.bayareabikeshare.com/datachallenge -4 redirects to
http://www.bayareabikeshare.com/open-data
https://certificates.theodi.org/datasets 3 redirects to
https://certificates.theodi.org/en/datasets
https://www.marinetraffic.com/de/p/api-services 1 redirects to
http://www.marinetraffic.com/de/ais-api-services
http://opendatanetwork.com 5 redirects to
http://www.opendatanetwork.com/
http://www.tfl.gov.uk/info-for/open-data-users/our-feeds -3 redirects to
https://tfl.gov.uk/info-for/open-data-users/our-feeds
http://opendatamonitor.eu 43 redirects to
http://opendatamonitor.eu/frontend/web/index.php?r=dashboard%2Findex
http://rs.io/2014/05/29/list-of-data-sets.html 8 redirects to
http://rs.io/100-interesting-data-sets-for-statistics/

🕐 Time elapsed: 1 minute48s
🔴 17 failures for caesar0301/awesome-public-datasets
Created with https://github.com/dkhamsing/frankenstein Nov 20, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment