@edsu
Created September 11, 2010 02:28
# Hosts that serve up SKOS in the Billion Triple Challenge dataset:
#
# http://challenge.semanticweb.org/
#
# Results are ordered by the number of SKOS triples from the host, and were calculated with the
# following command:
#
# zgrep 'http://www.w3.org/2004/02/skos/core' btc-2010-chunk-*.gz | quadtabs.pl | cut -d "<ctrl-v><tab>" -f 4 | sort | uniq -c | sort -rn
#
# where quadtabs.pl = http://gist.github.com/574679
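#
# For readers without quadtabs.pl at hand, the counting step can be approximated in
# Python. This is only a rough sketch, not the script used for the numbers below. It
# assumes the chunks are N-Quads files in which the fourth term (the context) is a
# URI in angle brackets, and that the host of that URI is what gets counted. The
# function names (context_host, count_skos_hosts, count_files) are made up for
# illustration.

```python
import glob
import gzip
import re
from collections import Counter
from urllib.parse import urlparse

SKOS_NS = "http://www.w3.org/2004/02/skos/core"

# Assumption: the context is the last <...> term before the closing " ." of a quad.
CONTEXT_RE = re.compile(r"<([^<>\s]+)>\s*\.\s*$")


def context_host(line):
    """Return the host (including any port) of the quad's context URI, or None."""
    match = CONTEXT_RE.search(line)
    if match is None:
        return None
    return urlparse(match.group(1)).netloc


def count_skos_hosts(lines):
    """Count, per context host, the lines that mention the SKOS namespace."""
    counts = Counter()
    for line in lines:
        if SKOS_NS not in line:
            continue  # same filter as the zgrep step
        host = context_host(line)
        if host is not None:
            counts[host] += 1
    return counts


def count_files(paths):
    """Sum the per-host counts over a list of gzipped chunk files."""
    total = Counter()
    for path in paths:
        with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
            total.update(count_skos_hosts(handle))
    return total


if __name__ == "__main__":
    chunks = sorted(glob.glob("btc-2010-chunk-*.gz"))
    # most_common() orders by count, highest first, like `sort -rn`.
    for host, n in count_files(chunks).most_common():
        print(n, host)
```

# Like the zgrep step, the sketch matches the namespace anywhere in the line, so a quad
# is counted whether SKOS appears in the subject, predicate, or object position.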
669536 dbpedia.org
499715 www.uniprot.org
325000 www4.wiwiss.fu-berlin.de
244291 thesauri.cs.vu.nl
162946 wiki.kiwi-project.eu:8060
137766 lod.geospecies.org
105922 www.ukat.org.uk
99695 psh.ntkcz.cz
87325 cdsware.cern.ch
80012 yago.zitgist.com
50083 education.data.gov.uk
48468 www.w3.org
46101 data.nytimes.com
40376 id.loc.gov
36798 volute.googlecode.com
25467 www.uni-sw.gwdg.de
19868 ontology.neuinfo.org
18043 cain.ice.ucdavis.edu
17687 www.eionet.europa.eu
16656 ccdb.ucsd.edu
11512 isegserv.itd.rl.ac.uk
9379 umbel.org
9187 amc-app2.amc.sara.nl
9107 www.ottevanger.plus.com
8108 www.cs.man.ac.uk
7721 ontologi.es
7604 data-gov.tw.rpi.edu
7510 www.ivoa.net
6247 transport.data.gov.uk
5540 www.astro.physik.uni-goettingen.de
5485 staff.oclc.org
4379 marinemetadata.org
4081 www.yamaguti.comp.ae.keio.ac.jp
3834 dublincore.org
3238 neuroscientific.net
3205 vocab.org
2700 welkin.googlecode.com
2693 data.linkedmdb.org
2321 svnmirror.osgeo.org
2115 assets.geospecies.org
2021 schemas.library.nhs.uk
1968 www.ivan-herman.net
1946 wwwis.win.tue.nl
1891 www.astro.gla.ac.uk
1878 my.opera.com
1750 entrezneuron.googlecode.com
1679 lcweb2.loc.gov
1580 aaronland.info
1577 www.sembase.at
1474 www.berkeleybop.org
1401 metadataregistry.org
1204 www.dur.ac.uk
1191 www.tlrp.org
1060 lists.w3.org
1044 www.biologeek.com
1004 www.neuroscientific.net
986 wiki.sembase.at
982 rdf.geospecies.org
964 tobyinkster.co.uk
833 wiktolog.com
831 rdvocab.info
813 revyu.com
768 kiss.salzburgresearch.at
722 archvocab.net
584 server1.coloradostoutenburg.com
576 www.articlestoreprint.net:8080
497 foafbuilder.qdos.com
492 sandbox.metadataregistry.org
439 demo.openlinksw.com
428 ivoa.net
377 4uing.com
321 myopenlink.net
312 productdb.org
306 statistics.data.gov.uk
290 social.semantic-web.at
286 www.kanzaki.com
280 www-sop.inria.fr
270 www.w3c.rl.ac.uk
256 ontowiki.googlecode.com
249 www.mindswap.org
240 miskinhill.com.au
200 www.semanlink.net
180 www.abeservices.com.au
173 registre.docutheque.com
159 www.wasab.dk
153 eculture.cs.vu.nl:48080
123 vocabularyserver.com
123 swui.semanticweb.org
117 research.data.gov.uk
115 www.openlinksw.com
93 www.snee.com
86 www.loa-cnr.it
85 rdf.taxonconcept.org
83 www.cambridgesemantics.com
79 lod.taxonconcept.org
78 www.eswc2006.org
75 brg.ldeo.columbia.edu
66 www.mondeca.com
59 www.lsrn.org
54 www.game.cat
49 redined.r020.com.ar
49 lod.openlinksw.com
42 www.yso.fi
39 business.data.gov.uk
37 bielenberg.info
36 rdf.freebase.com
31 saei.org
31 lcsubjects.org
26 sw.opencyc.org
26 schemas.talis.com
26 dbpedia2.openlinksw.com:8895
25 www.heppnetz.de
25 uriplay.org
24 heml.mta.ca
23 www.spraci.com
23 aimlab.cs.uoregon.edu
22 n2.talis.com
22 bruce.darcus.name
19 oecd.dataincubator.org
19 discobits.org
17 www.dublincore.org
16 www.johngoodwin.me.uk
15 www.xml.com
15 labs.systemone.at
15 195.251.218.37:2020
13 zbw.eu
13 torrez.us
13 people.geospecies.org
13 dev.torrez.us
12 riese.joanneum.at
11 rdfohloh.wikier.org
10 libris.kb.se
10 groupme.org
9 virtuoso.openlinksw.com
8 buzzword.org.uk
7 www.taxonconcept.org
7 tuuli.info
7 jelenajovanovic.net
7 2007.zbw.eu
6 www.marcont.org
6 www.jenitennison.com
6 www.few.vu.nl
6 www.aaronland.info
6 tagont.googlecode.com
6 rdf.myexperiment.org
6 data.ordnancesurvey.co.uk
5 lccn.heroku.com
5 jena.cvs.sourceforge.net
4 www.schemaweb.info
4 www.patrickgmj.net
4 www.holygoat.co.uk
4 www.groupme.net
4 out.l3s.uni-hannover.de:8080
3 www.twine.com
3 planetrdf.com
3
2 www.uni-koblenz.de
2 www.soton.ac.uk
2 www.eclipse.org
2 www.csd.abdn.ac.uk
2 www.abes.fr
2 sw.deri.org
2 swaml.berlios.de
2 r.hatena.ne.jp
2 panesofglass.org
2 musicontology.com
2 moustaki.org
2 motools.sourceforge.net
2 entitydescriber.googlecode.com
2 berkeleybop.neurocommons.org
2 alexandre.alapetite.fr
1 www.bbc.co.uk
1 sioc-project.org
1 nedko.arnaudov.name
1 linkeddata.uriburner.com
1 idi.fundacionctic.org
1 dev.isb-sib.ch
1 commontag.org
1 blog.semantic-web.at