Generated with the following steps (thanks Jakob, https://twitter.com/nichtich/status/1410526898694807552):
$ curl https://bartoc.org/data/dumps/latest.ndjson | jq -r .FORMAT[].uri > bartoc-formats.txt
$ cat bartoc-formats.txt | sort | uniq -c | sort -nr
- Remove http://bartoc.org/en/Format/ (for better readability)
- Make it a Markdown table
Count | Format |
---|---|
1935 | Online |
898 | |
846 | RDF |
511 | Printed |
499 | SKOS |
473 | XML |
282 | Spreadsheet |
182 | HTML |
161 | CSV |
138 | JSON |
111 | MADS |
105 | OWL |
89 | Zthes |
84 | XTM |
82 | DC |
82 | BS8723-5 |
74 | JSON-LD |
69 | VDEX |
53 | Microform |
51 | Word |
41 | TXT |
35 | CD-ROM |
32 | XSD |
29 | OBO |
26 | MARC |
23 | Floppy-Disc |
17 | Database |
10 | EPUB |
9 | Geodata |
4 | JSKOS |
3 | ClaML |
This is not working anymore.
Some entries do not contain the
FORMAT
key. This breaks the query.I had to change to
curl https://bartoc.org/data/dumps/latest.ndjson | jq '(.FORMAT[].uri)?' > bartoc-formats.txt
To see what lines do not contain the
FORMAT
key you can use:curl https://bartoc.org/data/dumps/latest.ndjson | jq -n 'inputs | select(has("FORMAT") | not) | input_line_number'