Skip to content

Instantly share code, notes, and snippets.

Embed
What would you like to do?
Distinkte GSW zu Notation 37 mit Anzahl der Vorkommnisse
$ curl --header "Accept-Encoding: gzip" "http://lobid.org/resources/search?q=coverage:37&format=bulk" | gzip -dc > n37-strings.jsonl
$ cat n37-strings.jsonl | jq .coverage[] | sort | uniq -c | sort -n -r > n37-strings.txt
Abschließend per RegEx um Nicht-37-Notationen bereinigt.
3 "Wormbach <Dekanat> | 37"
3 "Dekanat Olpe | 37"
2 "Dekanat Siegen | 37"
2 "Dekanat Rheine | 37"
2 "Dekanat Ahlen, Warendorf | 37"
1 "Dekanat Werl | 37"
1 "Dekanat Wattenscheid | 37"
1 "Dekanat Warendorf | 37"
1 "Dekanat Vechta | 37"
1 "Dekanat Sundern | 37"
1 "Dekanat Steinheim <Höxter> | 37"
1 "Dekanat Steinfurt <Westfalen> | 37"
1 "Dekanat Recklinghausen | 37"
1 "Dekanat Meschede | 37"
1 "Dekanat Lippe | 37"
1 "Dekanat Iserlohn | 37"
1 "Dekanat Dortmund | 37"
1 "Dekanat Borken | 37"
1 "Dekanat Bigge-Medebach | 37"
1 "Dekanat Beckum, Warendorf | 37"
1 "Dekanat Attendorn | 37"
1 "Dekanat Ahlen | 37"
1 "Dekanat Ahaus | 37"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.