- Länk event
- Länk forum.civictech.se
- Nedan chatten från eventet - inspelningen var visst bara live....
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#Naturkartan check | |
# https://gist.github.com/salgo60/eae5986297ad88a801549d0a37731817 | |
# | |
from datetime import datetime | |
import urllib3 | |
import sys | |
from SPARQLWrapper import SPARQLWrapper, JSON | |
endpoint_url = "https://query.wikidata.org/sparql" | |
http = urllib3.PoolManager() |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# See question RAÄ FB https://www.facebook.com/riksantikvarieambetet/posts/10158191730201970 | |
# pip install sparqlwrapper | |
# https://rdflib.github.io/sparqlwrapper/ | |
# this https://gist.github.com/salgo60/49c52e1f7009f0ef318e9fadd94addc5 | |
# old https://gist.github.com/salgo60/a4ebde4f0a279d5f9479aeaf7b846403 | |
# | |
# "Problemet med Persistenta identifierare och hävda att http status koder fungerar" | |
# https://github.com/salgo60/SamlaLibris/issues/12 | |
# | |
# Försök med bättre felsida https://github.com/riksantikvarieambetet/ksamsok/pull/1#issuecomment-752963121 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Q5792855 Hasselrot i Stockholm senare Lundegård 2:278 4:231 | |
Q5914280 af Klint i Linköping senare Helsingborg 2:85 4:243 | |
Q6218850 af Ugglas i Linköping senare Stockholm 1:183 2:114 | |
Q1347731 Ahlmark i Enskede senare Stockholm 1:49 4:469 | |
Q5544228 Ahlsten i Alva senare Hemse 2:331 4:469 | |
Q6255333 Åkerman i Strömstad senare Halmstad 1:404 4:177 | |
Q5547372 Almgren i Stockholm 4:199 5:399 | |
Q5547719 Almström i Stockholm 1:50 5:119 | |
Q5553501 Anderson i Stockholm 2:138 5:201 | |
Q953952 Andersson i Göteborg senare Riksby, Bromma, Mariehäll o Sundbyberg 1:360 4:58 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# Check linkroot | |
# See question RAÄ FB https://www.facebook.com/riksantikvarieambetet/posts/10158191730201970 | |
# pip install sparqlwrapper | |
# https://rdflib.github.io/sparqlwrapper/ | |
# https://gist.github.com/salgo60/a4ebde4f0a279d5f9479aeaf7b846403 | |
from datetime import datetime | |
import urllib3 | |
import sys | |
from SPARQLWrapper import SPARQLWrapper, JSON |
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Wikidata | NamniData | AntalRemisser | |
---|---|---|---|
Q7654603 | Sveriges advokatsamfund | 124 | |
Q1474680 | Svenskt Näringsliv | 118 | |
Q385435 | Sveriges akademikers centralorganisation | 115 | |
Q338636 | Landsorganisationen i Sverige | 86 | |
Q10501287 | Företagarna | 80 | |
Q10554096 | Lantbrukarnas riksförbund | 73 | |
Q10494305 | Fastighetsägarna | 57 | |
Q10670356 | Skogsindustrierna | 54 | |
Q121316105 | Näringslivets Regelnämnd | 52 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Vi samlar in och dokumenterar fotbollsspelare och Riksdagsmän på WIkipedia och önskar bild grav och gärna koordinat | |
URL,,,,,,, | |
Se mer info | |
* Fotbollsspelare https://github.com/salgo60/ifkdb/issues/9 | |
* Riksdagsmän https://github.com/salgo60/Wikidata_riksdagen-corpus/issues/155 | |
Tack |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Tackar många tankar finns och som Lars som byggt upp Runeberg sa på Sv:Wikipedia det här är ju en perfekt pusselbit att se om det som museer, Kungliga biblioteket, RAÄ, Riksarkivet dokumenterar Sverige är komplett.. vad missar dom… | |
* en intressant väg framåt är att kopiera Wikidata konceptet och skapa en community driven kunskapsgraf gärna med APIer så man kan skriva botar…. | |
Jag har börjat titta på dels hur olika samlingar överlappar varandra och det känns som vi idag skapar mer <b>digitala silos</b> än jobbar ihop och ger varandra mervärde. Extremt intressant är Humlabs TEI projekt där man nu har ”samma som” Wikidata för gubbar och Wikidata men där hoppas jag dom går vidare med | |
* platser | |
* händelser | |
* ord jmf lexeme projektet i Wikidata där alla världens ord med alla deras böjningsformer dokumenteras https://ordia.toolforge.org/language/ | |
Exempel vad jag kollat | |
* författare i Litteraturbanken och om dom finns i Wikidata och i LibrisXL där jag tycker det borde vara 1-1 mellan KB och Litteraturbanken men känn |
Can you explain the problems with LLM and "Coverage gap on long tail"
Sure, the terms you are using seem to refer to Large Language Models (LLM) and a specific problem related to their training and performance, known as "Coverage gap on long tail".
Large Language Models like GPT-4 are trained on a vast range of internet text. But despite their training on diverse data, they may still have some gaps in their knowledge. This could be due to several reasons, but one notable reason relates to the problem of the "coverage gap on long tail".