wtf_wikipedia is a wonderful tool for extracting structured data from Wikipedia pages. One of the main ways I use it is to extract information from politicians' infoboxes about the positions they've held, to compare this with what Wikidata knows.
To make processing these a lot simpler, I've often wished that the JSON returned from wtf_wikipedia could be augmented with the Wikidata IDs for any linked item. So, for example, when getting officeholder data for Kaja Kallas, instead of
"office": {
"text": "19th Prime Minister of Estonia",
"links": [
{
"type": "internal",