Skip to content

Instantly share code, notes, and snippets.

View mromanello's full-sized avatar

Matteo Romanello mromanello

View GitHub Profile
@mromanello
mromanello / test_oai-pmh.py
Created October 18, 2011 14:43
Poking around with the Infrae OAI-PMH client module
from oaipmh.client import Client
from oaipmh.metadata import MetadataRegistry, oai_dc_reader
zenon_URL = "http://opac.dainst.org/OAI"
dai_URL = "http://arachne.uni-koeln.de:8080/OAI-PMH/oai-pmh.xml"
oc_URL="http://opencontext.org/oai/request"
registry = MetadataRegistry()
registry.registerReader('oai_dc', oai_dc_reader)
client = Client(oc_URL, registry)
@mromanello
mromanello / gist:5168303
Created March 15, 2013 08:25
Python script to create an example of data that can be displyaed in DARIAH geo-browser <http://dev2.dariah.eu/e4d/>, a tool to visualize data in time and space. Data are drawn from the Gazetteer and the Opac (library catalogue) of the German Archaeological Institute (DAI). This dataset contains data about publications related to the Limes groupe…
import json
import urllib
import urllib2
import sys
import codecs
"""
Author: Matteo Romanello, <mro@dainst.de>
"""
@mromanello
mromanello / gist:5168335
Created March 15, 2013 08:32
And this is the output you should get by running the script contained in [this Gist](https://gist.github.com/mromanello/5168303).
<?xml version="1.0" encoding="utf-8"?>
<kml xmlns="http://www.opengis.net/kml/2.2">
<Document>
<ExtendedData xmlns:dc="http://purl.org/dc/elements/1.1/">
<dc:description>
This dataset contains data about publications related to the Limes grouped by the place in the Limes they relate to.
The main purpose is the display within the DARIAH Geo-browser, a tool to visualize data in time and space.
</dc:description>
<dc:subject>Archaeology, geo-date, Limes</dc:subject>
<dc:identifier>http://de.digitalclassicist.org/berlin/files/output.kml</dc:identifier>
@mromanello
mromanello / desert_castle_levant.kml
Last active August 29, 2015 14:08
Mashup of data related to desert castles in the Levant using DAI's gazetteer and library catalogue. This mashup was created for This dataset was created for Sabine Thänert's presentation at the EVA 2014 conference (Berlin, 5-7 Nov. 2014). The visualization can be viewed at <http://geobrowser.de.dariah.eu/?csv=http://geobrowser.de.dariah.eu/stora…
<?xml version="1.0" encoding="utf-8"?>
<kml xmlns="http://www.opengis.net/kml/2.2">
<Document>
<!--
<ExtendedData xmlns:dc="http://purl.org/dc/elements/1.1/">
<dc:description>
This dataset was created for a presentation by Sabine Thänert at the EVA 2014 conferece (Berlin, 5-7 Nov. 2014).
The dataset contains data about data about publications contained in DAI's library catalog (Zenon)
that are related to desert castles of the Levant. This dataset can be visualized using <a href=" http://geobrowser.de.dariah.eu">DARIAH's geobrowser</a> to explore
the data through the dimensions of time and space.
@mromanello
mromanello / AWOL-index language distribution.ipynb
Created November 18, 2016 11:50
A quick-and-dirty hack to see the language distribution of journals listed in the AWOL Index <http://isaw.nyu.edu/publications/awol-index/index.html>
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
<?xml version="1.0" encoding="UTF-8"?>
<kml xmlns:dme="https://schereg.de.dariah.eu/namespaces/595b4c53127321587b2db26d/" xmlns:ns1="http://www.opengis.net/kml/2.2" xmlns:ns2="http://www.w3.org/2001/XMLSchema">
<Document>
<Placemark>
<name>Befund: 1114; Form: Kanne (, hellenistisch)</name>
<TimeSpan>
<begin>-336</begin>
<end>-30</end>
</TimeSpan>
<name>Befund: 31355; Form: Formschüssel (, )</name>
@mromanello
mromanello / process BMCR.ipynb
Created November 23, 2017 11:23
Quick and dirty notebook to demonstrate the extraction of canonical references from one BMCR record.
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@mromanello
mromanello / 20190207.ipynb
Created March 8, 2019 09:36
Instructions to query the VeniceScholar API to find out what are the cited publications for which we have the full-text.
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
@mromanello
mromanello / Work with impresso images.ipynb
Created March 11, 2019 13:40
Minimal example to work with image information in impresso canonical data stored in S3
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

Digital scholarly publications as APIs: the case of the Structures of Epic Poetry compendium

Matteo Romanello (École polytechnique fédérale de Lausanne)

Simone Finkmann (University of Rostock)

Abstract

Indices (e.g. locorum, nominum, rerum) facilitate the readers' orientation within a printed publication and help them find the information they are looking for. While the transition to digitalpublishing does not render indices less useful, it does require us to find an adequate adaptation of such indices for a digital publishing environment. What should an index of a digital scholarly publication look like? As argued by McGuire (2013), the natural translation of a printed index in a digital environment is an API (Application Programming Interface) — i.e. a machine-friendly interface aimed at enabling the programmatic access to content published on the Web — and the job of “good publishers of the future” is to provide APIs for their publications.