Skip to content

Instantly share code, notes, and snippets.


Nikola Milosevic nikolamilosevic86

View GitHub Profile
nikolamilosevic86 / Annotation
Last active Sep 30, 2016
Proposals for annotations of tables
View Annotation
   "xml": "<table>
   "denotations": [
      {"id": "T1", "xpath": "/table/tr[1]/td[1]", "obj": "Header"},
      {"id": "T2", "xpath": "/table/tr[1]/td[1]", "obj": "Stub"},
 {"id": "T3","xpath": "/table/tr[1]/td[2]", "obj": "Header"},
View Rich web document

RichAnnotator - Annotation Tool For Rich Web Documents

Author: Nikola Milosevic, Goran Nenadic


Most current annotation schemas and tools allow annotation of entities and relationships between the words or phrases in raw textual documents. However, most of the documents on the web are actually rich documents, usually presented in HTML or specialized XML format, then interpreted in web browsers. Vast amounts of information is presented in tables, figures and other rich text elements, which is lost when only the text is annotated. For example, in clinical trial publications information about experimental settings, results and adverse events are usually presented in tables. Many of such publications are accessible via PubMedCentral in rich XML/HTML format, but gathering and semantically enriching information from elements such as tables is not currently possible.


We are aiming to develop a web based annotation tool which will allow users to