nikolamilosevic86 / Annotation proposals.md
Last active September 30, 2016 15:58
Proposals for annotations of tables
{
   "xml": "<table>
              <tr><td>parameter</td><td>number</td></tr>
              <tr><td>male/female</td><td>15/18</td></tr>
          </table>",
   "denotations": [
      {"id": "T1", "xpath": "/table/tr[1]/td[1]", "obj": "Header"},
      {"id": "T2", "xpath": "/table/tr[1]/td[1]", "obj": "Stub"},
      {"id": "T3","xpath": "/table/tr[1]/td[2]", "obj": "Header"},

RichAnnotator - Annotation Tool For Rich Web Documents

Authors: Nikola Milosevic, Goran Nenadic

Introduction

Most current annotation schemas and tools allow annotation of entities, and of relationships between words or phrases, in raw textual documents. However, most documents on the web are rich documents, usually presented in HTML or a specialized XML format and then rendered by web browsers. Vast amounts of information are presented in tables, figures and other rich text elements, and that information is lost when only the text is annotated. For example, in clinical trial publications, information about experimental settings, results and adverse events is usually presented in tables. Many such publications are accessible via PubMed Central in rich XML/HTML format, but gathering and semantically enriching information from elements such as tables is not currently possible.
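To make the idea of annotating table cells concrete, here is a minimal Python sketch of how XPath-addressed denotations, in the style of the JSON proposal above, could be resolved against a table. It is an illustration under stated assumptions, not the tool's implementation: the function names `resolve` and `annotate` are hypothetical, the table is a small copy of the example, and the standard library's `xml.etree.ElementTree` is used, which supports only a limited XPath subset and rejects absolute paths on an element, so the leading `/table` step is rewritten to a relative path.

```python
import xml.etree.ElementTree as ET

# Small copy of the example table (assumed well-formed XML).
TABLE_XML = """<table>
  <tr><td>parameter</td><td>number</td></tr>
  <tr><td>male/female</td><td>15/18</td></tr>
</table>"""

# Denotations in the style of the proposal: an id, an XPath, and a label.
DENOTATIONS = [
    {"id": "T1", "xpath": "/table/tr[1]/td[1]", "obj": "Header"},
    {"id": "T2", "xpath": "/table/tr[1]/td[1]", "obj": "Stub"},
    {"id": "T3", "xpath": "/table/tr[1]/td[2]", "obj": "Header"},
]


def resolve(root, xpath):
    """Return the element addressed by an absolute XPath, or None.

    ElementTree cannot evaluate absolute paths on an element, so the
    leading '/<root tag>' step is replaced with '.' (hypothetical helper).
    """
    prefix = "/" + root.tag
    if xpath != prefix and not xpath.startswith(prefix + "/"):
        raise ValueError(f"xpath must start with {prefix!r}: {xpath!r}")
    relative = "." + xpath[len(prefix):]
    return root.find(relative)


def annotate(xml_text, denotations):
    """Pair each denotation with the text of the cell it points to."""
    root = ET.fromstring(xml_text)
    result = []
    for denotation in denotations:
        cell = resolve(root, denotation["xpath"])
        text = cell.text if cell is not None else None
        result.append((denotation["id"], denotation["obj"], text))
    return result


if __name__ == "__main__":
    for ann_id, label, text in annotate(TABLE_XML, DENOTATIONS):
        print(ann_id, label, text)
```

Addressing cells by XPath keeps the annotations separate from the document markup, so the same table can carry several labels (here the top-left cell is both `Header` and `Stub`) without modifying the source XML.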

Aim

We aim to develop a web-based annotation tool that will allow users to