Authors: Nikola Milosevic, Goran Nenadic
Most current annotation schemas and tools support annotating entities and relationships between words or phrases in raw text documents. However, most documents on the web are rich documents, usually stored in HTML or a specialized XML format and rendered by web browsers. Vast amounts of information are presented in tables, figures and other rich text elements, and this information is lost when only the text is annotated. For example, in clinical trial publications, information about experimental settings, results and adverse events is usually presented in tables. Many such publications are accessible via PubMed Central in rich XML/HTML format, but gathering and semantically enriching information from elements such as tables is not currently possible.
We aim to develop a web-based annotation tool that will allow users to