This is a quick draft of a data format specification to encode annotated character data to support overlapping markup, also known as standoff markup.
Everything is subject to discussion
- By now this is kind of a fork of atjson
- Ted Nelson's xanadoc EDL format
- OCR formats ALTO, PAGE, hOCR
- Mac OSX Core Text