@konklone
Last active December 24, 2015 16:59
breakout notes

Annotation Thesaurus Breakout

Participants

Eric Mill eric@konklone.com
Waldo Jaquith
Greg Elin greg@gitmachines.com
Balazs Czifra balazs@czifra.com
Peter Brantley pbrantley@hypothes.is

General Discussion

State Decoded legal dictionary: one big JSON file

Wiktionary has a legal dictionary, but it is CC-BY licensed, so any hover annotation that uses it would need to carry attribution.

Discussing a set of terms that are not copyrighted, so there is no barrier to use.

Virginia State Decoded has created 12,000 definitions. Waldo is hoping to aggregate the dictionary.

Discussing a simple tool for doing word association for synonyms

Eric's Glossary - https://github.com/unitedstates/glossary

What are we discussing?

Thesaurus Glossary

Governance

  • No copyright so really easy to use
  • Control updates by using commit management
  • Clear description of purpose
  • It does not matter if a state redefines or extends the glossary, because each authority is responsible for its own namespace
  • Handling differences between states is about creating synonyms
  • Bake exposure of the glossary itself into any tool that uses glossaries.
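
A minimal sketch of the namespace and synonym points above. Everything named here is hypothetical: the `Glossary` class, the `us-va` and `us-md` authority codes, and the sample wording are invented for illustration and do not come from State Decoded or the unitedstates glossary.

```python
from dataclasses import dataclass, field


@dataclass
class Glossary:
    """Hypothetical store: definitions keyed by (authority, term)."""
    definitions: dict = field(default_factory=dict)
    synonyms: dict = field(default_factory=dict)

    def define(self, authority, term, text):
        # Each authority writes only inside its own namespace, so one
        # state redefining a term never touches another state's entry.
        self.definitions[(authority, term.lower())] = text

    def link(self, a, b):
        # Record that two (authority, term) pairs are synonyms.
        a = (a[0], a[1].lower())
        b = (b[0], b[1].lower())
        self.synonyms.setdefault(a, set()).add(b)
        self.synonyms.setdefault(b, set()).add(a)

    def lookup(self, authority, term):
        key = (authority, term.lower())
        if key in self.definitions:
            return self.definitions[key]
        # Fall back to a synonym defined by another authority.
        for other in sorted(self.synonyms.get(key, ())):
            if other in self.definitions:
                return self.definitions[other]
        return None


glossary = Glossary()
glossary.define("us-va", "juvenile", "A person under the age of 18.")
glossary.link(("us-va", "juvenile"), ("us-md", "minor"))
print(glossary.lookup("us-md", "minor"))
```

Keying on the pair rather than the bare term is what lets differences between states be handled purely as synonym links.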

Handling usage errors

  • Waldo: Cases in which a term is used incorrectly. How do you catch it?
  • Balazs: How do other groups deal with handling definitions and contexts? Do you invert and track context?
  • Discussion of content classification
  • State Decoded could track usage of mouseovers.
  • Alternatively, include every context of a term in its definition and let users select the appropriate sense (e.g., "schedule" used in different senses). This could be solved the way a dictionary does it, using numbered definitions.
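
The numbered-definition idea could look roughly like this. It is only a sketch: `format_senses` is a hypothetical helper, and the two senses of "schedule" are invented sample wording, not entries from any real glossary.

```python
def format_senses(term, senses):
    """Render a term with numbered senses, dictionary style."""
    lines = [term]
    for number, sense in enumerate(senses, start=1):
        lines.append(f"  {number}. {sense}")
    return "\n".join(lines)


# Invented sample wording, for illustration only.
SCHEDULE = [
    "A list or table appended to a statute.",
    "A timetable of planned events.",
]

print(format_senses("schedule", SCHEDULE))
```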

Example

  • "Minor" is defined as a person under the age of 18, and then legislation uses the term "minor offense", where the definition does not apply
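
One possible way to catch the "minor offense" case is to match the longest phrase first, so a known phrase shadows the shorter term inside it. This is a sketch under assumptions, not something the group settled on: `find_terms` is a hypothetical helper, and it assumes phrases such as "minor offense" have been listed alongside the glossary terms.

```python
import re


def find_terms(text, terms):
    """Return (start, end, term) for each hit, preferring longer phrases."""
    if not terms:
        return []
    # Regex alternation tries options in order, so sorting longest-first
    # makes "minor offense" win over "minor" at the same position.
    ordered = sorted(terms, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in ordered) + r")\b",
        re.IGNORECASE,
    )
    return [(m.start(), m.end(), m.group(0).lower())
            for m in pattern.finditer(text)]


hits = find_terms("A minor charged with a minor offense.",
                  ["minor", "minor offense"])
for start, end, term in hits:
    print(start, end, term)
```

A hit on "minor offense" can then be left unannotated, or given its own definition, instead of showing the under-18 definition.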

Technical implementation discussion

  • Should edits be made in native JSON? Or should edits be made on individual terms, which are then assembled?
  • Greg: Developing a simple protocol for doing definitions. Creating definitions in a GitHub repository.
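
If edits were made on individual terms, the assembly step might be sketched as below. The layout is an assumption: one JSON file per term with hypothetical `term` and `definition` keys. Neither the file layout nor the key names come from the State Decoded dictionary.

```python
import json
import tempfile
from pathlib import Path


def assemble(term_dir):
    """Merge every *.json file in term_dir into one dict keyed by term."""
    glossary = {}
    for path in sorted(Path(term_dir).glob("*.json")):
        entry = json.loads(path.read_text(encoding="utf-8"))
        term = entry["term"].lower()
        if term in glossary:
            # Two files defining the same term is an editing mistake.
            raise ValueError(f"duplicate term {term!r} in {path.name}")
        glossary[term] = entry["definition"]
    return glossary


def demo():
    """Write two invented term files to a temp directory and assemble them."""
    samples = [("minor", "A person under the age of 18."),
               ("schedule", "A list or table appended to a statute.")]
    with tempfile.TemporaryDirectory() as tmp:
        for term, text in samples:
            Path(tmp, f"{term}.json").write_text(
                json.dumps({"term": term, "definition": text}),
                encoding="utf-8")
        return json.dumps(assemble(tmp), indent=2, sort_keys=True)


print(demo())
```

Per-term files keep each commit small and easy to review, while the assembled file is what a tool would actually load.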

How to expand the glossary corpus

  • Identify the terms that need to be defined
  • Discuss whether this could be a pro bono activity for lawyers
  • A simple enough ask for a crowdsourced project. Eric described a similar example of using GitHub-generated issues to gather information on members of Congress
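
A rough sketch of how the first and last points could fit together: compare candidate terms against the glossary and draft one issue per missing term. `draft_issues` and the title and body wording are hypothetical, and actually posting the drafts to GitHub is deliberately left out.

```python
def draft_issues(candidates, glossary):
    """Return one issue draft per candidate term missing from the glossary."""
    defined = {term.lower() for term in glossary}
    missing = sorted({c.lower() for c in candidates} - defined)
    return [
        {
            "title": f"Define: {term}",
            "body": f"The term '{term}' has no definition yet. "
                    "Please propose one, with a source if possible.",
        }
        for term in missing
    ]


drafts = draft_issues(["Minor", "Schedule", "tort"],
                      {"minor": "A person under the age of 18."})
for draft in drafts:
    print(draft["title"])
```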

Links

https://github.com/statedecoded/legal-dictionary
https://github.com/unitedstates/glossary
