Skip to content

Instantly share code, notes, and snippets.

@kylebgorman
Created June 22, 2018 18:57
Show Gist options
  • Star 3 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save kylebgorman/c61cb02f511cd0cb0168d1dc9bdd8f5a to your computer and use it in GitHub Desktop.
Save kylebgorman/c61cb02f511cd0cb0168d1dc9bdd8f5a to your computer and use it in GitHub Desktop.
Function words
"""English function words.
Sets of English function words, based on
E.O. Selkirk. 1984. Phonology and syntax: The relationship between
sound and structure. Cambridge: MIT Press. (p. 352f.)
The categories are of my own creation.
"""
AUXILIARIES = frozenset([
'AM', 'ARE', "AREN'T", 'BEEN', 'DID', 'DO', 'DOES', "DON'T", 'HAD',
"HADN'T", 'HAS', "HASN'T", 'HAVE', "HAVEN'T", 'IS', "ISN'T", 'WAS',
"WASN'T", 'WERE', "WEREN'T"
])
CONJUNCTIONS = frozenset([
'AND', 'AS', 'BECAUSE', 'BEFORE', 'BOTH', 'BUT', 'CUZ', 'EXCEPT', 'IF',
'OR', 'NOR', 'SINCE', 'SO', 'THAT'
])
DETERMINERS = frozenset([
'A', 'AN', 'ANY', 'EACH', 'EITHER', 'EVERY', 'NEITHER', 'SOME', 'SUCH',
'THAT', 'THIS', 'THE', 'ALL', 'BOTH', 'ONE', 'ANOTHER'
])
Q_ADJECTIVES = frozenset([
'HALF', 'TWICE', 'FIRST', 'OTHER', 'NEXT', 'SECOND', 'LAST', 'MANY', 'MUCH',
'MORE', 'MOST', 'SEVERAL', 'FEW', 'LITTLE', 'LESS', 'LEAST', 'OWN'
])
INTENSIFIERS = frozenset(['SO', 'TOO'])
MODALS = frozenset([
'CAN', "CAN'T", 'COULD', "COULDN'T", 'MAY', 'MIGHT', 'MUST', "MUSTN'T",
'OUGHT', 'SHALL', "SHAN'T", 'SHOULD', "SHOULDN'T", 'WILL', 'WOULD',
"WOULDN'T"
])
NEGATION = frozenset(['NO', 'NOT'])
SPACE_DEIXIS = frozenset([
'ABOVE', 'ACROSS', 'AGAINST', 'AMONG', 'AMONGST', 'AT', 'BEHIND', 'BENEATH',
'BETWEEN', 'BEYOND', 'BY', 'FROM', 'IN', 'HERE', 'ON', 'OUT', 'THERE',
'THROUGH', 'TO', 'TOWARD', 'TOWARDS', 'WITH', 'UNDER', 'UP'
])
TIME_DEIXIS = frozenset(['AFTER', 'AT', 'DURING', 'IN', 'ON', "'TIL", 'UNTIL'])
PREPOSITIONS = (frozenset(['ABOUT', 'FOR', 'LIKE', 'OF']) | SPACE_DEIXIS
| TIME_DEIXIS)
TIME = frozenset([
'TODAY', 'TOMORROW', 'NOW', 'THEN', 'ALWAYS', 'NEVER', 'SOMETIMES',
'USUALLY', 'OFTEN'
])
ADVERBS = frozenset([
'THEREFORE', 'HOWEVER', 'BESIDES', 'MOREOVER', 'THOUGH', 'OTHERWISE',
'ELSE', 'INSTEAD', 'ANYWAY', 'INCIDENTALLY', 'MEANWHILE'
])
PRONOUNS = frozenset([
'HE', 'HER', 'HIM', 'HIS', 'I', 'IT', 'ITS', 'ME', 'MY', 'OUR', 'SHE',
'THEIR', 'THEM', 'THESE', 'THOSE', 'THEY', 'US', 'YOU', 'WE', 'MINE',
'OURS', 'THEIRS', 'MYSELF', 'HIMSELF', 'HERSELF', 'ITSELF', 'OURSELVES',
'THEMSELVES', 'ANYTHING', 'EVERYTHING', 'SOMETHING', 'NOTHING', 'ANYONE',
'EVERYONE', 'SOMEONE', 'ONE', 'SUCH'
])
WH = frozenset(
['HOW', 'WHAT', 'WHEN', 'WHERE', 'WHICH', 'WHO', 'WHOM', 'WHOSE', 'WHY'])
CLITICS = frozenset(["'D", "'LL", "'M", "'RE", "'S", "'T", "'VE", "N'T"])
FUNCTION_WORDS = (AUXILIARIES | CONJUNCTIONS | DETERMINERS | Q_ADJECTIVES
| INTENSIFIERS | MODALS | NEGATION | PREPOSITIONS | PRONOUNS
| WH | CLITICS)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment