This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
"""Script to add Chinese words from the input list to the word file when all the | |
characters in that word are known (i.e. exist in the character file). | |
The input list is currently assumed to be the output from the ArchChinese-Scraper | |
Chrome extension (see https://github.com/Khouderchah-Alex/ArchChinese-Scraper). | |
Note that in the current form, the character file must contain both traditional | |
and simplified characters. If one is only studying traditional or simplified | |
characters, the word_regex variable can be modified such the word capture group |