data = [("me gusta comer en la cafeteria".split(), "SPANISH"),
        ("Give it to me".split(), "ENGLISH"),
        ("No creo que sea una buena idea".split(), "SPANISH"),
        ("No it is not a good idea to get lost at sea".split(), "ENGLISH")]

test_data = [("Yo creo que si".split(), "SPANISH"),
             ("it is lost on me".split(), "ENGLISH")]

# word_to_ix maps each word in the vocab to a unique integer, which will be its
# index into the Bag of words vector
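
The comment describes `word_to_ix`, but the snippet ends before the mapping is built. The following is a minimal sketch of one way to construct it, assuming indices are assigned in order of first appearance across both `data` and `test_data`. The helper `make_bow_vector` is a hypothetical name that does not appear in the original, and it returns a plain Python list of counts rather than a tensor so the example runs without extra libraries.

```python
# Same sentences as above, repeated so this sketch is self-contained.
data = [("me gusta comer en la cafeteria".split(), "SPANISH"),
        ("Give it to me".split(), "ENGLISH"),
        ("No creo que sea una buena idea".split(), "SPANISH"),
        ("No it is not a good idea to get lost at sea".split(), "ENGLISH")]

test_data = [("Yo creo que si".split(), "SPANISH"),
             ("it is lost on me".split(), "ENGLISH")]

# Assumption: each new word gets the next free integer, in order of first appearance.
word_to_ix = {}
for sentence, _label in data + test_data:
    for word in sentence:
        if word not in word_to_ix:
            word_to_ix[word] = len(word_to_ix)

VOCAB_SIZE = len(word_to_ix)


def make_bow_vector(sentence, word_to_ix):
    """Hypothetical helper: count how often each vocabulary word occurs."""
    vec = [0] * len(word_to_ix)
    for word in sentence:
        vec[word_to_ix[word]] += 1
    return vec


bow = make_bow_vector(test_data[1][0], word_to_ix)
```

Here `bow` has one slot per vocabulary word, and the slot at `word_to_ix[w]` holds the number of times `w` occurs in the sentence. Words are case-sensitive in this sketch, so "Give" and "give" would receive different indices.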