Skip to content

Instantly share code, notes, and snippets.

Embed
What would you like to do?
# encode pre-tokenized text string
encoded_output_2 = tokenizer.encode(sentence.split(), is_pretokenized=True)
print(encoded_output_2.tokens)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.