Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save piegu/3bb3786834839e140c057c5697fa6132 to your computer and use it in GitHub Desktop.
Save piegu/3bb3786834839e140c057c5697fa6132 to your computer and use it in GitHub Desktop.
English vs Portuguese tokenizer on Portuguese Wikipedia of Byte-Level-BPE_universal_tokenizer_but_en_tokenizer.ipynb
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment