Skip to content

Instantly share code, notes, and snippets.

View chainyo's full-sized avatar
🦁

Thomas Chaigneau chainyo

🦁
View GitHub Profile
from datasets import load_dataset
from transformers import AutoTokenizer
dataset = load_dataset("oscar-corpus/OSCAR-2201", "fr", split="train", streaming=True)
tokenizer = AutoTokenizer.from_pretrained("pegasus-large")
length = 1_000_000
iter_dataset = iter(dataset)
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login
"""🚀 Wordcab Python Package Gist
To install the Wordcab package:
$ pip install wordcab
To login to Wordcab API with your API key:
$ wordcab login