Gastron / kaldi-to-webdataset.py
Last active November 23, 2021 16:10
Converts Kaldi format data into WebDataset shards (BETA)
#!/usr/bin/env python3
"""Write Kaldi data as WebDataset shards"""
import multiprocessing as mp
import os
import pathlib
import queue
import subprocess

import torchaudio
import webdataset as wds