@manchuwook
Last active June 4, 2023 19:58
A list of phrases to use for your ML vocal training.
She said she loved him
Only she said she loved him.
She only said she loved him.
She said only she loved him.
She said she only loved him.
She said she loved only him.
She said she loved him only.
Bob thought the law was taught by his father.
The hot dog caught on fire in the kitchen.
Don't toss your socks in the pot; they'll get caught!
I bought a lot of cotton shirts at the store.
The foggy morning brought a lot of thoughts to my mind.
My father bought a lot of apples at the store.
Don't bother with that old book; it's not worth it.
She loves to drink wine with her dinner.
Stop whining and start working on your project.
I need a pen to write down my notes.
Be careful not to prick yourself with that pin.
She's going to the university to study biology.
The duke of York is visiting our town.
Can you bring me a thin slice of bread, please?
This sweater is very warm and comfortable.
The sunset over the ocean was a breathtakingly beautiful sight.
Spending time with loved ones brings immense happiness to my heart.
I couldn't resist indulging in a rich, creamy piece of chocolate cake.
The aroma of freshly baked bread filled the kitchen with a delicious scent.
Exploring the ancient ruins was an exciting adventure we'll never forget.
The majestic elephant gracefully moved through the grasslands.
The colorful butterfly fluttered from flower to flower in the garden.
Studying astronomy allows us to learn about the vast expanse of the universe.
We gathered to celebrate their wedding with laughter and joy.
True friendship is built on trust, understanding, and support.
I captured a beautiful photograph of the snow-covered mountains.
The lively music had everyone on the dance floor feeling energetic.
The musician serenaded the crowd with his enchanting melodies.
It was a wonderful surprise to receive a thoughtful gift from a friend.
The chef prepared an exquisite meal with delicate flavors and presentation.
We savored every bite of the delicious homemade lasagna.
The documentary revealed the fascinating lives of marine creatures.
The orchestra performed a breathtaking symphony that moved the audience.
The view from the mountaintop was nothing short of majestic.
The book's intriguing plot kept me engaged until the very end.

Person names

Visit SSA Names and read off the top 20 or more for as many generations as you wish - Boomer (or older), Gen X, Millennial, Zoomer, etc. Try to also include common names that are not native to your country of origin. If you have names of fictional characters, take extra care to pronounce them consistently (you'd be surprised how often authors mess them up).

Country names

Say the name of each capital with its country or associated province/state. This is intended to handle the comma between them. Also use them in a phrase like "Last month, (country) signed a (peace treaty/economic deal/prisoner swap) with (another country) for the first time in over a decade."
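
The template phrase above can be filled in mechanically to produce a reading list. The following is a minimal sketch, not part of the original gist: the `CAPITALS` sample data and the function names are hypothetical and only illustrate the idea.

```python
from itertools import permutations

# Hypothetical sample data; extend with the capitals and countries you want covered.
CAPITALS = [("Paris", "France"), ("Tokyo", "Japan"), ("Ottawa", "Canada")]
# Event types taken from the template phrase above.
EVENTS = ["peace treaty", "economic deal", "prisoner swap"]


def capital_prompts(capitals):
    """One 'Capital, Country.' line per pair, to practice the comma pause."""
    return [f"{city}, {country}." for city, country in capitals]


def treaty_prompts(capitals, events):
    """Fill the template with every ordered pair of distinct countries."""
    countries = [country for _, country in capitals]
    prompts = []
    for i, (first, second) in enumerate(permutations(countries, 2)):
        event = events[i % len(events)]  # rotate through the event types
        article = "an" if event[0] in "aeiou" else "a"
        prompts.append(
            f"Last month, {first} signed {article} {event} with {second} "
            "for the first time in over a decade."
        )
    return prompts


if __name__ == "__main__":
    for line in capital_prompts(CAPITALS) + treaty_prompts(CAPITALS, EVENTS):
        print(line)
```

Rotating through the event types keeps the list short while still covering each phrase variant several times.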

City names

  • High population cities
  • Interesting cities, known for major attractions or historical events

Monuments

  • Eiffel Tower
  • Tokyo Tower
  • The Statue of Liberty
  • The White House
  • ... and so on
(say One through Ten)
(say One-hundred through Nine-hundred, skipping every one hundred)
(say One-Thousand through Nine-thousand, skipping every one thousand)
(say One-Million through Nine-million, skipping every one million)
(... and so on for billions and trillions)
(Now do precision such as tenths, thousandths, millionths)
(Read off telephone numbers, just make sure not to give real ones. Give some that have 2, 3, and 4 in a row e.g. 223-444-5555)
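
If you would rather read the number drills from a script than improvise them, the sequences above can be generated. This is a rough sketch with hypothetical function names; it cannot check whether a phone-style number belongs to a real person, so that part still needs your own judgment.

```python
UNITS = ["one", "two", "three", "four", "five", "six", "seven", "eight", "nine"]
SCALES = ["hundred", "thousand", "million", "billion", "trillion"]


def scale_prompts():
    """Return 'one' through 'ten', then 'one hundred' through 'nine trillion'."""
    lines = UNITS + ["ten"]
    for scale in SCALES:
        lines += [f"{unit} {scale}" for unit in UNITS]
    return lines


def repeated_digit_number(a, b, c, d):
    """Phone-style string with runs of 2, 3, and 4 repeated digits."""
    return f"{a}{a}{b}-{c}{c}{c}-{d}{d}{d}{d}"


if __name__ == "__main__":
    for line in scale_prompts():
        print(line)
    print(repeated_digit_number(2, 3, 4, 5))  # same shape as the 223-444-5555 example
```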
Read off numbers in newscaster/stock exchange-style sentences such as
"One point one million people were all given the pink slip today as the Malaysian government shut down the public workers division"
"On this day, June seventh, 20 people were given cake in Georgia."
"You can call the Orange delivery hotline at one eight hundred two two nine thirty-thirty"
{
"sentences": [
{
"filename": "a_beautiful_day.wav",
"duration": "",
"words": [
{
"word": "A",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "beautiful",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "day",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "brings",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "joy",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "and",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "happiness,",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "like",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "a",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "bird",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "singing",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "in",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "the",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "morning",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "light.",
"in": "00:00:14",
"out": "00:00:15"
}
]
},
{
"filename": "the_gentle_river.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "gentle",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "river",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "flows",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "through",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "the",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "green",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "valley,",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "bringing",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "life",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "to",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "the",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "quiet",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "village.",
"in": "00:00:13",
"out": "00:00:14"
}
]
},
{
"filename": "golden_rays_and_soft_breeze.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "sun's",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "golden",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "rays",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "bring",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "warmth",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "to",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "the",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "clear,",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "blue",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "sky,",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "while",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "the",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "soft",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "breeze",
"in": "00:00:14",
"out": "00:00:15"
},
{
"word": "whispers",
"in": "00:00:15",
"out": "00:00:16"
},
{
"word": "through",
"in": "00:00:16",
"out": "00:00:17"
},
{
"word": "the",
"in": "00:00:17",
"out": "00:00:18"
},
{
"word": "trees.",
"in": "00:00:18",
"out": "00:00:19"
}
]
},
{
"filename": "majestic_mountain.wav",
"duration": "",
"words": [
{
"word": "In",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "the",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "distance,",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "the",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "majestic",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "mountain",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "stands",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "tall,",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "untouched",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "by",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "the",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "passing",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "of",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "time.",
"in": "00:00:13",
"out": "00:00:14"
}
]
},
{
"filename": "enchanted_tales_of_love.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "book",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "on",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "the",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "shelf",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "is",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "filled",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "with",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "enchanting",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "tales",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "of",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "love,",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "courage,",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "and",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "adventure.",
"in": "00:00:14",
"out": "00:00:15"
}
]
},
{
"filename": "warm_fire_in_the_hearth.wav",
"duration": "",
"words": [
{
"word": "On",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "a",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "cold",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "winter's",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "day,",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "the",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "warm",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "fire",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "in",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "the",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "hearth",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "is",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "a",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "comforting",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "presence.",
"in": "00:00:14",
"out": "00:00:15"
}
]
},
{
"filename": "hot_tea_and_good_book.wav",
"duration": "",
"words": [
{
"word": "A",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "cup",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "of",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "hot",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "tea",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "and",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "a",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "good",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "book",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "are",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "the",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "perfect",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "companions",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "for",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "a",
"in": "00:00:14",
"out": "00:00:15"
},
{
"word": "rainy",
"in": "00:00:15",
"out": "00:00:16"
},
{
"word": "day.",
"in": "00:00:16",
"out": "00:00:17"
}
]
},
{
"filename": "laughter_of_children_in_the_park.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "laughter",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "and",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "chatter",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "of",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "children",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "playing",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "in",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "the",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "park",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "is",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "a",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "delightful",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "sound.",
"in": "00:00:13",
"out": "00:00:14"
}
]
},
{
"filename": "riot_of_color_in_the_garden.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "garden",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "is",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "a",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "riot",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "of",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "color,",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "with",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "flowers",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "blooming",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "in",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "every",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "corner.",
"in": "00:00:12",
"out": "00:00:13"
}
]
},
{
"filename": "stars_in_the_night_sky.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "stars",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "in",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "the",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "night",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "sky",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "are",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "like",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "diamonds,",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "shining",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "brightly",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "in",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "the",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "darkness.",
"in": "00:00:13",
"out": "00:00:14"
}
]
},
{
"filename": "moon_on_the_calm_sea.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "moon",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "shines",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "brightly",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "on",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "the",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "calm",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "sea.",
"in": "00:00:07",
"out": "00:00:08"
}
]
},
{
"filename": "ripening_cherries_in_summer_sun.wav",
"duration": "",
"words": [
{
"word": "Cherries",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "ripen",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "quickly",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "in",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "the",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "warm",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "summer",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "sun.",
"in": "00:00:07",
"out": "00:00:08"
}
]
},
{
"filename": "content_kitten_purring.wav",
"duration": "",
"words": [
{
"word": "A",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "kitten",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "purrs",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "softly",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "when",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "it",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "is",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "content.",
"in": "00:00:07",
"out": "00:00:08"
}
]
},
{
"filename": "mighty_oak_tree_in_the_forest.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "mighty",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "oak",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "tree",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "stands",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "tall",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "in",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "the",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "forest.",
"in": "00:00:08",
"out": "00:00:09"
}
]
},
{
"filename": "rustling_leaves_in_the_trees.wav",
"duration": "",
"words": [
{
"word": "A",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "gentle",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "breeze",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "rustles",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "the",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "leaves",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "in",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "the",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "trees.",
"in": "00:00:08",
"out": "00:00:09"
}
]
},
{
"filename": "aroma_of_fresh_bread.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "aroma",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "of",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "fresh",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "bread",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "wafts",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "from",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "the",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "bakery.",
"in": "00:00:08",
"out": "00:00:09"
}
]
},
{
"filename": "old_stone_bridge_crosses_brook.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "old",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "stone",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "bridge",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "crosses",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "the",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "babbling",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "brook.",
"in": "00:00:07",
"out": "00:00:08"
}
]
},
{
"filename": "flashing_lightning_and_rumbling_thunder.wav",
"duration": "",
"words": [
{
"word": "Lightning",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "flashes",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "and",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "thunder",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "rumbles",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "in",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "the",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "stormy",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "night.",
"in": "00:00:08",
"out": "00:00:09"
}
]
},
{
"filename": "silence_of_the_forest.wav",
"duration": "",
"words": [
{
"word": "In",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "the",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "depths",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "of",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "the",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "forest,",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "silence",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "reigns",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "supreme.",
"in": "00:00:08",
"out": "00:00:09"
}
]
},
{
"filename": "the_old_clock.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "old",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "clock",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "ticks",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "steadily",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "in",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "the",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "corner.",
"in": "00:00:07",
"out": "00:00:08"
}
]
},
{
"filename": "with_tenure_suzie.wav",
"duration": "",
"words": [
{
"word": "With",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "tenure,",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "Suzie’d",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "have",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "all",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "the",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "more",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "leisure",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "for",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "yachting,",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "but",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "her",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "publications",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "are",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "no",
"in": "00:00:14",
"out": "00:00:15"
},
{
"word": "good.",
"in": "00:00:15",
"out": "00:00:16"
}
]
},
{
"filename": "shaw_beige_hooks.wav",
"duration": "",
"words": [
{
"word": "Shaw,",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "those",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "twelve",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "beige",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "hooks",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "are",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "joined",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "if",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "I",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "patch",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "a",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "young,",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "gooey",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "mouth.",
"in": "00:00:13",
"out": "00:00:14"
}
]
},
{
"filename": "shy_eurasian_footwear.wav",
"duration": "",
"words": [
{
"word": "Are",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "those",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "shy",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "Eurasian",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "footwear,",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "cowboy",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "chaps,",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "or",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "jolly",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "earthmoving",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "headgear?",
"in": "00:00:10",
"out": "00:00:11"
}
]
},
{
"filename": "beige_hue_waters_loch.wav",
"duration": "",
"words": [
{
"word": "The",
"in": "00:00:00",
"out": "00:00:01"
},
{
"word": "beige",
"in": "00:00:01",
"out": "00:00:02"
},
{
"word": "hue",
"in": "00:00:02",
"out": "00:00:03"
},
{
"word": "on",
"in": "00:00:03",
"out": "00:00:04"
},
{
"word": "the",
"in": "00:00:04",
"out": "00:00:05"
},
{
"word": "waters",
"in": "00:00:05",
"out": "00:00:06"
},
{
"word": "of",
"in": "00:00:06",
"out": "00:00:07"
},
{
"word": "the",
"in": "00:00:07",
"out": "00:00:08"
},
{
"word": "loch",
"in": "00:00:08",
"out": "00:00:09"
},
{
"word": "impressed",
"in": "00:00:09",
"out": "00:00:10"
},
{
"word": "all,",
"in": "00:00:10",
"out": "00:00:11"
},
{
"word": "including",
"in": "00:00:11",
"out": "00:00:12"
},
{
"word": "the",
"in": "00:00:12",
"out": "00:00:13"
},
{
"word": "French",
"in": "00:00:13",
"out": "00:00:14"
},
{
"word": "queen,",
"in": "00:00:14",
"out": "00:00:15"
},
{
"word": "before",
"in": "00:00:15",
"out": "00:00:16"
},
{
"word": "she",
"in": "00:00:16",
"out": "00:00:17"
},
{
"word": "heard",
"in": "00:00:17",
"out": "00:00:18"
},
{
"word": "that",
"in": "00:00:18",
"out": "00:00:19"
},
{
"word": "symphony",
"in": "00:00:19",
"out": "00:00:20"
},
{
"word": "again,",
"in": "00:00:20",
"out": "00:00:21"
},
{
"word": "just",
"in": "00:00:21",
"out": "00:00:22"
},
{
"word": "as",
"in": "00:00:22",
"out": "00:00:23"
},
{
"word": "young",
"in": "00:00:23",
"out": "00:00:24"
},
{
"word": "Arthur",
"in": "00:00:24",
"out": "00:00:25"
},
{
"word": "wanted.",
"in": "00:00:25",
"out": "00:00:26"
}
]
}
]
}
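
Every `"duration"` field in the JSON above is empty. Assuming it is meant to hold the clip length in the same `HH:MM:SS` format (an assumption, the gist does not say), it can be derived from the first `"in"` and the last `"out"` of each sentence. The sketch below uses hypothetical helper names and also checks that no word starts before the previous one ends.

```python
def timecode_to_seconds(timecode):
    """Convert 'HH:MM:SS' (seconds may be fractional) to seconds as a float."""
    hours, minutes, seconds = timecode.split(":")
    return int(hours) * 3600 + int(minutes) * 60 + float(seconds)


def seconds_to_timecode(total):
    """Convert a number of seconds to 'HH:MM:SS', rounded to whole seconds."""
    total = int(round(total))
    hours, rest = divmod(total, 3600)
    minutes, seconds = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}"


def fill_durations(doc):
    """Set each sentence's 'duration' to last 'out' minus first 'in'."""
    for sentence in doc["sentences"]:
        words = sentence["words"]
        if not words:
            continue  # nothing to measure
        for prev, cur in zip(words, words[1:]):
            if timecode_to_seconds(cur["in"]) < timecode_to_seconds(prev["out"]):
                raise ValueError(
                    f"{sentence['filename']}: '{cur['word']}' starts before "
                    f"'{prev['word']}' ends"
                )
        start = timecode_to_seconds(words[0]["in"])
        end = timecode_to_seconds(words[-1]["out"])
        sentence["duration"] = seconds_to_timecode(end - start)
    return doc


if __name__ == "__main__":
    # Shortened sample in the same shape as the JSON above.
    sample = {
        "sentences": [
            {
                "filename": "moon_on_the_calm_sea.wav",
                "duration": "",
                "words": [
                    {"word": "The", "in": "00:00:00", "out": "00:00:01"},
                    {"word": "moon", "in": "00:00:01", "out": "00:00:02"},
                ],
            }
        ]
    }
    print(fill_durations(sample)["sentences"][0]["duration"])
```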
A beautiful day brings joy and happiness, like a bird singing in the morning light.
The gentle river flows through the green valley, bringing life to the quiet village.
The sun's golden rays bring warmth to the clear, blue sky, while the soft breeze whispers through the trees.
In the distance, the majestic mountain stands tall, untouched by the passing of time.
The book on the shelf is filled with enchanting tales of love, courage, and adventure.
On a cold winter's day, the warm fire in the hearth is a comforting presence.
A cup of hot tea and a good book are the perfect companions for a rainy day.
The laughter and chatter of children playing in the park is a delightful sound.
The garden is a riot of color, with flowers blooming in every corner.
The stars in the night sky are like diamonds, shining brightly in the darkness.
The moon shines brightly on the calm sea.
Cherries ripen quickly in the warm summer sun.
A kitten purrs softly when it is content.
The mighty oak tree stands tall in the forest.
A gentle breeze rustles the leaves in the trees.
The aroma of fresh bread wafts from the bakery.
The old stone bridge crosses the babbling brook.
Lightning flashes and thunder rumbles in the stormy sky.
A shoal of fish dart through the crystal-clear water.
The lighthouse stands guard over the rocky shore.
Bright city lights twinkle in the distance.
A pebble skips across the surface of the pond.
The train chugs slowly up the steep hill.
A butterfly flutters among the beautiful flowers.
The farmer tends to his fields with care.
A squirrel scurries up the tall tree trunk.
The old clock in the hall chimes the hour.
A painter captures the beauty of the sunset on canvas.
The waves lap gently against the sandy beach.
The rustle of pages is the only sound in the library.
I am overflowing with a profound sense of happiness, as if every fiber of my being is infused with radiant sunshine and an endless supply of joy!
My heart is filled to the brim with an exuberant and contagious happiness that radiates like a thousand sunbeams, illuminating every corner of my existence!
I am immersed in an ocean of pure bliss and uncontainable giddiness, riding the waves of euphoria and reveling in the sheer delight of the present moment!
Every breath I take is infused with an intoxicating concoction of happiness, bubbling up within me like effervescent champagne, tickling my senses and making me positively giddy with delight!
My soul is dancing to the rhythm of unbridled joy, twirling and spinning in a symphony of happiness, creating an irresistible aura of positivity and contagious laughter!
I am enveloped in a cocoon of unadulterated happiness and serenity, where worries fade away, and my spirit soars like a carefree butterfly, intoxicated by the nectar of pure joy!
With each passing moment, my heart becomes a kaleidoscope of vibrant emotions, painting my world with an explosion of colors, and leaving me in a state of uncontainable, euphoric happiness!
I am floating on a cloud of unparalleled happiness, where time stands still, and my spirit soars high above the ordinary, embracing the extraordinary and relishing in the sheer bliss of the present!
My happiness knows no bounds, for it transcends the limitations of mere words, engulfing me in a warm embrace of unfiltered delight, like a symphony playing the most melodious notes that resonate within my soul!
In this moment, my entire being is a symphony of laughter, love, and unyielding happiness, as if the universe itself conspired to gift me an abundance of joy, filling my life with endless possibilities and immeasurable delight!
I am absolutely ecstatic and filled with an overwhelming sense of joy and happiness!
I can't contain my excitement and happiness; it's like fireworks exploding in my heart!
Every fiber of my being is buzzing with pure delight and an indescribable feeling of euphoria!
I am so incredibly elated and thrilled that I feel like I could leap for joy and dance with abandon!
My heart is brimming with uncontainable happiness, and I can't help but radiate positivity and bliss!
I am on cloud nine, surrounded by an aura of immense joy and a contagious, infectious happiness!
A wave of sheer delight and unadulterated glee washes over me, filling my soul with boundless happiness!
I'm positively overjoyed, my spirit soaring high as if I have wings and the whole world is my playground!
The sheer magnitude of happiness and euphoria I feel right now is indescribable and beyond measure!
In this moment, my heart is a symphony of jubilation, and my soul is dancing with uncontainable happiness and pure, unbridled joy!
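
The word-level JSON entries shown earlier follow a regular pattern: one entry per whitespace-separated word, each given a one-second placeholder slot. A sketch like the one below (hypothetical function names, not part of the original gist) could build that skeleton from any sentence in these lists. The timings are placeholders only; real in/out points still have to come from the recorded audio.

```python
import json


def _timecode(seconds):
    """Format whole seconds as 'HH:MM:SS'."""
    hours, rest = divmod(seconds, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def sentence_skeleton(text, filename):
    """Build one 'sentences' entry with a one-second placeholder slot per word."""
    words = [
        {"word": word, "in": _timecode(i), "out": _timecode(i + 1)}
        for i, word in enumerate(text.split())
    ]
    return {"filename": filename, "duration": "", "words": words}


if __name__ == "__main__":
    entry = sentence_skeleton(
        "The moon shines brightly on the calm sea.", "moon_on_the_calm_sea.wav"
    )
    print(json.dumps({"sentences": [entry]}, indent=2))
```

The filename is passed in explicitly because the filenames in the JSON are hand-picked summaries rather than something derivable from the sentence text.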
import json
from typing import Dict, List

FPS = 30  # Assuming 30 frames per second (non-drop-frame) for SMPTE timecode


class Sentence:
    """One recorded sentence: audio filename, duration, and per-word timecodes."""

    def __init__(self, filename: str, duration: str, words: List[Dict[str, str]]):
        self.filename = filename
        self.duration = duration
        self.words = words


class SentenceSerializer:
    @staticmethod
    def serialize(sentence: Sentence) -> str:
        return json.dumps(sentence.__dict__)

    @staticmethod
    def deserialize(data: str) -> Sentence:
        obj = json.loads(data)
        return Sentence(obj["filename"], obj["duration"], obj["words"])

    @staticmethod
    def to_smpte_timecode(timecode: str) -> str:
        # "HH:MM:SS[.fff]" -> "HH:MM:SS:FF"; only the fractional second becomes frames
        hours, minutes, seconds = timecode.split(":")
        total_seconds = int(hours) * 3600 + int(minutes) * 60 + float(seconds)
        total_frames = int(round(total_seconds * FPS))
        whole_seconds, frames = divmod(total_frames, FPS)
        whole_minutes, secs = divmod(whole_seconds, 60)
        hrs, mins = divmod(whole_minutes, 60)
        return f"{hrs:02d}:{mins:02d}:{secs:02d}:{frames:02d}"

    @staticmethod
    def from_smpte_timecode(timecode: str) -> str:
        # "HH:MM:SS:FF" -> "HH:MM:SS.fff"; frames are added to the seconds, not substituted
        hours, minutes, seconds, frames = timecode.split(":")
        secs = int(seconds) + int(frames) / FPS
        return f"{int(hours):02d}:{int(minutes):02d}:{secs:06.3f}"


# Usage example (a shortened but valid sample of the JSON file; the real file
# has more words per sentence and more sentences)
data = """
{
  "sentences": [
    {
      "filename": "laughter_of_children_in_the_park.wav",
      "duration": "",
      "words": [
        {
          "word": "The",
          "in": "00:00:00",
          "out": "00:00:01"
        }
      ]
    }
  ]
}
"""

# Deserialize JSON to Sentence objects
sentences_data = json.loads(data)["sentences"]
sentences = [SentenceSerializer.deserialize(json.dumps(s)) for s in sentences_data]

# Convert timecodes to SMPTE format
for sentence in sentences:
    for word in sentence.words:
        word["in"] = SentenceSerializer.to_smpte_timecode(word["in"])
        word["out"] = SentenceSerializer.to_smpte_timecode(word["out"])

# Convert SMPTE timecodes back to the original format (if needed)
# for sentence in sentences:
#     for word in sentence.words:
#         word["in"] = SentenceSerializer.from_smpte_timecode(word["in"])
#         word["out"] = SentenceSerializer.from_smpte_timecode(word["out"])

# Serialize Sentence objects to JSON
serialized_sentences = [SentenceSerializer.serialize(s) for s in sentences]
print(serialized_sentences)
@manchuwook
Author

manchuwook commented Jun 4, 2023

Be sure to speak these phrases naturally and at a regular pace.

  • Give yourself time to rest your vocal cords between sentences.
  • Keep a glass of water or tea with honey nearby to soothe your throat during long recording sessions.
  • Don't clear your throat too often.
  • Record 3-4 takes of the sentences at different times during the day before meals.

A high-quality microphone is recommended.

  • Use a pop filter to take the edge off of 'p' and 't' sounds, as these plosives can cause clipping
    • If you can't afford one, you can take a round hoop and put a sheer fabric around it; give yourself a diameter of 6-8 inches on the hoop
  • Keep the recording room small
    • Try to surround the area with dampening curtains or towels
    • The less reverb (echo) and background noise, the better the ML can focus on your voice
    • Cut a narrow band in your equalizer if a hum or whine is in the background, like an A/C in the environment or lightbulbs buzzing. The noise will typically sit in a fixed frequency range and show up as a peak. Drag that peak down to lessen the effect.
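
The equalizer cut described in the last bullet can also be approximated in code. The sketch below swaps in a different technique from dragging a band down by hand: a second-order notch filter, using the widely published Audio EQ Cookbook coefficient form. It assumes a 60 Hz mains hum (50 Hz in many countries), and all function names and the `q` default are illustrative choices, not anything Reaper or Audacity does internally.

```python
import math


def notch_coefficients(freq_hz, sample_rate, q=10.0):
    """Normalized biquad notch coefficients (b, a) centered on freq_hz."""
    w0 = 2.0 * math.pi * freq_hz / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = [1.0 / a0, -2.0 * math.cos(w0) / a0, 1.0 / a0]
    a = [1.0, -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0]
    return b, a


def apply_biquad(samples, b, a):
    """Run the samples through the filter (direct form I)."""
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for x0 in samples:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out


def sine_wave(freq_hz, sample_rate, count):
    """Test tone: 'count' samples of a unit-amplitude sine."""
    return [math.sin(2.0 * math.pi * freq_hz * n / sample_rate) for n in range(count)]


def rms(samples):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


if __name__ == "__main__":
    rate = 8000
    b, a = notch_coefficients(60.0, rate)
    hum = apply_biquad(sine_wave(60.0, rate, rate), b, a)
    tone = apply_biquad(sine_wave(1000.0, rate, rate), b, a)
    # Compare levels after the filter has settled (second half of each signal).
    print("60 Hz level after notch:", rms(hum[rate // 2:]))
    print("1 kHz level after notch:", rms(tone[rate // 2:]))
```

A higher `q` makes the cut narrower, which removes less of the voice around the hum but takes longer to settle.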

While Audacity is often used in YouTube videos because it is free, there are better recording, editing, mixing, and mastering tools like Cockos Reaper.

  • Yes, it costs money, but if you're putting in this kind of effort, you'll want to spring for the $60 (as of June 2023)
  • It has a ton of FX for free called VSTs
  • It can be scripted with Python, Lua, and EEL

@manchuwook
Author

Create new sentences and actively listen to your TTS results. To reduce hallucinations in your generated samples, you may have to go back and create additional recordings using replacements for robotic-sounding words in the context of a sentence.

  • 'nother instead of Another may mean you started too softly at the start of a sentence
  • Skipping or overextending syllables may mean it can't correctly concatenate words. Try to enunciate multisyllabic words more clearly.
  • Listen to how others pronounce the word - you may need to correct your own pronunciation. (It's okay, sometimes people have only seen the word written and never heard it spoken)
