KittenTTS: TTS Model Under 25MB

3 months ago 6

Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis.

pip install https://github.com/KittenML/KittenTTS/releases/download/0.1/kittentts-0.1.0-py3-none-any.whl

from kittentts import KittenTTS m = KittenTTS("KittenML/kitten-tts-nano-0.1") audio = m.generate("This high quality TTS model works without a GPU", voice='expr-voice-2-f' ) # available_voices : [ 'expr-voice-2-m', 'expr-voice-2-f', 'expr-voice-3-m', 'expr-voice-3-f', 'expr-voice-4-m', 'expr-voice-4-f', 'expr-voice-5-m', 'expr-voice-5-f' ] # Save the audio import soundfile as sf sf.write('output.wav', audio, 24000)

Read Entire Article

KittenTTS: TTS Model Under 25MB

Related

Getting Britain Out of the Hole

Comparing Programming Communities on Reddit

Shader Execution Reordering Benchmarked