Get the latest tech news
Kitten TTS: 25MB CPU-Only, Open-Source Voice Model
Kitten TTS: a 25MB, CPU-only, open-source voice model. Build real-time speech without GPUs or fees. Install in minutes and ship fast.
Kitten TTS is aggressively CPU-optimized to run on your everyday laptop, a cheap Raspberry Pi, your Android phone... and probably even a smart toaster if you're feeling adventurous. The smart money, especially among the folks at r/LocalLLaMA, is that Kitten TTS is built on an architecture that's very similar to VITS(Variational Inference with Adversarial Learning for End-to-End Text-to-Speech) or possibly StyleTTS2. This unlocks applications like voice-enabled industrial sensors, talking toys for kids that don't spy on them, and smart home assistants that actually respect your privacy.The move to on-device processing is a direct response to growing public concern over data privacy, and Kitten is perfectly positioned to power this new wave of secure-by-design products.
Or read this on Hacker News