Show HN: Three new Kitten TTS models – smallest less than 25MB
439 by rohan_joshi | 162 comments on Hacker News.
Kitten TTS ( https://ift.tt/MbvNAFU ) is an open-source series of tiny and expressive text-to-speech models for on-device applications. We had a thread last year here: https://ift.tt/LExbUqP . Today we're releasing three new models with 80M, 40M and 14M parameters. The largest model (80M) has the highest quality. The 14M variant reaches new SOTA in expressivity among similar sized models, despite being <25MB in size. This release is a major upgrade from the previous one and supports English text-to-speech applications in eight voices: four male and four female. Here's a short demo: https://www.youtube.com/watch?v=ge3u5qblqZA . Most models are quantized to int8 + fp16, and they use ONNX for runtime. Our models are designed to run anywhere eg. raspberry pi, low-end smartphones, wearables, browsers etc. No GPU required! This release aims to bridge the gap between on-device and cloud models for tts applications. Multi-lingual model release is coming soon. On-device AI is bottlenecked by one thing: a lack of tiny models that actually perform. Our goal is to open-source more models to run production-ready voice agents and apps entirely on-device. We would love your feedback!
-
U.S. Postal Service starts nationwide electric vehicle fleet, buying 9,250 EVs 444 by lxm | 336 comments on Hacker News.
-
Learn Postgres at the Playground – Postgres compiled to WASM running in browser 543 by samwillis | 144 comments on Hacker News.
-
NSA, NIST, and post-quantum crypto: my second lawsuit against the US government 486 by trulyrandom | 143 comments on Hacker News.