Pocket TTS is a lightweight text-to-speech application designed to run efficiently on CPUs. It generates audio from text with low latency and can handle infinitely long text inputs

Pocket TTS | Best AI for Speech | Find AI Tools & Apps

Pocket TTS is a lightweight text-to-speech application designed to run efficiently on CPUs. It generates audio from text with low latency and can handle infinitely long text inputs. The application supports Python 3.10 and above and requires PyTorch 2.5+, but does not need the GPU version of PyTorch. 
The application has a small model size of 100M parameters and uses only 2 CPU cores. It can stream audio and has a faster than real-time speed of approximately 6x real-time on a MacBook Air M4. Pocket TTS also supports voice cloning and has a catalog of pre-made voices, including alba, marius, and javert, among others. 
Pocket TTS can be used as a Python library, and its functionality can be accessed through a command-line interface or a local server. The application has a web interface that can be accessed at http://localhost:8000, allowing users to input text and select different voices. It also has a serve command that keeps the model in memory between requests, making it faster than the command line.

Pocket TTS

Key Features

Zero to AI Engineer

Subscribe to the AI Search Newsletter