Fish Audio S2.1 Pro


Key Features

Free Fish Audio API model exposed through the s2.1-pro-free model string.
Generates natural text-to-speech audio for developer applications.
Supports 83 languages from a single model and endpoint.
Targets low-latency speech generation, with roughly 90 ms time to first audio (TTFA) on standard calls.
Supports voice cloning from reference audio through Fish Audio workflows.
Uses the same Fish API endpoint pattern as paid plans.
Designed for voice agents, narration, localization, games, and prototyping.
Runs under fair-use terms with paid plans available for SLA and production guarantees.

S2.1 Pro supports 83 languages through one API endpoint. To use it, set the Fish API model header to s2.1-pro-free. Fish Audio describes S2.1 Pro as its current state-of-the-art voice model, with roughly 90 ms time to first audio on standard API calls, higher concurrency throughput, and the same endpoint structure as paid plans.
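As a rough illustration of selecting the model through a header, the sketch below builds a request without sending it. The endpoint URL, the bearer-token scheme, and the minimal JSON payload are assumptions made for illustration, and `build_tts_request` is a hypothetical helper. Only the `s2.1-pro-free` model string comes from this listing, so check the Fish Audio API documentation before relying on any other detail.

```python
import json
import urllib.request

# Assumption: illustrative endpoint URL; confirm it in the Fish Audio API docs.
FISH_TTS_URL = "https://api.fish.audio/v1/tts"
# Model string named in this listing.
MODEL_ID = "s2.1-pro-free"


def build_tts_request(text: str, api_key: str,
                      url: str = FISH_TTS_URL) -> urllib.request.Request:
    """Build (but do not send) a TTS request that selects the model via a header.

    Hypothetical helper: payload shape and auth scheme are assumptions.
    """
    body = json.dumps({"text": text}).encode("utf-8")  # assumed minimal payload
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed bearer-token auth
            "Content-Type": "application/json",
            "model": MODEL_ID,  # the model header described above
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. Keeping construction separate from sending makes it easy to inspect the headers first, and moving to a paid plan would presumably only change the model value, since the listing says the endpoint pattern is the same.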


S2.1 Pro is useful for voice agents, audiobook and narration pipelines, game NPC dialogue, multilingual product experiences, and prototype voice-cloning workflows. Free access is governed by fair-use constraints and does not include production SLA guarantees, so it suits evaluating quality and latency before moving commercial workloads to paid terms.
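For the latency side of such an evaluation, time to first audio can be measured on any streamed response. The helper below is a minimal sketch that is not tied to any Fish Audio SDK. `time_to_first_audio` is a hypothetical name, and it assumes the audio arrives as an iterable of byte chunks whose request is issued when iteration starts.

```python
import time
from typing import Callable, Iterable, Optional, Tuple


def time_to_first_audio(
    chunks: Iterable[bytes],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], int]:
    """Return (seconds until the first non-empty chunk, total bytes received).

    The first element is None if no non-empty chunk ever arrives.
    """
    start = clock()
    first: Optional[float] = None
    total = 0
    for chunk in chunks:
        # Record the elapsed time only once, at the first chunk carrying data.
        if chunk and first is None:
            first = clock() - start
        total += len(chunk)
    return first, total
```

If the request is sent before this function is called, the measured value understates the true latency, so the timer should start no later than the request itself. Averaging several calls gives a fairer comparison against the roughly 90 ms figure quoted above.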
