{}

Listen to Page Powered by Fish Audio S1 {voices.length > 1 ?

{isDropdownOpen &&

{voices.map((voice, index) => )}

}

{}

; }; ## Prerequisites Sign up for a free Fish Audio account to get started with our API. 1. Go to [fish.audio/auth/signup](https://fish.audio/auth/signup) 2. Fill in your details to create an account, complete steps to verify your account. 3. Log in to your account and navigate to the [API section](https://fish.audio/app/api-keys) Once you have an account, you'll need an API key to authenticate your requests. 1. Log in to your [Fish Audio Dashboard](https://fish.audio/app/api-keys/) 2. Navigate to the API Keys section 3. Click "Create New Key" and give it a descriptive name, set a expiration if desired 4. Copy your key and store it securely Keep your API key secret! Never commit it to version control or share it publicly. ## Basic Usage Generate speech from text: ```typescript theme={null} import { FishAudioClient, play } from "fish-audio"; const fishAudio = new FishAudioClient({ apiKey: process.env.FISH_API_KEY }); const audio = await fishAudio.textToSpeech.convert({ text: "Hello, world!", }); await play(audio); ``` ## Using Voice Models Specify a voice model for consistent voice generation: ```typescript theme={null} import { FishAudioClient } from "fish-audio"; const fishAudio = new FishAudioClient(); const audio = await fishAudio.textToSpeech.convert({ text: "This is my custom voice", reference_id: "your_model_id", // Your model ID from fish.audio }); await play(audio); ``` ### Getting Model IDs The `reference_id` is the model ID from the URL when viewing a model on Fish Audio: * Model URL: `https://fish.audio/m/802e3bc2b27e49c2995d23ef70e6ac89` * Reference ID: `802e3bc2b27e49c2995d23ef70e6ac89` You can also get model IDs programmatically: ```typescript theme={null} // List your models const results = await fishAudio.voices.search({ self: true }); for (const model of results.items ?? []) { console.log(`${model.title}: ${model._id}`); } // Get specific model details const model = await fishAudio.voices.get("your_model_id"); console.log(`Model: ${model.title}, ID: ${model._id}`); ``` ## Emotions Add emotional expressions to your text: ```typescript theme={null} import type { TTSRequest } from "fish-audio"; const text = ` (happy) I'm excited to share this! (sad) Unfortunately, it didn't work out. (whispering) This is a secret. `; const request: TTSRequest = { text, reference_id: "model_id" }; ``` Common emotions: `(happy)`, `(sad)`, `(angry)`, `(excited)`, `(calm)`, `(surprised)`, `(whispering)`, `(shouting)`, `(laughing)`, `(sighing)` For more advanced control over speech generation, including phoneme-level control and additional paralanguage features, see [Fine-grained Control](/developer-guide/core-features/fine-grained-control). ## Audio Formats Choose output format based on your needs: ```typescript theme={null} // MP3 (default) await fishAudio.textToSpeech.convert({ text: "...", format: "mp3", mp3_bitrate: 192 }); // WAV - uncompressed await fishAudio.textToSpeech.convert({ text: "...", format: "wav", sample_rate: 44100 }); // Opus - efficient for streaming await fishAudio.textToSpeech.convert({ text: "...", format: "opus", opus_bitrate: 48 }); // PCM - raw audio data await fishAudio.textToSpeech.convert({ text: "...", format: "pcm", sample_rate: 16000 }); ``` ## Prosody Control Adjust speech speed and volume: ```typescript theme={null} const audio = await fishAudio.textToSpeech.convert({ text: "Adjusted speech", prosody: { speed: 1.2, // 0.5 - 2.0 volume: 5, // -20 - 20 }, }); ``` ## Advanced Parameters Fine-tune generation: ```typescript theme={null} const audio = await client.textToSpeech.convert({ text: "Your text here", chunk_length: 200, // Characters per chunk (100-300) normalize: true, // Normalize text latency: "balanced", // "normal" or "balanced" temperature: 0.7, // Randomness (0.0-1.0) top_p: 0.7, // Token selection (0.0-1.0) }); ``` ## Choosing Backend Our state-of-the-art [S1 model](/developer-guide/models-pricing/models-overview) is the default backend model for TTS. Optionally specify the model via the second argument (`backend: Backends`). ```typescript theme={null} const audio = await fishAudio.textToSpeech.convert({ text: "Hello, world!", }, "s1"); ``` ## Streaming For real-time streaming, see the [WebSocket guide](/developer-guide/sdk-guide/javascript/websocket). ## Error Handling Handle common errors: ```typescript theme={null} async function generateWithRetry(request: Record, maxRetries = 3) { const fishAudio = new FishAudioClient(); for (let attempt = 0; attempt < maxRetries; attempt++) { try { return await fishAudio.textToSpeech.convert(request); } catch (e: any) { const status = e?.status || e?.response?.status; if (status === 429) await new Promise(r => setTimeout(r, 2 ** attempt * 1000)); else if (status === 401) throw new Error("Invalid API key"); else throw e; } } } ``` ## Request Parameters | Parameter | Type | Description | Default | | -------------- | --------- | -------------------- | ---------- | | `text` | string | Text to convert | Required | | `reference_id` | string | Voice model ID | None | | `references` | object\[] | Reference audio | \[] | | `format` | string | Audio format | "mp3" | | `chunk_length` | number | Chunk size (100-300) | 200 | | `normalize` | boolean | Normalize text | true | | `latency` | string | Speed vs quality | "balanced" | | `prosody` | object | Speed/volume | None | | `temperature` | number | Randomness | 0.7 | | `top_p` | number | Token selection | 0.7 | ## Next Steps * [Fine-grained control](/developer-guide/core-features/fine-grained-control) for phoneme-level control and paralanguage * [Voice cloning](/developer-guide/sdk-guide/javascript/voice-cloning) for custom voices * [WebSocket streaming](/developer-guide/sdk-guide/javascript/websocket) for real-time apps * [Guide and Best Practices](/developer-guide/core-features/text-to-speech) for production use * [API reference](/api-reference/endpoint/openapi-v1/text-to-speech) for direct API calls