Voice Changer
Apply real-time voice effects and transformations to any audio.
Overview
The Voice Changer API lets you transform audio with a library of built-in effects and fine-grained controls for pitch, intensity, and format. Process recordings or pipe in live audio for real-time transformations.
Quickstart
Apply an effect to an audio file in just a few lines of code.
Transform Audio
/v1/voice-changer/transform
Apply a voice effect to an uploaded audio file. You can control the effect intensity, apply pitch shifting, and choose the output format. Supports WAV, MP3, FLAC, and OGG inputs.
| Parameter | Type | Description |
|---|---|---|
| file (required) | file | Audio file to transform (WAV, MP3, FLAC, OGG) |
| effect (required) | string | Effect preset: deep, high, robotic, whisper, echo, chipmunk, or radio |
| intensity | number | Effect strength from 0.0 to 1.0. Defaults to 0.5 |
| pitch_shift | number | Pitch adjustment in semitones (-12 to 12). Defaults to 0 |
| output_format | string | Output format: mp3, wav, flac, or ogg. Defaults to mp3 |
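The documented ranges and defaults can be checked on the client before uploading anything. The sketch below is a minimal illustration: `build_transform_fields` is a hypothetical helper, not part of any official SDK, and it only validates the non-file parameters from the table above. How the fields are sent (HTTP method, authentication, encoding) is not covered here.

```python
# Allowed values taken from the parameter table above.
ALLOWED_EFFECTS = {"deep", "high", "robotic", "whisper", "echo", "chipmunk", "radio"}
ALLOWED_FORMATS = {"mp3", "wav", "flac", "ogg"}


def build_transform_fields(effect, intensity=0.5, pitch_shift=0, output_format="mp3"):
    """Validate the documented parameters and return them as a dict.

    Hypothetical helper: the audio file itself is attached separately.
    """
    if effect not in ALLOWED_EFFECTS:
        raise ValueError(f"unknown effect: {effect!r}")
    if not 0.0 <= intensity <= 1.0:
        raise ValueError("intensity must be between 0.0 and 1.0")
    if not -12 <= pitch_shift <= 12:
        raise ValueError("pitch_shift must be between -12 and 12 semitones")
    if output_format not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported output_format: {output_format!r}")
    return {
        "effect": effect,
        "intensity": intensity,
        "pitch_shift": pitch_shift,
        "output_format": output_format,
    }


fields = build_transform_fields("robotic", intensity=0.6, pitch_shift=-2, output_format="wav")
```

Failing early on an out-of-range value avoids uploading a large file only to have the request rejected.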
Response
List Effects
/v1/voice-changer/effects
Retrieve a list of all available voice effect presets. No parameters required. Each effect includes a description, default intensity, and a preview audio URL.
Response
Effects Reference
Overview of all built-in effect presets and their characteristics.
| Effect | Description | Default Intensity | Best For |
|---|---|---|---|
| deep | Lowers pitch for a resonant voice | 0.5 | Narration, trailers |
| high | Raises pitch for a lighter voice | 0.5 | Character voices |
| robotic | Synthetic modulation | 0.6 | Sci-fi, games |
| whisper | Soft whisper conversion | 0.7 | ASMR, storytelling |
| echo | Spatial echo and reverb | 0.4 | Ambience, music |
| chipmunk | High-speed pitch shift | 0.8 | Comedy, entertainment |
| radio | Vintage AM/FM simulation | 0.5 | Podcasts, retro style |
Response Objects
Reference for the objects returned by Voice Changer endpoints.
Transform Result Object
| Field | Type | Description |
|---|---|---|
| audio_url | string | URL to download the transformed audio file |
| duration | number | Duration of the output audio in seconds |
| effect_applied | string | Name of the effect that was applied |
| intensity | number | Intensity level that was used (0.0 to 1.0) |
| format | string | Output format of the audio file |
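One way to work with this object in client code is to map it onto a small typed record. The sketch below assumes the result arrives as a JSON object whose top-level keys match the field names in the table; `TransformResult` is a hypothetical class name, and the sample payload is made up for illustration rather than captured from the API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TransformResult:
    """Typed view of the Transform Result Object fields listed above."""

    audio_url: str
    duration: float
    effect_applied: str
    intensity: float
    format: str

    @classmethod
    def from_dict(cls, payload):
        """Build a result from a decoded JSON object, ignoring unknown keys."""
        return cls(
            audio_url=str(payload["audio_url"]),
            duration=float(payload["duration"]),
            effect_applied=str(payload["effect_applied"]),
            intensity=float(payload["intensity"]),
            format=str(payload["format"]),
        )


# Made-up sample payload for illustration only; not real API output.
sample = {
    "audio_url": "https://example.com/output.mp3",
    "duration": 12.5,
    "effect_applied": "deep",
    "intensity": 0.5,
    "format": "mp3",
}
result = TransformResult.from_dict(sample)
```

A missing field raises `KeyError` immediately, which makes schema mismatches visible at the point of parsing.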
Best Practices
Start with moderate intensity
Begin with the default intensity value for each effect and adjust incrementally. High intensity values can introduce artifacts, especially with the robotic and echo presets. Values between 0.4 and 0.7 usually produce the most natural results.
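The incremental approach can be sketched as a small helper that generates candidate intensities around each preset's default. The defaults come from the Effects Reference table; `intensity_candidates`, its step size, and the number of steps are illustrative assumptions, not values recommended by the API.

```python
# Default intensities copied from the Effects Reference table.
DEFAULT_INTENSITY = {
    "deep": 0.5,
    "high": 0.5,
    "robotic": 0.6,
    "whisper": 0.7,
    "echo": 0.4,
    "chipmunk": 0.8,
    "radio": 0.5,
}


def intensity_candidates(effect, step=0.1, steps_each_way=2):
    """Return intensities around the preset default, clamped to 0.0-1.0."""
    base = DEFAULT_INTENSITY[effect]
    values = []
    for i in range(-steps_each_way, steps_each_way + 1):
        # Round to avoid floating-point noise such as 0.30000000000000004.
        value = round(base + i * step, 2)
        if 0.0 <= value <= 1.0:
            values.append(value)
    return values


robotic_values = intensity_candidates("robotic")
```

Each candidate can then be tried in turn, listening for artifacts before moving further from the default.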
Combine pitch shift with effects thoughtfully
Pitch shifting works independently from effect presets. A subtle pitch shift of 1-2 semitones combined with an effect can produce unique results, but large shifts paired with heavy effects may degrade audio quality.
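To get a feel for what a given `pitch_shift` value means, it helps to convert semitones into a frequency ratio. The sketch below uses the standard equal-temperament definition, where each semitone multiplies frequency by the twelfth root of two. How the service implements pitch shifting internally is not stated, so treat this purely as a way to reason about the size of a shift.

```python
def semitone_ratio(semitones):
    """Frequency ratio for a shift in equal-tempered semitones."""
    return 2.0 ** (semitones / 12.0)


# A 2-semitone shift raises frequencies by roughly 12 percent,
# while the maximum of 12 semitones doubles them (one octave up).
subtle = semitone_ratio(2)
octave_up = semitone_ratio(12)
octave_down = semitone_ratio(-12)
```

The jump from a 12 percent change to a full doubling illustrates why small shifts tend to combine more cleanly with heavy effects than large ones.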
Choose the right output format
Use WAV for maximum quality when processing audio further downstream. Use MP3 for smaller file sizes in web delivery. FLAC offers lossless compression if you need both quality and reduced file size.
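The guidance above can be expressed as a small decision helper. `choose_output_format` and its two flags are hypothetical names introduced for illustration; the mapping simply restates the recommendations in this section and falls back to the documented default of `mp3`.

```python
def choose_output_format(further_processing=False, need_small_files=False):
    """Pick an output_format value following the guidance in this section."""
    if further_processing and need_small_files:
        return "flac"  # lossless compression: quality plus reduced size
    if further_processing:
        return "wav"   # maximum quality for downstream processing
    return "mp3"       # smaller files for web delivery (also the API default)


archive_format = choose_output_format(further_processing=True, need_small_files=True)
```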
Use clean source audio
Effects work best on clean recordings with minimal background noise. If the source contains unwanted ambient sound, consider running it through the Nur denoiser endpoint first.