API Reference

/v1/audio/speech

Text-to-speech with the `kokoro-82m` model. The primary focus is neutral Spanish (Spain and Latin America), backed by a broad catalogue of voices in English, Brazilian Portuguese, French, Italian, Japanese, Mandarin and Hindi. P50 latency is under 200 ms for short phrases (under 30 characters).

Overview

Synthesises natural-sounding audio from text. Designed for interactive voice response (IVR), conversational agents and short narration. The model's small footprint (82M parameters) keeps CPU latency low enough for real-time use.

Endpoint and model

Send a POST request to `https://api.tesseraai.cloud/v1/audio/speech`. Set `model` to `kokoro-82m` and `voice` to one of the IDs in the catalogue below.

| Attribute | Value |
| --- | --- |
| Upstream model | hexgrad/Kokoro-82M |
| P50 latency | <200 ms for phrases under 30 characters |
| Output formats | mp3 (default), wav, opus, aac, flac, pcm |
| Sample rate | 24 kHz mono |
| Licence | Apache 2.0 |
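The sample rate in the table matters most for the `pcm` format, which carries no header. The sketch below estimates the duration of a raw PCM payload and wraps it in a WAV container using Python's standard `wave` module. The 24 kHz mono figures come from the table; the 16-bit sample width is an assumption (the table does not state a bit depth), and the helper names are illustrative.

```python
import io
import wave

SAMPLE_RATE_HZ = 24_000   # from the attribute table
CHANNELS = 1              # mono, from the attribute table
SAMPLE_WIDTH_BYTES = 2    # ASSUMPTION: 16-bit samples; not stated in this reference


def pcm_duration_seconds(pcm: bytes) -> float:
    """Estimate playback length of a headerless PCM payload."""
    return len(pcm) / (SAMPLE_RATE_HZ * CHANNELS * SAMPLE_WIDTH_BYTES)


def pcm_to_wav_bytes(pcm: bytes) -> bytes:
    """Wrap raw PCM samples in a WAV container so ordinary players can open them."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH_BYTES)
        wav.setframerate(SAMPLE_RATE_HZ)
        wav.writeframes(pcm)
    return buf.getvalue()
```

If the service uses a different sample width, only `SAMPLE_WIDTH_BYTES` should need changing.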

Available voices

Each voice ID encodes language and gender in its two-letter prefix. The first letter is the language: `e` Spanish, `a` American English, `b` British English, `p` Brazilian Portuguese, `f` French, `i` Italian, `j` Japanese, `z` Mandarin, `h` Hindi. The second letter is the gender: `f` (female) or `m` (male).

| Language | Voices |
| --- | --- |
| Spanish (neutral, ES + LATAM) | `ef_dora`, `em_alex`, `em_santa` |
| American English | `af_heart`, `af_sky`, `af_bella`, `af_nicole`, `af_sarah`, `af_aoede`, `af_kore`, `af_jessica`, `af_nova`, `af_river`, `af_jadzia`, `am_michael`, `am_adam`, `am_eric`, `am_fenrir`, `am_liam`, `am_onyx`, `am_puck`, `am_santa` |
| British English | `bf_alice`, `bf_emma`, `bf_isabella`, `bf_lily`, `bm_daniel`, `bm_fable`, `bm_george`, `bm_lewis` |
| Brazilian Portuguese | `pf_dora`, `pm_alex`, `pm_santa` |
| French | `ff_siwis` |
| Italian | `if_sara`, `im_nicola` |
| Japanese | `jf_alpha`, `jf_gongitsune`, `jf_nezumi`, `jf_tebukuro`, `jm_kumo` |
| Mandarin | `zf_xiaobei`, `zf_xiaoni`, `zf_xiaoxiao`, `zf_xiaoyi`, `zm_yunjian`, `zm_yunxi`, `zm_yunxia`, `zm_yunyang` |
| Hindi | `hf_alpha`, `hf_beta`, `hm_omega`, `hm_psi` |
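The prefix convention above can be decoded mechanically, which is handy for filtering voices client-side. A minimal sketch follows; the function name `describe_voice` is illustrative and not part of the API, and the lookup tables simply restate the prefix letters listed in this section.

```python
# Prefix letters as documented in this section.
LANGUAGES = {
    "e": "Spanish",
    "a": "American English",
    "b": "British English",
    "p": "Brazilian Portuguese",
    "f": "French",
    "i": "Italian",
    "j": "Japanese",
    "z": "Mandarin",
    "h": "Hindi",
}
GENDERS = {"f": "female", "m": "male"}


def describe_voice(voice_id: str) -> tuple[str, str, str]:
    """Split a voice ID such as 'ef_dora' into (language, gender, name)."""
    prefix, sep, name = voice_id.partition("_")
    if not sep or len(prefix) != 2 or not name:
        raise ValueError(f"unexpected voice ID shape: {voice_id!r}")
    try:
        return LANGUAGES[prefix[0]], GENDERS[prefix[1]], name
    except KeyError:
        raise ValueError(f"unknown prefix in voice ID: {voice_id!r}") from None
```

For example, `describe_voice("ef_dora")` yields `("Spanish", "female", "dora")`. The helper only checks the shape of the ID, not whether the voice actually exists in the catalogue.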

54 voices across 9 languages, listed by display name:

| Language | Female | Male |
| --- | --- | --- |
| Spanish (neutral) | Dora | Alex, Santa |
| American English | Alloy, Aoede, Bella, Heart, Jadzia, Jessica, Kore, Nicole, Nova, River, Sarah, Sky | Adam, Echo, Eric, Fenrir, Liam, Michael, Onyx, Puck, Santa |
| British English | Alice, Emma, Lily | Daniel, Fable, George, Lewis |
| Brazilian Portuguese | Dora | Alex, Santa |
| French | Siwis | (none) |
| Italian | Sara | Nicola |
| Japanese | Alpha, Gongitsune, Nezumi, Tebukuro | Kumo |
| Mandarin | Xiaobei, Xiaoni, Xiaoxiao, Xiaoyi | Yunjian, Yunxi, Yunxia, Yunyang |
| Hindi | Alpha, Beta | Omega, Psi |

Request

POST /v1/audio/speech
curl https://api.tesseraai.cloud/v1/audio/speech \
  -H "Authorization: Bearer $TESSERA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kokoro-82m",
    "input": "Welcome to Tessera. Your request has been received.",
    "voice": "af_heart",
    "response_format": "mp3"
  }' \
  --output welcome.mp3
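For callers who prefer Python to curl, the same request can be assembled with the standard library alone. This is a sketch under assumptions rather than an official client: `build_speech_request` is a hypothetical helper name, and the format check only mirrors the output formats listed in this reference. The function builds the request without sending it.

```python
import json
import urllib.request

API_URL = "https://api.tesseraai.cloud/v1/audio/speech"
# Output formats as listed in this reference; mp3 is the documented default.
FORMATS = ("mp3", "wav", "opus", "aac", "flac", "pcm")


def build_speech_request(
    text: str, voice: str, api_key: str, response_format: str = "mp3"
) -> urllib.request.Request:
    """Assemble (but do not send) the POST request shown in the curl example."""
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    body = {
        "model": "kokoro-82m",
        "input": text,
        "voice": voice,
        "response_format": response_format,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` would perform the call; the key would typically come from the `TESSERA_API_KEY` environment variable used in the curl example.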

Response

The response body is raw audio in the requested format. No JSON wrapper.
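Because the body is raw audio with no JSON wrapper, a client can stream it straight to disk without parsing. The sketch below shows one way to do that in Python; `save_audio` is an illustrative name, and it accepts any binary file-like object, such as the response returned by `urllib.request.urlopen`.

```python
import shutil
from typing import BinaryIO


def save_audio(response: BinaryIO, path: str, chunk_size: int = 64 * 1024) -> int:
    """Copy the raw audio body to `path` in chunks and return the bytes written."""
    with open(path, "wb") as out:
        shutil.copyfileobj(response, out, chunk_size)
        return out.tell()
```

A typical call would look like `with urllib.request.urlopen(request) as resp: save_audio(resp, "welcome.mp3")`, where `request` carries the headers and JSON body from the curl example. Error handling is left out because this reference does not describe error responses.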