
Gemini 3.1 Flash TTS

google/gemini-3.1-flash-tts
Google

Quick start

# Inspect the price: an unpaid request returns an HTTP 402 (Payment Required) challenge.
curl -i https://api.glianalabs.com/v1/infer \
  -H "content-type: application/json" \
  -d '{
    "model": "google/gemini-3.1-flash-tts",
    "text": "<string>"
  }'

# Pay + run in one step with the mppx CLI (create a wallet: npx mppx account create):
npx mppx https://api.glianalabs.com/v1/infer \
  -J '{"model": "google/gemini-3.1-flash-tts", "text": "<string>"}'


Parameters

Input
maxOutputTokens integer

Maximum number of tokens to generate

stopSequences array

Sequences at which the model stops generating further tokens

temperature number

Controls randomness in generation. Range: 0 to 2; lower values give more deterministic output.

text string required

The text to convert to speech. Maximum 10,000 characters.

topK integer

Samples only from the K most likely tokens. A smaller K gives more focused output; a larger K gives more diverse output.

topP number

Nucleus sampling threshold (0-1). Only the most likely tokens whose cumulative probability reaches topP are considered.

voice string

The voice to use for speech synthesis
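
The input parameters above can be combined into a single request body. What follows is a minimal sketch in POSIX shell, not the service's own tooling. `json_escape` and `build_request_body` are hypothetical helper names, and `<voice>` is a placeholder because valid voice names are not listed here. The escaping is deliberately limited: it covers only backslashes and double quotes in single-line text.

```shell
# Hypothetical helpers; not part of the API or the mppx CLI.
MAX_TEXT_CHARS=10000  # documented limit for "text"

# Escape backslashes and double quotes for use inside a JSON string.
# Assumption: input is a single line with no control characters.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# $1 = text (required), $2 = voice, $3 = temperature (documented range 0-2)
build_request_body() {
  # ${#1} is the argument length; some shells count bytes rather than characters.
  if [ "${#1}" -gt "$MAX_TEXT_CHARS" ]; then
    echo "text exceeds $MAX_TEXT_CHARS characters" >&2
    return 1
  fi
  printf '{"model": "google/gemini-3.1-flash-tts", "text": "%s", "voice": "%s", "temperature": %s}\n' \
    "$(json_escape "$1")" "$(json_escape "$2")" "$3"
}

# Example: print a body with an explicit voice and temperature.
build_request_body 'Say "hello".' '<voice>' 0.7
```

The printed body can then be passed to the quick-start commands, for example as `-d "$(build_request_body ...)"` with curl. For text containing newlines or other control characters, a real JSON encoder would be a safer choice than this sketch.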

Output
audio: Base64-encoded audio data (WAV format)
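
Because the audio arrives Base64-encoded, it has to be decoded before it can be played. The sketch below shows one way to do that in POSIX shell. It rests on assumptions not confirmed here: that the response is a single line of JSON, and that `audio` is a top-level string field. `extract_audio` and the file names are hypothetical. The `-d` flag of `base64` is the GNU coreutils spelling, and some older systems use a different flag.

```shell
# Hypothetical helper; assumes the response is one line of JSON with a
# top-level "audio" string field (the envelope shape is an assumption).
# $1 = file holding the JSON response, $2 = path of the WAV file to write
extract_audio() {
  sed -n 's/.*"audio"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' "$1" \
    | base64 -d > "$2"
}

# Usage (after saving a paid response to response.json):
#   extract_audio response.json speech.wav
```

A dedicated JSON parser would be more robust than the `sed` pattern if the response turns out to be pretty-printed or nested.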