OpenAI TTS

audio

by OpenAI · Updated February 15, 2026

OpenAI's Text-to-Speech (TTS) models convert text into natural-sounding speech in six distinct voices (Alloy, Echo, Fable, Onyx, Nova, Shimmer). Available in standard and HD quality, the models support real-time streaming and 24+ languages, and they produce clear, expressive speech suitable for narration, accessibility, and conversational AI applications.

Best For

Text-to-speech · Real-time streaming · Conversational AI · Accessibility · Multilingual speech

Prompting Tips

1. Choose the right voice: Alloy (neutral), Echo (warm), Fable (storytelling), Onyx (deep), Nova (bright), Shimmer (gentle)
2. Use the HD model for the highest quality and the standard model for lower latency
3. Punctuation controls pacing: use commas, periods, and ellipses deliberately
4. The language is detected automatically from the input text; 24+ languages are supported
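
The tips above can be sketched as a small Python helper that picks a voice and a quality tier and sends the text for synthesis. This is a minimal illustration, not an official client: the endpoint URL, the model identifiers `tts-1` and `tts-1-hd`, and the `OPENAI_API_KEY` environment variable are assumptions not stated on this page, and the helper names `build_speech_request` and `synthesize` are hypothetical.

```python
import json
import os
import urllib.request

# The six voices listed on this page, in lowercase form.
VOICES = ("alloy", "echo", "fable", "onyx", "nova", "shimmer")

# Assumptions: endpoint and model identifiers are not given in this page.
SPEECH_URL = "https://api.openai.com/v1/audio/speech"
STANDARD_MODEL = "tts-1"    # assumed name of the lower-latency tier
HD_MODEL = "tts-1-hd"       # assumed name of the higher-quality tier


def build_speech_request(text, voice="alloy", hd=False):
    """Return the JSON payload for one synthesis request.

    No language field is set: the language is inferred from the text.
    """
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "model": HD_MODEL if hd else STANDARD_MODEL,
        "voice": voice,
        "input": text,
    }


def synthesize(text, out_path, voice="alloy", hd=False):
    """POST the payload and write the returned audio bytes to out_path."""
    payload = json.dumps(build_speech_request(text, voice, hd)).encode("utf-8")
    request = urllib.request.Request(
        SPEECH_URL,
        data=payload,
        headers={
            # Hypothetical setup: the API key is read from the environment.
            "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as f:
        f.write(response.read())
```

A call such as `synthesize("Once upon a time... there was a fox.", "story.mp3", voice="fable", hd=True)` would then request the storytelling voice at HD quality, with the ellipsis and period in the input shaping the pacing. Keeping the payload builder separate from the network call makes the voice and tier choice easy to check without sending a request.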