This guide demonstrates how to deploy a Text-to-Speech (TTS) API using the Zonos model from Zyphra. The API converts input text into spoken audio, leveraging a pre-trained transformer model and speaker embeddings derived from an example audio file. We use Beam’s infrastructure for compute and file output handling.
curl -X POST 'https://app.beam.cloud/endpoint/zonos-tts/v1' \
  -H 'Content-Type: application/json' \
  -H 'Authorization: Bearer {YOUR_AUTH_TOKEN}' \
  -d '{"text": "On Beam run AI workloads anywhere with zero complexity. One line of Python, global GPUs, full control"}'
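The same request can be issued from Python. The sketch below uses only the standard library and mirrors the curl call above: same URL, same two headers, same JSON body with a single `text` field. The helper names `build_tts_request` and `synthesize` are hypothetical, and because this section does not describe the response format, `synthesize` simply returns the raw response body instead of assuming a particular shape.

```python
import json
import urllib.request

# Endpoint URL taken from the curl example above.
ENDPOINT_URL = "https://app.beam.cloud/endpoint/zonos-tts/v1"


def build_tts_request(text: str, auth_token: str, url: str = ENDPOINT_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request equivalent to the curl call.

    Hypothetical helper for illustration; it is not part of any Beam SDK.
    """
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {auth_token}",
        },
    )


def synthesize(text: str, auth_token: str, timeout: float = 120.0) -> bytes:
    """Send the request and return the raw response body.

    The response format is not specified here, so the bytes are returned
    unparsed. The timeout value is an arbitrary assumption.
    """
    request = build_tts_request(text, auth_token)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

To try it, call `synthesize("Hello from Beam", "<your token>")` with a valid auth token and inspect the returned bytes to see how the deployed endpoint reports its output.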