Blog

Tutorials, guides, and product updates from the SpeakEasy team.

·speech-to-text

Translate Audio to English in One API Call

Pass audio in any of 99+ languages, get back English text. No intermediate translation step, no extra API, no extra cost. Just set translate=true.

·speech-to-text

Generate SRT and VTT Subtitle Files Directly from Audio

SpeakEasy is the only affordable speech API that returns SRT and VTT subtitle files natively. No post-processing, no custom formatter — one API call.

·speech-to-text

How to Use the Prompt Parameter to Improve Whisper Transcription Accuracy

The prompt parameter lets you feed Whisper context about your audio — brand names, speaker names, technical vocabulary. Here's how to use it effectively.

·speech-to-text

Async Transcription with Callback URLs: Transcribe Long Audio Without Waiting

Learn how to transcribe long audio files asynchronously using a callback URL. Fire-and-forget transcription for podcasts, meetings, and large video files.

·Python

Python Speech-to-Text API: Transcribe Audio in 5 Lines of Code

A complete guide to using a speech-to-text API in Python. Install the OpenAI SDK, point it at SpeakEasy, and get transcripts with speaker labels, timestamps, and async processing.

·JavaScript

Speech-to-Text API in JavaScript: Complete Guide

Complete guide to using a speech-to-text API in JavaScript and Node.js. Learn file uploads, URL-based transcription, and streaming with the SpeakEasy API.

·Comparison

Best Speech-to-Text APIs in 2026: Compared by Price, Accuracy & Features

We tested 8 leading speech-to-text APIs on accuracy, price, latency, and developer experience. Here's exactly what we found — with real benchmark data.

·Comparison

Looking for a Whisper API Alternative? Try SpeakEasy

Frustrated with OpenAI Whisper API pricing and rate limits? SpeakEasy is a drop-in Whisper API alternative with lower costs, speaker diarization, and full OpenAI SDK compatibility.

·Speech-to-Text

Speaker Diarization: Identify Who Said What in Audio

Learn what speaker diarization is, why it matters, and how to use the SpeakEasy API to automatically identify speakers in audio recordings with code examples.

·Text-to-Speech

Build a Text-to-Speech App in 5 Minutes

A quick guide to generating speech from text using the SpeakEasy text-to-speech API. Covers voice selection, API calls, streaming audio, and word-level timestamps with Python and curl examples.

$1. 50 hours. Both STT and TTS.

Your current speech API provider is charging you too much. Switch in one line of code.

SPEAKY