Get API
Back to Marketplace
🤖
Pay-per-callv2.3.0

TranscribeAI Speech

AI & Machine Learning Infrastructure

TranscribeAI is built on a Whisper-large-v3 fine-tuned ensemble. Supports MP3, WAV, M4A, FLAC, and MP4 up to 2GB. Real-time streaming transcription via WebSocket, batch processing via REST. Returns word-level timestamps, speaker labels, punctuation, and confidence scores.

RESTStreamingReal-timeOpenAPIPython
https://api.transcribeai.io/v2
Trust Score

4.7 / 5

189 dev notes

Live Uptime

99.92%

Guaranteed SLA

Avg Latency

1800ms

Global Edge

API Traffic

4.5M

monthly average

Available Endpoints

POST/transcribeTranscribe an audio file URL
POST/diarizeTranscribe with speaker labels
GET/jobs/:idPoll async transcription job status
WS/streamReal-time streaming transcription
Implementation Example
const res = await fetch('https://api.transcribeai.io/v2/transcribe', {
  method: 'POST',
  headers: { 'X-API-Key': process.env.TRANSCRIBE_KEY, 'Content-Type': 'application/json' },
  body: JSON.stringify({ audio_url: 'https://example.com/meeting.mp3', language: 'en', diarize: true }),
});
const { transcript, speakers } = await res.json();

API Access

✓ FREE TIER ACTIVE

First 300 calls are free every month.

Personal$5/mo
Team$12/mo
Enterprise$39/mo
Technical Docs

API Provider

Priya Sharma

Priya Sharma

NLP researcher shipping language AI via REST APIs.