You can call a phone number and ask an AI to find you the perfect vinyl based on a mood or memory.
We're kicking off our weekly Voice Agent Spotlight with The Record Store Oracle, built by @Gyurmatag, and honestly, the personality in this one sets the bar.
Speak a feeling. An
AssemblyAI
2,724 posts
Access powerful AI models to transcribe and understand speech via a simple API.
Try our no-code playground for free: assemblyai.com/playground
Joined October 2017
- AssemblyAI reposted: AI medical voice agent in production.
  - 14 hours of call volume
  - 38 appointments booked, more captured revenue for practices
  - 11 appointments cancelled -> reduces no-show rate and saves cost on valuable chair time
  Thx @livekit @cartesia @rimelabs @AssemblyAI
- This Wednesday afternoon you could have a working voice agent you built yourself. @dan_aai from @AssemblyAI is running a live workshop. Claude Code + AssemblyAI Voice Agent API, built from scratch in about an hour. Claude Code does the building. You direct it. No coding
- Everything we shipped in May, in 2 minutes. Follow the changelog for more: assemblyai.com/changelog
- Universal-3 Pro just got better across the board. Five upgrades, live now:
  - Code-switching: ~19% relative WER improvement on multilingual benchmarks
  - Disfluencies: ~5.9% WER improvement on verbatim datasets
  - Turnaround time: P50 latency up to 30% faster, P99 up to
- AssemblyAI reposted: Before @AssemblyAI, @DylanJFox was teaching himself ML from textbooks at night. I sat down with Dylan on Skywatch, @getbluejay_ai's car podcast. A few things that stuck with me: STT is not transcription. It is an intelligent listening layer. Nobody using voice AI cares about
- Ryan Johnson's first question about Universal-3 Pro Streaming was "why is it so good?" So @ryanseams showed him, trackside at the Miami Grand Prix, with names, emails, and phone numbers flying and F1 cars passing by. @CallRail chose to partner with AssemblyAI so their team can
- Bad news: yet another Friday with no F1 race on the calendar. Good news: our team was at the Miami GP last weekend putting Universal-3 Pro Streaming through its paces: code switching, numbers, and engine and crowd noise. The conditions were... not ideal. That was the point. See
- Calling multiple LLM providers in production shouldn't mean juggling separate accounts, bills, and rate limits, and one provider outage taking your whole product down with it. Our LLM Gateway just got a significant upgrade so you can: Route across providers with automatic ... One OpenAI-compatible endpoint. Zero markup on provider costs. Same AssemblyAI API key you already have. If you're building voice agents on our STT, there's no extra network hop: speech to LLM to action in one system.
- Ask a research question out loud. Under 60 seconds later, you have a complete, sourced answer. We built a reference architecture with @render using AssemblyAI's Voice Agent API + Render's new Workflows. Core insight: keep the voice channel separate from background
- Today we're shipping a major upgrade to streaming diarization, and it pulls us decisively ahead of the competition on the metrics that matter in production. Head-to-head vs. the competition:
  - 2x better cpWER on 2-speaker telephony
  - 13% better cpWER on 4-speaker meetings
- A voice agent. One prompt. Under 15 minutes. That's what Mart built using the AssemblyAI Voice Agent API and Claude Code, and we captured the whole thing on video. Here's what the build actually looked like: Install the AssemblyAI MCP server -> docs auto-inject into your
- Introducing the Voice Agent API. One WebSocket. Stream audio in, get audio back. We handle the full voice stack so you can focus on your product. Powered by Universal-3 Pro, our speech model built for real-world audio. $4.50/hr. No SDK. Ship today -> assemblyai.com/voice-agent
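
Several posts above quote word error rate (WER) figures such as "~19% relative WER improvement". As a rough illustration of what a number like that means, here is a minimal sketch, assuming the standard word-level edit-distance definition of WER. The function names are hypothetical and this is not AssemblyAI's evaluation code; any inputs you pass are your own, not AssemblyAI results.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words seen so far
    # (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


def relative_improvement(old_wer: float, new_wer: float) -> float:
    """Fraction by which WER dropped, relative to the old WER."""
    if old_wer <= 0:
        raise ValueError("old_wer must be positive")
    return (old_wer - new_wer) / old_wer
```

Under this definition, a model whose WER drops from 10.0% to 8.1% has improved by 1.9 percentage points in absolute terms, which is a 19% relative improvement. The two ways of reporting can differ a lot, so it is worth checking which one a given claim uses.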
