This workflow demonstrates how agentic tools are collapsing the gap between architectural intent and functional deployment, turning complex voice engineering into a high-speed assembly task. It proves that when documentation is seamlessly integrated via MCP, the only remaining bottleneck is the developer's ability to clearly define the problem.
Deep Dive
From Idea to Live Voice Agent in Minutes (Claude Code)
In this video, you'll get a backseat view of how I built this voice agent with our new voice agent API, powered by Claude Code. It was a truly magical experience of putting in one prompt and getting a voice agent out on the other end that was ready for production. I'll show you my tips and tricks so that you'll have everything you need to build great voice AI agents on top of AssemblyAI. Let's dive right in.
So, this week at AssemblyAI is extra exciting because we're releasing our voice agent API. That's a big launch.
What was the one moment during the build that made you realize it actually worked?
>> I mean, for me it was when I was able to go from zero to a fully working voice agent just by using our AssemblyAI MCP server and putting one prompt into Claude Code. So, the whole prototype was built in 20 minutes with a single prompt, just tweaking it afterward.
>> What was the first bug that showed up and made you realize how fast the loop was?
>> I mean, for me it was a bug related to Railway, which is where I'm hosting the app, but I don't want this to show up in my LinkedIn post because it's kind of irrelevant to the voice agent API.
>> You said it's about letting you focus on your expertise. What's the one thing you're most excited to build now that the hard part is gone?
>> Wow, I mean, I have a ton of ideas for voice agents that I'm sitting on, but the one that I'm most excited about is actually this voice agent that I'm using right now.
>> If you were to say this to a friend in one sentence, what would you tell them about why they should try this now?
>> They should try this now because it's never been easier to build a voice agent on top of AssemblyAI's voice AI platform.
So, the first thing I'd recommend is that you install our AssemblyAI MCP server. The full command to install our MCP server is in our docs, and the link is in the description. Once you have that installed, our MCP server will automatically pull our latest docs into your Claude Code session. There's no more need to search our documentation for updated information. All of that gets automatically injected into your Claude Code instance.
All right, so kicking off our demo project, I tell Claude Code to build a voice agent that interviews me, asks me questions, and helps me write better LinkedIn posts. So, that's the big idea of this voice agent that I'm trying to build.
As you can see, I copy and paste some docs links just to make sure that it gets it just right.
And I wanted to use the browser integration page as well.
So, from there I just ask it to build a very simple demo that showcases the API. I'm planning to host this voice agent on Railway, so I mention that from the very start.
So, Claude Code starts churning away at the code, building some front end and back end code as we can see here.
I've sped it up so that we don't wait too long.
But as you can see, in about 10 minutes or so, we have a GitHub repository up and running with our voice agent demo. I go ahead and add it into Railway, which is where I've chosen to host my app.
Now, I'm just looking up what command I need to start running the app.
So, I set the root directory to the front end as well as the back end.
And I insert my AssemblyAI API key.
So, I go on to our dashboard and sign in with Google. Once I have my API key, I insert it into Railway and save, and now we're ready to deploy our app.
From there, I need to add another instance, which will host our front end, and that's the URL we'll connect to in order to see our app and play with it.
So, this one has a root directory of front end.
As you can tell, I've used Railway a couple of times, so I'm kind of familiar with the process, but what I've learned is that Claude Code is actually really good at using the Railway CLI to do a lot of this for you.
And the great thing about our voice agent API is that it's framework agnostic. It's a simple API that works with practically any coding language and any platform. So if you want to host it on Vercel or Railway, or self-host it on your own server, anything works because it's just so simple and so easy to deploy.
So, here I'm setting the back end URL with the Railway service, but eventually I figure out that Claude Code can just do it for me.
And after a little more wrangling and finagling of Railway, any second now I should be live with my voice agent app. All right, so there's our app, and as we hit start interview, we bump into yet another error: failed to fetch.
So, this was because I think on the front end we set the wrong back end URL to call for a token.
So, for context, if you're deploying the voice agent in the browser, you can make a request to your back end for a token that you can use to authenticate your session on the front end.
That way, we're able to start the voice agent on the client side without having to expose your API key.
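The token flow described here can be sketched roughly as follows. This is a minimal illustration, not the code from the video: the endpoint URL, the `Authorization` header usage, and the `token` response field are all assumptions, and the real values are in the AssemblyAI docs. The transport is passed in as a function so the logic can be exercised without a network call.

```python
import json
from typing import Callable, Mapping, Tuple

# Hypothetical placeholder; the real token endpoint is in the AssemblyAI docs.
TOKEN_ENDPOINT = "https://example.invalid/voice-agent/token"

# A transport takes (url, headers) and returns the raw response body as text.
Transport = Callable[[str, Mapping[str, str]], str]


def issue_session_token(api_key: str, transport: Transport) -> str:
    """Exchange the server-side API key for a short-lived session token."""
    if not api_key:
        raise ValueError("API key is not configured on the back end")
    body = transport(TOKEN_ENDPOINT, {"Authorization": api_key})
    payload = json.loads(body)
    return payload["token"]  # assumed response field name


def handle_token_request(api_key: str, transport: Transport) -> Tuple[int, str]:
    """Build what the back end route sends to the browser: (status, JSON body)."""
    try:
        token = issue_session_token(api_key, transport)
    except (ValueError, KeyError) as exc:
        # Covers a missing key, malformed JSON, or a missing "token" field.
        return 502, json.dumps({"error": str(exc)})
    # Only the temporary token reaches the client, never the API key.
    return 200, json.dumps({"token": token})
```

The front end would call this route first and then open its session with the returned token. A "failed to fetch" error like the one above is what you would expect to see when the front end points at the wrong URL for this route.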
All right, great. So, here we go. We have our very first voice agent in just a little under 15 minutes. And yeah, it was really exciting to get it up and running so quickly, but we're going to work on some improvements right away now that we see that it's working.
The first thing I'm going to want to do is to style it and make it look a little more like LinkedIn.
Just to nail that design and really get people into the mood of writing something for their social media.
So, I go ahead and tell Claude Code it worked great. Can you make some design improvements and styling changes? We kind of want to make it look like LinkedIn.
So, we'll see fairly quickly that Claude Code is able to design my voice agent to look like LinkedIn and lean into their brand and palette.
And so after we push to Git and refresh, we should slowly see that, yep, it looks like LinkedIn. It's got those colors and the nice off-white background. And maybe we'll just start another interview to see how it works.
Yep, it's working great.
And so, on to the next thing.
So, one of the features of our voice agent API is that we support tool calling right out of the box.
So, the tool that I choose to add to this voice agent is Exa Search, which will basically allow the voice agent to look up sources to back up certain claims that we make in our post, which will make it look a little more informed and backed by research. So, I go ahead and get my Exa API key and insert it into Railway while Claude Code chips away at implementing Exa on the back end.
So, I can see here it's made lots of progress.
We should be getting a voice agent with Exa any time now.
All right, so to test out tool calling, I just started a new session and stated a stat, just to see if the voice agent is smart enough to go and look it up using Exa.
And sure enough, it finds this report by MIT on Medium, which is great.
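To make the tool calling step more concrete, here is a rough sketch of how a back end might route a tool call from the agent to a local function. The message shape (`name` plus a JSON string of `arguments`), the `web_search` tool name, and its stubbed body are all hypothetical; the actual schema is defined by the voice agent API docs, and a real implementation would call the search provider (Exa, in the video) where the stub is.

```python
import json
from typing import Any, Callable, Dict

ToolFn = Callable[..., Any]

# Registry mapping tool names to the Python functions that implement them.
TOOLS: Dict[str, ToolFn] = {}


def tool(name: str) -> Callable[[ToolFn], ToolFn]:
    """Decorator that registers a function under a tool name."""
    def register(fn: ToolFn) -> ToolFn:
        TOOLS[name] = fn
        return fn
    return register


@tool("web_search")
def web_search(query: str) -> dict:
    # Stub: a real back end would call the search provider here.
    return {"query": query, "results": []}


def handle_tool_call(message: dict) -> dict:
    """Dispatch one tool call and return either a result or an error."""
    name = message.get("name")
    fn = TOOLS.get(name)
    if fn is None:
        return {"name": name, "error": f"unknown tool: {name}"}
    try:
        args = json.loads(message.get("arguments") or "{}")
        return {"name": name, "result": fn(**args)}
    except (TypeError, ValueError) as exc:
        # Bad JSON or arguments that do not match the function signature.
        return {"name": name, "error": str(exc)}
```

Returning an error object instead of raising lets the agent tell the user that a lookup failed and keep the conversation going.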
Lastly, we're going to add some different voices to the voice agent. So, AssemblyAI supports a wide range of voices, with more coming soon.
And I want to let the user pick the voice they want for their session.
So, what we'll do is we'll just add a drop down with all of the choices for voices, with some sound defaults, and in no time you should see that live as well.
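One way to back a voice drop down with a sound default is sketched below. The voice IDs are made-up placeholders, not real AssemblyAI voice names; the actual list would come from the docs.

```python
from typing import Optional, Sequence

# Hypothetical voice IDs, used only for illustration.
AVAILABLE_VOICES = ("voice-a", "voice-b", "voice-c")
DEFAULT_VOICE = "voice-a"


def resolve_voice(
    requested: Optional[str],
    available: Sequence[str] = AVAILABLE_VOICES,
    default: str = DEFAULT_VOICE,
) -> str:
    """Return the requested voice if it is offered, otherwise the default."""
    if default not in available:
        raise ValueError("default voice must be one of the available voices")
    if requested is None:
        return default
    candidate = requested.strip()
    return candidate if candidate in available else default
```

Validating the selection on the back end as well as in the drop down means a stale or hand-edited request still starts a session with a working voice.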
If you do end up trying our voice agent API, I'd love to know what your favorite voice is.
Our team put a lot of work into picking great voices.
So, let me know in the comments below if there's any voice that stood out to you. All right, so Claude has gone ahead and made the change, and as you can see, here's our full list of voices that are available.
I just hit start interview with a new voice and there we go. There we have a fully fleshed out voice AI agent in, you know, very little time.
>> Okay, the draft is on screen now. Want to tweak the angle or zoom in on a different moment?
>> No, this looks great. Thank you so much.
>> You're welcome. Enjoy launching the API.
>> So, on that note, that concludes our voice agent build-along today. I hope you enjoy what we were able to put together, and let me know if you do end up trying our voice agent API. I'm really excited to know what you think, and if you have any questions, don't be afraid to reach out to our team in the comments below or on our website. Our team is always happy to help. So, until the next one, I'll see you. Bye.
>> Woo!