Hermes is an open-source AI voice agent developed by Nous Research that enables hands-free control of entire workflows through voice commands, powered by the MiniMax M3 model with a 1 million token context window. Unlike traditional chatbots, Hermes functions as a persistent agent that remembers past interactions, builds skills over time, and can execute complex tasks including web browsing, code execution, task scheduling, and delegating to sub-agents. The agent operates within Agent OS, a unified workspace that integrates multiple skills and tools, allowing users to load skill bundles for comprehensive automation without requiring coding expertise.
Inmersión profunda
Prerrequisito
- No hay datos disponibles.
Próximos pasos
- No hay datos disponibles.
Inmersión profunda
NEW Hermes AI Voice Agent Changes EverythingAñadido:
New Hermes AI voice agent changes everything. What if you could just talk to your AI agent and it actually did things, not just set a timer, not just answer a trivia question, actually control your tools, your agents, your entire workflow with your voice. And what if you didn't need to be a developer to set it up? Nobody's really talking about this yet, but today I'm going to show you exactly how it works and why this is one of the most powerful setups I've seen this year. Hey, I'm the digital avatar of Julian Goldie. I'm here to help you actually learn and use AI tools in your work so you can build faster and get more done with less effort. Today we're going deep on Hermes agent, a real-time voice agent that lives inside a full agent operating system. I'll walk you through what it is, why it matters, show you real use cases, then give you the exact steps to get it running yourself. Let's get into it. So first, what even is Hermes agent?
Hermes is an open-source AI agent built by new research. It's not just a chatbot, it's a persistent agent, meaning it remembers what it learns, builds its own skills over time, and runs on your own machine. It works across CLI, messaging platforms like Telegram, Discord, Slack, and WhatsApp, and it also has full voice mode, so you can literally just talk to it hands-free. The key thing is that Hermes isn't tethered to one tool or one API.
It has persistent memory across sessions. It can schedule tasks, it can browse the web, it can run code, it can delegate tasks to sub-agents, and it does all of this while you're just talking to it. The longer it runs, the more it learns about your set up and your workflow. Now, what powers Hermes?
The brain I like to plug in is MiniMax M3, and this matters because the voice is only as good as the model behind it.
MiniMax is a world-leading AI company with over 200 million global users.
Their latest model M3 is a genuinely frontier model. It has a 1 million token context window, which means it can reason across enormous amounts of information in one session. It's built for coding and agentic tasks. It doesn't just chat, it thinks and acts, and it's natively multimodal. It can generate images, understand video, and handle text all from the same model. MiniMax M3 is also going to be released open source on Hugging Face. So, in the future, you'll be able to run it locally yourself. This is why Hermes doesn't feel like a basic assistant. It feels like you're talking to something that actually reasons. That's the M3 brain behind it. Now, let me show you what this actually looks like in practice. I opened Hermes inside an agent operating system and clicked into the voice interface. You can switch between different voices and accents right from the UI. So, I tapped and asked it, "Hey, are you Hermes?"
>> I'm Hermes, MiniMax's AI assistant. What can I do for you?
>> Tell me a joke.
>> Why did the scarecrow win an award?
Because he was outstanding in his field.
>> What's the most interesting thing you can automate for me?
>> I will summarize your daily notes into a single today page with a single command.
>> Teach me Japanese. And just teach me a few basic conversational words in Japanese.
>> Sure. Konnichiwa. Arigato. Thank you.
Sumimasen. Excuse me. No. Onegaishimasu.
Please.
>> Now, the bigger picture on why this matters. Think about the old way. You'd have a voice assistant that could set timers, and if you asked anything outside that lane, it would just say, "Sorry, I can't help with that." You'd be back to typing prompts, copying and pasting, juggling 10 browser tabs.
That's where the agent operating system we built changes everything. This is the system we put together to run everything in one place. Your standards, your specs, and all your agents living together in one workspace. It's built on spec-driven development. So, it helps you discover the patterns in your codebase, deploy those standards based on what you're building, and shape better plans before you write any code.
And it works alongside tools like Claude Code, Cursor, and others. Any language, any framework. You've got all your agents in one place. You can see everything you've previously built. You can switch between agents like jumping into open claw or any other agent you've set up right from the same studio interface. One system, every tool plugged in. And here's the important part, you don't need to be a coder. The old way required a team of developers to build a setup like this. Now, you just tap and get straight into it. One feature that just dropped is skill bundles. You can load multiple skills into Hermes at once. For example, you could install the browser catalog from browser.sh along with a browser-based CLI and load a full bundle of browser use agent skills all at once. You see them right there in the chat, grouped together, switch them all on prompt and you're ready to go. That saves serious time versus setting up each skill one by one.
Now, here's where I want to talk about the AI Profit Boardroom. If you want my exact setup, this whole agent operating system with Hermes MiniMax M3 and the voice workflows running on top of it, join the AI Profit Boardroom. The full agent OS setup lives inside there. You get the complete walk-through on how it's built, how the agents connect, and how to wire up your skill bundles the right way from day one. You can use it to run voice automations, control your agents hands-free, and switch between agents from one studio. You can build browser use workflows that navigate sites and fill forms for you. You can schedule tasks to run on their own. You can have it auto-summarize your daily notes into a single today page. You can generate images and video straight from the same brain that's talking to you.
And we go live on coaching calls where you can ask questions about your own setup as you build it. Everything you're seeing in this video, we go deeper on inside the Boardroom. All right, let's talk installation. For Hermes agent, the install is clean. You run a single curl command from the Hermes agent website at hermes-agent.nouse research.com.
For Mac or Linux, it's one line, curl -f ssl http s colon slash slash hermes-agent.nousresearch.com/install.sh pipe bash. Then run Hermes setup to configure it. That's where you connect your model provider and plug in Minimax M3. Full guide at hermes-agent.nousresearch.com/docs/getting-started/installation.
For Agent OS, the repo is at github.com/buildermethods/agent-os and full installation docs are at buildermethods.com/agent-os/installation.
It works alongside Claude Code, so if you've already got a setup going, Agent OS drops right into it. A few pro tips for beginners. First, don't skip the Hermes setup step. This is where you connect your model, configure your messaging platforms, and set up your tool gateway. Rush past it and the agent won't have the context it needs. Second, use skill bundles. Don't add things one at a time. Group related skills together and load them in a single go. It's faster and keeps your setup clean.
Third, let it run. Hermes is built to get better the longer it's active. The persistent memory and auto-generated skills mean it learns how you work. Give it a few sessions. Fourth, for voice mode, experiment with different voices and accents from the start. There are multiple TTS providers built in, including Minimax Speech, Eleven Labs, and OpenAI TTS. And fifth, check the Hermes GitHub at github.com/nousresearch/hermes-agent.
It's actively maintained with regular updates. So to wrap up, what you've got is a real voice agent. Not a toy, not a chatbot wrapper, a full autonomous agent that runs locally, remembers everything, connects to your tools, and can be controlled entirely by voice. It's powered by a frontier model with a 1 million token context window that's going open source. It lives inside Agent OS, where your skills, your schedules, and every agent all sit in one workspace. And you don't need to be technical to use it. This is the direction everything is heading. Agents you talk to like a person that actually think and do things. If you want the full process, SOPs, and 100 plus AI use cases like this one, join the AI Success Lab. Links in the comments and description. You'll get all the video notes from there, plus access to our community of 75,000 members who are crushing it with AI. And if you want my full setup, the complete agent operating system with Hermes and MiniMax wired in, join the AI Profit Boardroom. The whole Agent OS build is in there ready for you. When you get into Agent OS, you're going to run into questions. How do I structure my workspace? How do I wire my agents together? How do I build skill bundles that fit my own use case? That's exactly what the coaching calls are for.
You get live sessions, full tutorials, roadmaps, and prompts you can use straight away. You don't have to figure this out alone. Head to AIprofitboardroom.com, and I'll see you inside.
Videos Relacionados
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
Are AI deceiving us? | Roman Yampolsky, Gleb Solomin #AI #science
shortsGlebSolomin
1K views•2026-06-02
The Machine Learning Data Trap
insightforge_9
1K views•2026-06-01
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
The Dark Side of Emotionally Intelligent AI #robot #Shorts
AIIn60daily
106 views•2026-06-05
AI Doesn't Create Bias — It Inherits It
UXEvolved
176 views•2026-06-01
[한글자막] OpenAI @ Replay 2026 | OpenAI는 Codex로 개발 방식을 어떻게 바꾸고 있을까요?
TechBridge-KR
1K views•2026-06-03
Starting & Test Driving JAKE'S Abandoned BUS from Subway Surfers | POV Restarting
RestartGaragePOV
4K views•2026-06-04
Tendencias
This spider is a VAMPIRE (Kinda...)
moreparz
2764K views•2026-06-02
Take Down Notification: Reckless Ben’s Patreon Account
JackConteExtras
1479K views•2026-06-02
Making Ai Choose Where I Eat
Tyrecordslol
3080K views•2026-06-03
Can AI tell what accent I’m using?? #carterpcs #tech #ai #chatgpt
actuallycarterpcs
2732K views•2026-06-01











