AI assistants are evolving from reactive chatbots that respond to prompts into proactive agents that can connect to multiple applications (Gmail, Docs, Calendar, Drive, Search), handle complex workflows autonomously, and perform tasks in the background without requiring explicit user commands. This shift represents a fundamental change in how AI systems interact with users, moving from simple Q&A interactions to intelligent task execution across digital ecosystems.
Inmersión profunda
Prerrequisito
- No hay datos disponibles.
Próximos pasos
- No hay datos disponibles.
Inmersión profunda
Google’s New AI Is The OpenClaw KillerAñadido:
Google is testing Remy, a new 247 Gemini agent that looks like a direct response to OpenClaw, the viral AI that can act on your behalf. Remy can connect to Gmail, Docs, Calendar, Drive, and Search, and handle real tasks in the background. At the same time, a new Gemini 3.2 2 flash model just surfaced with stronger coding, 3D simulations, animation, and design skills. And Google also dropped a major speed upgrade for Gemma 4 that can make AI responses up to three times faster. On the other side, OpenAI just rolled out GPT 5.5 instant as the new default chat GPT model with fewer mistakes, better reasoning, and deeper personalization. And Anthropic is preparing Orbit, a proactive claude assistant that can brief you across Gmail, Slack, GitHub, Figma, Calendar, and Drive before you even ask. So, let's start with the biggest piece because this one is different. Internally at Google, employees are testing a new AI agent called Remy. And this isn't just another feature inside Gemini. It's being described as a 247 personal agent that can actually take actions on your behalf. That wording matters because it moves the system away from something that responds to prompts and into something that actively does things for you. Remy is running inside a staffonly version of the Gemini app right now and it's deeply integrated across Google's entire ecosystem. So we're talking Gmail, Docs, Calendar, Drive, Search, all of it. And the idea is simple on the surface, though the execution is where it gets interesting. Instead of asking the model to help you with tasks, Remy monitors what matters to you, handles complex workflows proactively, and learns your preferences over time. So instead of opening your email, sorting messages, replying, scheduling something, then jumping into docs to write something, and then maybe doing research in search, the agent handles that flow in the background. It acts more like a digital executive assistant than a chatbot. The internal description literally says it elevates the Gemini app into a true assistant that can take actions on your behalf, not just answer questions or generate content. That's a pretty clear shift in positioning. And Google employees are already testing it internally, which is what they call a dog fooding phase. That's standard in tech, where internal teams use the product before it ever reaches the public. Right now, there's no confirmed release timeline, which usually means they're still refining behavior and reliability, especially for something this autonomous. What's interesting is how far this goes compared to what's already out there. Google already rolled out things like agent mode inside Gemini, where the system can handle multi-step tasks, though access depends on your subscription tier and region.
Remy goes further. It's designed to operate continuously, not just when you ask it to do something. And that puts it directly in competition with tools like OpenClaw, which went viral earlier this year. OpenClaw gained attention because it could actually perform tasks like responding to messages or conducting research autonomously, not just assist with them. And it made enough noise that OpenAI ended up hiring its creator back in February. Remy clearly follows that direction. Though Google has one major advantage here, integration because they control the entire ecosystem. They can plug this agent into everything from your calendar events to your documents to your inbox. That gives them a real edge when it comes to everyday productivity. There are also smaller details that hint at how Google sees this system. The name Remy itself might come from the Latin Regius, meaning orsman or rower, which kind of fits the idea of something doing the work for you in the background. It could also be a reference to the rat chef from Ratatouille, which again fits the concept of a hidden assistant running things behind the scenes. And timing wise, this is all lining up with Google IO 2026, which is happening between May 19th and May 29th at the Shoreline Amphitheater in Mountain View. That event is expected to focus heavily on AI breakthroughs, especially around Gemini and Android. If Remy is anywhere close to ready, that's where it would show up.
And speaking of AI doing real work, today's video is sponsored by Higsfield, and this actually fits perfectly with the whole agentic AI direction we're talking about. Higsfield just launched Marketing Studio, a new end-to-end AI ad workflow powered by Cedence 2.0. And the idea is pretty simple. You bring in a product image or paste a product link.
Choose your own avatar or generate one inside the platform. And Marketing Studio turns that into readytouse video ads. The crazy part is that it doesn't just make one random clip. It can generate ads across nine different formats including UGC tutorials, unboxing videos, product reviews, TV style spots, hypermotion ads, virtual tryons, and more. So, instead of hiring a creator, writing a script, booking production, editing footage, and testing only a few angles, you can build a full ad campaign from one workflow. Hello.
Look at our new Science Revolution YouTube channel. That's why Higsfield describes it as replacing an entire $50,000 production pipeline in a single session. one prompt, your entire campaign. For e-commerce, drop shipping, Shopify, Amazon FBA, Tik Tok shop, SAS launches, courses, or even AI marketing agencies. This is really useful because you can test way more creatives without spending weeks or thousands of dollars on production.
>> What science YouTube channel would you actually recommend?
>> Science Revolution. Physics, biology, chemistry, materials, space. No hype.
just real science explained so anyone can actually understand it. Paste a product link, generate multiple ad angles, see what works, and move faster.
So, if you want to try Higsfield Marketing Studio, check the link in the description and pinned comment. All right, back to the video. Now, at the same time as this agent work, something else leaked out and it gives a pretty clear look at what's happening on the model side. Gemini 3.2 2 flash showed up on the Aluther AI arena, which is basically an external testing platform where models get evaluated under real world conditions. That's important because it means Google isn't just testing internally. They're putting the model in environments where it can be compared directly against competitors.
And this version of Gemini looks like a significant upgrade over the current Gemini 3 flash that's available in AI Studio. The improvements are pretty technical, though they translate directly into practical capabilities.
The model shows stronger performance in SVG generation, which means it can create detailed vector graphics with high precision. It also has improved coding abilities, including the ability to generate complex code for interactive 3D environments, things like voxalbased simulations and dynamic systems. Then there's animation processing which has been upgraded to handle smoother transitions and more dynamic outputs.
That matters for anything involving video, interactive content or even UI design. And the responsiveness of the model in interactive scenarios has improved as well. So it can handle tasks that require realtime feedback more effectively. The reason Google is using platforms like Aluther AI Arena is to stress test the model. These environments expose weaknesses faster, especially when the model is pushed across different types of tasks. It also allows Google to benchmark directly against other systems in a more transparent way. From what's been seen so far, Gemini 3.2 Flash isn't just a small iteration. It looks like a more capable system that's being prepared for broader deployment, possibly tied into upcoming announcements. Then there's another piece that doesn't get as much attention, though it's actually one of the most important upgrades happening under the hood. Google released something called multi-token prediction or MTP drafters for the Gemma 4 model family. And this directly targets one of the biggest bottlenecks in large language models, which is inference speed. Right now, most models generate text one token at a time. That means for every word or fragment of a word, the system has to load massive amounts of data from memory into compute units.
This process is memory bandwidth limited, not compute limited, which means the system spends more time moving data around than actually doing calculations. That's why even powerful models can feel slow in real world usage. MTP changes that by using a speculative decoding approach. Instead of generating one token at a time, a smaller, faster model called the drafter predicts multiple tokens ahead. Then a larger, more accurate model verifies those tokens in a single pass. So in practice, the drafter might generate a sequence of tokens very quickly and the main model checks them all at once. If they're correct, the system accepts the entire sequence and even generates one additional token in the same step. That means you're effectively getting multiple tokens generated in the time it would normally take to produce one. And because the final verification still comes from the main model, there's no loss in quality or accuracy. It's a lossless speed improvement. Google claims this can deliver up to three times faster inference speeds, which is a massive gain, especially for production systems. There are also some deeper optimizations here. The drafter models share the same KV cache as the main model, which means they don't need to recomputee attention states. That saves time and reduces redundant processing. For edge devices like mobile hardware, Google added clustering techniques in the embedder layer to speed up the final step where the model converts internal representations into actual word probabilities. That's one of the slowest parts of the process on limited hardware. So optimizing it makes a big difference. Even hardware specific improvements show up here. For example, on Apple silicon, increasing batch sizes can unlock up to around 2.2 times speed improvements, and similar gains are seen on Nvidia A100 GPUs. So, this isn't just about faster text generation. It's about making these models usable at scale across different types of devices. And while Google is pushing all of this forward, Open AI is making a different kind of move, one that focuses more on the user experience. They just rolled out GPT 5.5 Instant as the new default model in chat GPT, replacing GPT 5.3 Instant. And this matters because it affects the highest volume model, the one used by hundreds of millions of people for everyday tasks. The focus here is clarity, speed, and accuracy.
GPT 5.5 Instant produces 52.5% fewer hallucinated claims compared to the previous version. And it reduces inaccurate claims by 37.3% on difficult conversations. That's a big improvement, especially in areas like medicine, law, and finance where accuracy matters more than anything else. The model also improves performance in visual reasoning, math, science, coding, and image analysis. So, it's not just faster, it's more reliable across a wide range of tasks. And then there's personalization, which is becoming a major focus. GPT 5.5 Instant can use context from past chats, uploaded files, and connected Gmail accounts to deliver more tailored responses. It also introduces memory transparency where users can see which past interactions influenced a response and manage that data. Now, there's one more piece and it's coming from Anthropic. They're working on Orbit and it is still unreleased, though it has started showing up inside newer Clawude web and mobile builds. For now, it appears mostly as a settings toggle, which usually means the feature is being staged before launch. Orbit is a proactive briefing tool for Claude Co-work and Claude Code. Instead of waiting for you to ask what's going on, it prepares useful updates for you automatically. And the connectors are the important part here. Orbit is expected to pull from Gmail, Slack, GitHub, Calendar, Drive, and Figma. So, it's not just email summary tool. It's built around the daily workflow of people who write code, manage projects, design products, and work across teams.
That changes the use case. With Orbit, Claude could brief you on what changed in a GitHub repo, what people discussed in Slack, which design updates happened in Figma, what meetings are coming up, and which emails actually matter. All of that can be turned into a short personalized briefing based on your time zone and connected apps. That makes it different from a normal chatbot. You don't open it just to ask a question.
It's more like a work radar running in the background. And the timing is interesting, too. Anthropic Code with Claude conference starts in San Francisco on May 6th with London on May 19th and Tokyo on June 10th. So, Orbit could either get a quiet roll out or a formal reveal around that event. That's the direction everything is moving in right now. Also, if you want more content around science, space, and advanced tech, we've launched a separate channel for that. Links in the description. Go check it out. Anyway, that's it for this one. Let me know what you think about Remy and where this is all heading. Thanks for watching and I'll catch you in the next one.
Videos Relacionados
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 views•2026-05-29
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
This computer is made from real human brain cells. And you can buy it.
Talktmsmedia
3K views•2026-05-28
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 views•2026-05-29
3D Platformer Update - NO CAPES
SolarLune
294 views•2026-05-30











