Space Agent successfully breaks the "chat-bubble" prison by turning the browser into a local, programmable execution engine. It’s a sophisticated step toward a future where AI is an integrated workspace rather than just a remote service.
Deep Dive
Voraussetzung
- Keine Daten verfügbar.
Nächste Schritte
- Keine Daten verfügbar.
Deep Dive
You've never seen AI Agent like THISHinzugefügt:
This right here is the space agent and it's unlike any other agent you have ever seen. It can completely change the way you work with AI agents. My name is Yan. I am the developer of Agent Zero.
And today I'm here to present our newest project made by Agent Zero, the Space Agent. In this video, I will show you what Space Agent actually is and what it can do, how it operates internally, and also why is it the fastest and most efficient agent out there. Let's start with the important part. Space Agent, just like Agent Zero, is completely free and open source. You can use it for whatever you want. You can change it into whatever you want. It's up to you.
Now, what makes Space Agent really different is this part right here. The agent lives in the front end. So, the agent runs in a browser, but I don't mean it has a web UI like the other agent, and the agent runs on a back end somewhere. This agent actually lives and executes inside the browser's JavaScript runtime. All of the framework, everything you see, everything the agent can do actually executes client side in the front end layer in JavaScript. This comes with some security and privacy benefits. For example, even the inference can be done in the browser using WebGPU. But the main reason to do it this way is this. The spaces spaces are a way to let the agent modify the runtime environment as it needs to.
If the agent needs to show you something, it can simply draw it on screen because the agent lives and operates in the front end layer. So if you want a crypto dashboard with ticker prices, charts, and news, the agent can draw it for you on the screen in real time. or maybe a daily news overview with a list of news. You can choose different sources. You see the top news of the day and weather dashboard. Even games can be created by the agent from scratch in the browser in real time. The way spaces work is that the agent writes a renderer function for individual widgets once and this function can then be used to recreate the widget anytime on this infinite grid. Meaning even if I refresh the page, my widgets stay in place in the same shape they were before and I am not losing any changes made to the DOM. Here, for example, we have a YouTube player with a playlist of agent zero videos.
>> The possibilities of spaces are endless.
You are only limited by limitations of the browser runtime and your imagination. You can use it to create full applications or even systems. Now, for the agent itself, it's this little guy here. And it works like a standard chatbot. So, you can greet it. It will respond. You can switch between compact and full mode. Seeing the history, you can drag it around, you can hide it if you need more space for some of the bigger screens. So, now let's try it out. open a new space and create a analog clock there.
So the agent has navigated to a new space and now it's writing code for an analog clock. We can inspect everything here. We can see the code the agent is writing. It has already executed. It's done. And we have an analog clock here.
Now let's extend it a little. Let's make it a time zone check. So, we should have four clocks now. One for Tokyo, Rome, London, and I think New York was the last one.
I wanted the agent to edit the original one, but I actually didn't tell it to do so. So, yeah, it's a valid result. I can remove this widget now. I don't need it anymore. And let's do one more simple edit.
remove the background from it.
Okay. And here we see it doesn't wrap properly. So we can tell the agent make it wrap.
Much better.
I'm of course aware that checking time is not that impressive, but I wanted to demonstrate how the agent operates, how it can edit existing widgets and I wanted to show how the message loop actually works in space agent because that's another big benefit of this agent. We can take a look into the message history here and see how token efficient and how very simple the communication loop actually is. So this is my first actual message to the agent. Open a new space and create an analog clock there.
It only added 13 tokens into the full conversation.
And all the agent had to do to get into the space took 24 tokens because the agent doesn't use any tool calling. It doesn't use structured output. It responds in plain text. So the agent starts his response in a plain text.
This is the chat bubble. I can see this is the response that gets into the chat window. And then whenever it's followed by these two tokens, these are actually just two tokens. One, two. Anything after JavaScript automatically executes in the browser. So the agent used the API we have prepared to create a new space titled analog clock.
This is the response the agent got.
Again, we use YAML instead of JSON to save tokens.
We only inform the agent that the execution was successful. And this is the result. Then the agent spends 20 tokens to navigate to the space.
I'm not sure yet whether this could be done in one step or not. Maybe we can improve further.
And then he writes the widget code just like this telling me rendering the analog clock analog clock widget now and following with the JavaScript of the widget render. And as you can see it didn't take much code to render an analog clock.
This is the second widget being rendered. And here we can see the edits.
I told the agent to remove the background and all it took was 61 tokens because the agent only responded what it's doing and then used edit function to selectively replace the background.
So everything we just did is just 16,000 tokens but almost 9,000 is the system prompt. So 7,000 tokens is all it took to create and edit these widgets.
Another big strength of the space agent apart from spaces is its ability to browse the web. So I can tell it go to Google and just like that the agent opens the Google window. I'm sorry that this is not English, but it's it's probably using my IP address to determine the language. So, I can just tell the agent accept cookies and it should have no trouble identifying the button and pressing it. Now I can tell it search for agent zero and open the official web.
And just like that, this is our site.
Accept cookies again.
And here we are. Now let's give it something that will require multiple steps.
Now navigate to the GitHub repo and find the oldest release there.
The oldest release is version 0.2 which is correct. And the best part about this is that I can interact with the browser in real time just like the agent does. So if there is a capture, if there is a login screen or whatever, I can do it and tell the agent continue from here. Also, let me show you how efficient the web browsing in space agent is. If I go to the history again, now we only have 22,000 tokens.
If I'm correct, previously it was 16,000. So all this web navigation added just 6,000 more tokens. And that is because we don't leave we don't leave any leftover data in the history.
When a website is opened and we parse it, we show it to the agent like this.
We basically transcribe the whole DOM tree into a list of images, links, text, etc. So the agent can see and can reference anything on the website, can click anything, it can scroll, navigate, etc. But this message lives in the transient space of the prompt. Meaning it's always appended to the very end of the prompt after the last caching break point. So it doesn't break caching and it saves a lot of tokens. Another big advantage of space agent is the customizability and extensibility. The agent lives in a virtual file system that is accessible by the front end. This is my user folder with my spaces being stored here in YAML files.
And the whole framework is designed to be extensible by default. Everything in the framework including the core is built as a module and modules can be added or removed at any time and they can be developed by the agent itself.
The agent is skill-based meaning everything the agent can do lives in the file system in form of skills.md files.
The browser, the spaces, the development documentation, everything is in skills.mmd files. So it's easily extensible and the agent can develop itself further.
Right now we have a few basic functions here like the file browser. We have a testing area for local models if you want to run the AI inference locally in the browser.
It works but requires a beefy GPU.
I don't expect the speed to be great here. And modules like this can be developed by the agent itself at runtime. You don't need to do any changes to a server, restart it or whatever. It can be developed completely on the front end. And the best part is that all the changes are scoped in this framework. Meaning you can have a multi-user system and users can develop inside their home directories completely new functionality without affecting others. Or you can do the same for user groups. If you want multiple users to share the same functionality like an accounting department in a company, you can develop something from them, put it to a group folder and it will be shared across these users. They can have read access or write access. It's up to you how you configure it. This separation also allowed us to implement time travel. Every user folder or every group folder has a G repository automatically created in it. And I can travel back in time and undo any of my changes. Like for example, here somewhere here I created the new space with the analog clock. I can travel back in time before that.
And my analog clock is here containing only the original widget. If I go to the time travel and travel back even more probably somewhere here, it should be gone entirely.
And I can also travel forward back to present.
and my workspace is as it was.
So this way you don't need to worry about breaking anything while developing the space agent itself. You can always fix it simply by traveling back in time or reverting individual changes in your file system or if something really breaks and the page doesn't even render. There's also this admin mode which splits the screen in two.
This one is static. It is built into the firmware of space agent. And this will allow you to browse the files, talk to the agent, and time travel even when the page itself completely breaks for you.
One of the biggest benefits that come naturally from the fact that the agent runs in a browser is that you can run the agent in a browser. So you don't need to install anything. If you want to try space agent now, you can go to our repo and click the try live now button or you can go directly to space-agent.ai and here in your browser you can create a guest account with a single click.
login with another click and you are right in. No installation, no configuration except for setting the API key for your LLM or using the local inference if you have the M machine that can do it. But that's it.
You're right in a working space agent instance. This one is a demo site. These accounts are not permanent. They will be deleted in a few days if you stop using them. So if you want to use space agent persistently and you want to make it your own, you can go back to our GitHub repo and you can download it as a native application for Mac OS, Linux and Windows or if you want to run it in a browser, you can host the back end yourself.
There's a guide for it. There is a very thin backend layer in NodeJS that only serves the files for the front end and manages user permissions and user accounts and low-level stuff like that.
The actual framework and all of its functionality runs on client side in the browser. Obviously, this is just the beginning. We have very little idea what's actually possible with space agent. I believe people will create wild things. If you create something interesting and you want to share, there is a button for it. It will allow you to download and upload a zip. But you can also share it through our website by creating a share link. And you can copy this and share it with your friends. And they can open it in a browser just like that in a sandboxed environment. So it's not in their application, it's not in their environment. So they don't need to worry about malicious code stealing their secrets and they can get into the space just like that to test it. As always, thank you very much for your time and support. Please consider giving us a GitHub star and joining our community on Discord, school, subscribing on YouTube. Everything helps. Thank you very much. See you next time.
Ähnliche Videos
VALORANT's Latest 'Exclusive' Tier Bundle is Rough...
KangaValorant
17K views•2026-05-28
Flight Attendant Mocks Poor Looking Black Woman — Mid Air Announcement Exposes Her Real Power
SkyboundStories-b4r
184 views•2026-05-28
I FIXED My Friend’s Blown Turbo RX-8… Then Sold It
Cameron-RX8
134 views•2026-05-28
NewsWatch 12 at 5: Top Stories
NewsWatch12
1K views•2026-05-28
Simon Jordan & Danny Murphy deliver PREDICTIONS for Arsenal's Champions League FINAL with PSG
talkSPORTArsenal
6K views•2026-05-28
Botting is OUT OF CONTROL in Classic WoW (Again)...
SolheimGaming
108 views•2026-05-28
The "AI Job Apocalypse" is CANCELLED!
WesRoth
9K views•2026-05-28
STREET FIGHTER 6 - INGRID Story Walkthrough @ 4K 60ᶠᵖˢ ✔
RajmanGamingHD
12K views•2026-05-28











