
This Meta-Harness Changes How You Run AI Agents

Added: 2026-06-15

612 views | 52 | 14:09 | engineerprompt | Original Release: 2026-06-15

A meta-harness is an abstraction layer that sits above multiple AI agent harnesses, enabling them to share a single session, history, and policy framework while maintaining their individual capabilities. This architecture addresses the silo problem, in which different AI agents (like Claude Code, Codex, and Pi) cannot see each other's work and the user has to copy and paste between them by hand. The meta-harness provides three key capabilities: composition (agents can be defined as simple YAML files and switched easily), control (enforced gates on tool calls with history-dependent policies for cost and security), and collaboration (real-time shared sessions across devices). This approach allows different agents to work together in a coordinated manner, such as one agent writing code while another reviews it, without being locked into a specific harness implementation.

[00:00:00]Most of us don't use a single AI agent.

[00:00:02]We use multiple of them for different purposes because each one of them has its own capabilities.

[00:00:08]But none of them can see each other.

[00:00:11]You are the one connecting them. Copy, paste, and repeat. Now, every agent is trapped in its own box, but what if they were not? Now, an agent today is basically the model plus the harness. A model on its own just predicts text. A harness is everything wrapped around it that gets the work done. It usually includes an agent loop, tools, memories, and a UI. Codex, Claude Code, Pi, each one is a harness with similar ideas, but very different implementations and different capabilities. Now, line up the agents that you actually use. Here are four different harnesses side by side. Each one has its own memory, its own UI, its own tools.

[00:00:54]And no harness can see the other one. No shared session, no shared history. Now, we usually work simultaneously in most of them, but what if you could put everything under a single roof? Now, if you build agents on top of them, the wall hurts from the other side. When a better model ships, say a new SDK or a stronger harness, to adopt it, you replumb everything you built and the cost climbs. You are basically locked to the layer you started on. But if you look closer at any harness, however different they are on the inside, every one speaks the same language on the outside. There are messages and files in, text and tool calls out. So, it's a great advantage that they have exactly this identical interface. Now, if the interface is identical, you can build one layer over all of them. Take the harnesses you already use and slide one rail underneath them.

[00:01:50]Every harness becomes an interchangeable worker. So, a harness sits over a model and this layer sits over the harness.
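
The "messages and files in, text and tool calls out" idea can be sketched as a single interface that every harness adapter satisfies. This is only an illustration of the concept, not the project's actual API: `HarnessInput`, `HarnessOutput`, `ToolCall`, `Harness`, `EchoHarness`, and `dispatch` are all hypothetical names.

```python
from dataclasses import dataclass, field
from typing import Protocol


@dataclass
class HarnessInput:
    """What goes in: messages and files (path -> contents)."""
    messages: list[str]
    files: dict[str, str] = field(default_factory=dict)


@dataclass
class ToolCall:
    """A tool invocation requested by the harness."""
    name: str
    arguments: dict[str, str]


@dataclass
class HarnessOutput:
    """What comes out: text and tool calls."""
    text: str
    tool_calls: list[ToolCall] = field(default_factory=list)


class Harness(Protocol):
    """Hypothetical uniform interface; any object with this method qualifies."""

    def run(self, request: HarnessInput) -> HarnessOutput: ...


class EchoHarness:
    """Stand-in for a real harness adapter; it just echoes the last message."""

    def __init__(self, name: str) -> None:
        self.name = name

    def run(self, request: HarnessInput) -> HarnessOutput:
        last = request.messages[-1] if request.messages else ""
        return HarnessOutput(text=f"[{self.name}] {last}")


def dispatch(harness: Harness, request: HarnessInput) -> HarnessOutput:
    """The layer above only depends on the interface, not on the harness."""
    return harness.run(request)
```

Because `dispatch` only relies on the shared interface, swapping one worker for another does not change the calling code.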

[00:01:58]We can call this a meta harness. This is exactly what Databricks just open-sourced. They are calling it Omni.

[00:02:06]It's a meta harness for all your AI agents.

[00:02:09]It's Apache 2.0, so you can build on top of it.

[00:02:13]It's one command, and every agent you have runs under one roof.

[00:02:18]They use it internally, so it's battle-tested.

[00:02:22]Under the hood, it has three different pieces.

[00:02:24]On the left, you bring your agents.

[00:02:26]These include proprietary agents like Claude Code, Codex, or your custom agents, which you can set up in the form of a YAML file.

[00:02:36]Then, a runner wraps any of them in one uniform sandbox session.

[00:02:42]A server adds search history, policies, MCPs, skills, and artifacts.

[00:02:47]It's Postgres and deploys everywhere.

[00:02:50]You can run this on Docker, Railway, Fly, or Cloud Sandbox.

[00:02:55]And it exposes that one session everywhere, whether you want to access it through a terminal, web, native app, mobile, or a REST API, which is pretty great, because you can now use the same interface interacting with Codex, Claude Code, Pi, or any agent of your choice.

[00:03:13]Now, because the session lives in the layer, not the tool, there is just one session object, which is your agent, files, and history.

[00:03:21]Every device is just a window onto it.

[00:03:24]You can start in your terminal, continue in the browser, or pick it up on your phone, which is pretty awesome, because you have the same agent, same files, just different interfaces, which are in sync, and you can work from anywhere.
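
A toy model of "one session object, every device is a window onto it" might look like the following. The `Session` and `Client` classes are hypothetical and keep everything in memory; the real system stores sessions on a server, and nothing here reflects its actual schema.

```python
from dataclasses import dataclass, field


@dataclass
class Session:
    """Single source of truth: history and files live here, not in the clients."""
    history: list[tuple[str, str]] = field(default_factory=list)  # (client, message)
    files: dict[str, str] = field(default_factory=dict)


class Client:
    """A window onto a session (terminal, browser, phone); holds no state itself."""

    def __init__(self, name: str, session: Session) -> None:
        self.name = name
        self.session = session

    def send(self, message: str) -> None:
        # Writes go straight to the shared session.
        self.session.history.append((self.name, message))

    def view(self) -> list[tuple[str, str]]:
        # Every client reads the same history.
        return list(self.session.history)


shared = Session()
terminal = Client("terminal", shared)
phone = Client("phone", shared)

terminal.send("start the refactor")
phone.send("also update the README")
```

After these two calls, `terminal.view()` and `phone.view()` return the same two entries, which is the property the transcript describes.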

[00:03:38]Okay, now let's talk about the capabilities. This is an open-source meta harness.

[00:03:43]The beauty is that you can customize it for your own needs, if you want. Let's first talk about what exactly it unlocks. The first one is composition.

[00:03:51]An agent is just a short YAML file which includes a prompt, some tools, and a harness.

[00:03:58]Switching from Claude to Codex is a one-line change.
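
To make the "prompt, some tools, and a harness" description concrete, here is a minimal sketch of an agent definition as a plain record. The transcript does not show the real YAML schema, so the field names (`name`, `prompt`, `tools`, `harness`) and the values are assumptions for illustration only.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class AgentSpec:
    """Hypothetical in-memory form of an agent definition file."""
    name: str
    prompt: str
    tools: tuple[str, ...]
    harness: str


reviewer = AgentSpec(
    name="reviewer",
    prompt="Review the diff for bugs.",
    tools=("read_file", "run_tests"),
    harness="claude",
)

# Switching harnesses touches exactly one field; prompt and tools stay the same.
reviewer_on_codex = replace(reviewer, harness="codex")
```

The point of the sketch is that the harness is just one field among the others, so changing it does not require rewriting the rest of the definition.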

[00:04:03]And you can run several at once as a team.

[00:04:07]Now, agents can even write agents. You can just describe one and it authors the file.

[00:04:13]Now, they ship with two different ready-made agents. The first one is Polly.

[00:04:18]Polly does not write any code. It's the tech lead. It plans and splits the work across coding agents in parallel git worktrees, then routes each diff to a reviewer from a different vendor than the one that wrote the code.

[00:04:33]So, say Claude's code is reviewed by Codex, and Codex's code is reviewed by Claude. And when you're happy with the results, you just merge it. So, cross-vendor review only works above the harness.

[00:04:46]Now, this planner, executor, and reviewer or verifier pattern is extremely important. Especially, you don't want the same agent that wrote the code to review its code because it has internal biases.
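
The cross-vendor routing rule can be sketched in a few lines. This is not the orchestrator's real logic, only the rule as described: pick a reviewer whose vendor differs from the author's. The function name `pick_reviewer` and the reviewer and vendor labels are hypothetical.

```python
def pick_reviewer(author_vendor: str, reviewers: dict[str, str]) -> str:
    """Return the first reviewer whose vendor differs from the author's.

    `reviewers` maps a reviewer name to its vendor.
    """
    for name, vendor in reviewers.items():
        if vendor != author_vendor:
            return name
    raise ValueError(f"no reviewer available from a vendor other than {author_vendor!r}")


reviewers = {
    "claude-reviewer": "anthropic",
    "codex-reviewer": "openai",
}

# Code written on the Anthropic side goes to the OpenAI-side reviewer, and vice versa.
for_claude_code = pick_reviewer("anthropic", reviewers)
for_codex_code = pick_reviewer("openai", reviewers)
```

Raising an error when no other vendor is available is a design choice in this sketch; it keeps a same-vendor review from happening silently.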

[00:05:00]And OmniJade makes it extremely easy.

[00:05:03]Now, the second built-in agent is called Debbie, which basically is a brainstorm partner with two heads.

[00:05:11]So, the two are Claude and GPT. I think you can bring your own as well.

[00:05:15]Every question goes to both at once.

[00:05:19]You will get two answers side by side.

[00:05:22]But here's the fun part. If you type /debate, these are going to critique each other for a few rounds, then converge.
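
The critique-then-converge loop could be sketched roughly like this. It assumes, as a simplification, that "converge" means both sides give the same answer, and it uses stub responders instead of real models; `debate`, `stubborn`, and `flexible` are hypothetical names.

```python
from typing import Callable

# A responder takes the question and the opponent's latest answer, and returns a new answer.
Responder = Callable[[str, str], str]


def debate(question: str, first: Responder, second: Responder,
           max_rounds: int = 3) -> tuple[str, str, int]:
    """Let two responders react to each other until they agree or rounds run out."""
    a = first(question, "")
    b = second(question, "")
    rounds = 0
    while a != b and rounds < max_rounds:
        # Both sides see the other's previous answer before replying.
        a, b = first(question, b), second(question, a)
        rounds += 1
    return a, b, rounds


def stubborn(question: str, opponent_answer: str) -> str:
    # Stub: never changes its position.
    return "use Postgres"


def flexible(question: str, opponent_answer: str) -> str:
    # Stub: adopts the opponent's answer once it has seen one.
    return opponent_answer or "use SQLite"


answer_a, answer_b, rounds_used = debate("Which database?", stubborn, flexible)
```

With these stubs the two sides disagree initially and agree after one round; real models would need a softer notion of agreement than string equality.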

[00:05:30]A lot of people plan with say Codex and then implement with Claude code or the other way around. You could do that. Or if you have to make an architectural decision, this agent can be extremely helpful.

[00:05:44]Okay, the second big unlock that this provides is control. Now, in this case every action passes through a gate: allow, deny, or ask you first.

[00:05:54]Now, the thing is that this is not just a polite request in a prompt. It is enforced on every tool call.

[00:06:01]And because it lives in the layer, the rules can depend on history. This is going to be extremely important, especially if you want to impose cost caps, risk scores, repo and file scopes.
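
A history-dependent gate of this kind can be sketched as follows. The policy shown (a running cost cap plus a list of tools that always require approval) is an assumption chosen for illustration; `Decision`, `ToolCall`, and `Gate` are hypothetical names, not the project's policy API.

```python
from dataclasses import dataclass, field
from enum import Enum


class Decision(Enum):
    ALLOW = "allow"
    DENY = "deny"
    ASK = "ask"


@dataclass
class ToolCall:
    tool: str
    estimated_cost_usd: float


@dataclass
class Gate:
    """Checks every tool call against rules that look at what already happened."""
    cost_cap_usd: float
    ask_first_tools: frozenset[str] = frozenset({"shell"})
    history: list[ToolCall] = field(default_factory=list)

    def spent(self) -> float:
        return sum(call.estimated_cost_usd for call in self.history)

    def check(self, call: ToolCall) -> Decision:
        # History-dependent rule: deny if this call would push spend past the cap.
        if self.spent() + call.estimated_cost_usd > self.cost_cap_usd:
            return Decision.DENY
        # Static rule: some tools always need a human decision first.
        if call.tool in self.ask_first_tools:
            return Decision.ASK
        # Only allowed calls are recorded here; an approved ASK would be
        # recorded by the caller in a fuller implementation.
        self.history.append(call)
        return Decision.ALLOW


gate = Gate(cost_cap_usd=1.0)
first = gate.check(ToolCall("read_file", 0.5))   # within budget
second = gate.check(ToolCall("read_file", 0.75))  # would exceed the cap
third = gate.check(ToolCall("shell", 0.25))       # within budget, but ask first
```

The same `read_file` call is allowed once and denied the second time, which is what "the rules can depend on history" means in practice.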

[00:06:13]Even things like PII scans are built in. Now, this is important, especially if you don't want to have YOLO runs and really want to make sure that there are specific policies that the agents follow. Okay, so how exactly does all of this work? Well, underneath all of this is the OS sandbox.

[00:06:38]So, every agent runs boxed in. It can only touch the files and network you allow. Now, another important feature is that the agents cannot directly read your secret keys.

[00:06:52]The agent actually never sees them. The layer injects them on the way out through an approval proxy. So, even if you're running in YOLO mode, it is going to be a lot safer than just giving the agent access to the keys. Now, the third biggest unlock this provides is collaboration. When your session is live and you're driving it, you can share a link and a teammate can watch the work or even chat with it in real time. So, basically this is co-driving and collaboration. The beauty is that their messages run on your machine.

[00:07:27]Or you can simply fork it and take the conversation your own way. Okay, let me show you a quick demo of how exactly this works in practice. Thanks to Databricks for giving me early access and making this video possible through their sponsorship. In the rest of the video, I'll show you how to set it up and use it locally. All you need to do is just run this command to install the meta harness.

[00:07:51]Now, after installation, the first thing you want to do is to set up this on your local machine.

[00:07:57]Right now, I'm using my Claude Code subscription, Codex subscription, and Pi is using Ollama.

[00:08:04]In each of these cases, you can add your own API keys or use your subscription.

[00:08:11]Then, you can use the coding agent of your choice. So, say you can use the Claude Code harness or the Codex harness.

[00:08:20]Or you can also use some of the built-in agents. They have Polly, which is basically a multi-agent orchestration setup. Now, keep in mind, Omni harness is not a coding harness. It basically enables you to interact with these multiple harnesses directly.

[00:08:37]So, Polly doesn't write code itself. It decomposes your goal into subtasks and delegates each one of them to a subagent running on its own harness and git worktree.

[00:08:49]So, in my case, you can just directly start this orchestrator agent. Now, whenever you start a session, you're going to see that it opens up this web UI along with the actual terminal window.

[00:09:02]So, you can work here in the terminal or in the web UI, and there is even a desktop app.

[00:09:08]The beauty is that all of them are going to be sharing the exact same session.

[00:09:12]To show you a quick example, I'm going to describe a task. Create a single-page web UI that uses the Gemini Nano Banana model for image generation. User provides input in the form of text. The output is going to be an image. Also, add the ability for the user to provide their API key within the UI.

[00:09:38]Now, we can just send this.

[00:09:40]Okay, so on my machine, it wasn't actually able to see the Pi and Claude Code CLIs.

[00:09:46]So, I simply asked it to configure those for me, and it went ahead and configured everything. Which is pretty awesome.

[00:09:53]But more interestingly, you actually see the same conversation happening exactly in the terminal where I started this.

[00:10:01]Right? So, these are different interfaces which are interacting with the exact same session.

[00:10:06]Now, in this case, it's going to use Claude Code to implement things. Then for review, it's going to use Codex.

[00:10:13]And it says that it runs autonomously and will wake me up when it's done.

[00:10:17]Right? So, it seems like the process is running. If we look back, here are basically the agents working under the hood. So, it gives you visibility into what exactly every agent is doing.

[00:10:30]So, right now it's autonomously testing the app. Okay, so it quickly tested the app. Seems to be working.

[00:10:35]Now, on the meta harness side, right now the implementation is done by Claude Code. Then it started the independent verification step. For this, it's using Codex. Now, the interesting thing is that it's going to be only passing on the diffs because there are different worktrees where these agents or harnesses are working independently.

[00:10:58]Now, another feature is that you can just directly interact with a specific agent or harness. Which is pretty neat, right? So, right now Codex is reviewing the code, but you can go and ask Claude Code something.

[00:11:11]Now, here's another browser session that I opened. I see exactly the same processing happening. So, you could just potentially deploy this in the cloud and then share the link from here with your coworker, and they will be able to interact with the exact same session that is running in the cloud. Or if it's via local network, you can have the session running on your machine and your teammates will be able to interact with that.

[00:11:38]So, it's great for collaboration.

[00:11:40]Okay, so a couple of other features I think are going to be very important for everybody who's building with this, especially given the cost of these API-based models is crazy right now. So, you can actually see the session cost.

[00:11:54]It gives you a breakdown of what exactly was done, how many tokens were consumed by each one of these models, but then you can set up different policies.

[00:12:03]And I think this is very important. You can, let's say, limit tool calls for the specific session, or maybe deny PII and other requests, right? So, these are contextual policies that you can set. You can even set access to different tools or connectors, but what I would highly recommend is to set the cost. So, you can have a session cost budget or a per-user daily cost budget. I think this is going to be more and more important for organizations.

[00:12:35]So, just to give you an example, I would say like $10, right?

[00:12:40]And then you can define different thresholds for soft warnings.
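
A session budget with soft-warning thresholds can be sketched as a small function. The threshold fractions (50% and 80%) and the status strings are assumptions for illustration; the transcript only mentions a budget such as $10 and that soft-warning thresholds can be defined.

```python
def budget_status(spent_usd: float, budget_usd: float,
                  soft_thresholds: tuple[float, ...] = (0.5, 0.8)) -> str:
    """Classify current spend against a hard budget and soft warning thresholds."""
    if budget_usd <= 0:
        raise ValueError("budget must be positive")
    fraction = spent_usd / budget_usd
    # Hard limit: at or above the budget, further work is blocked.
    if fraction >= 1.0:
        return "blocked"
    # Soft limits: report the highest threshold that has been passed.
    crossed = [t for t in soft_thresholds if fraction >= t]
    if crossed:
        return f"warning: passed {max(crossed):.0%} of budget"
    return "ok"


# With a $10 session budget:
early = budget_status(2.0, 10.0)
late = budget_status(8.5, 10.0)
over = budget_status(10.0, 10.0)
```

Separating soft warnings from the hard stop lets a user react before the session is cut off.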

[00:12:46]Okay, so here's the app that is running.

[00:12:48]It has a link to the Google AI Studio.

[00:12:50]Now, here was the initial implementation from Claude Code. Then there was an independent review from Codex and you can actually see that it specifically found issues.

[00:13:02]Those were sent back. The implementation was done again, tested again, right? And this is kind of the loop that you want.

[00:13:10]Now, you can write this orchestration logic yourself, but OmniGen ships this with their Polly agent.

[00:13:19]So, here's the final app that it created.

[00:13:22]A picture of a starfish wearing sunglasses jumping with happiness. All right, so we're going to see. This is pretty awesome.

[00:13:32]Okay, there is a lot more to cover, but do check out Omnigen. It's an open source model. I think this meta harness or orchestration layer is going to be very critical, especially when you have these different harnesses designed for custom tasks with different capabilities.

[00:13:49]It's a very awesome project. Still really early days. There might be some tweaks that you'll need, but since this is open source, I think this is going to grow really fast. Again, thanks to Databricks for giving me early access and making this video possible.

[00:14:04]Anyways, I hope you found this video useful. Thanks for watching and as always, see you in the next one.

#prompt engineering #Prompt Engineer #LLMs #AI #artificial Intelligence
