VOID shifts video editing from simple pixel-filling to true causal reasoning, ensuring that removing an object also erases its physical impact on the environment. It is a sophisticated leap toward AI that understands the underlying logic of reality rather than just mimicking its appearance.
Deep Dive
Prerequisite Knowledge
- No data available.
Where to go next
- No data available.
Deep Dive
Netflix Released VOID for Free - Run It Locally and Delete Objects from VideosAdded:
Netflix is famous for canceling your favorite show, but this time they have released something you actually want. An open source AI model that removes objects from videos like they never existed. Let me show you a hands-on demo which is running locally on my system.
So, this is the original video where we will be removing this glass with the help of this model which is called as Void.
As you can see that at the moment inference is running.
Let me quickly show you my VRAM consumption. So, consuming just under 13 gig of VRAM.
While it removes the glass from this video, let me tell you a bit more around this model.
In fact, let me play a video which is going to warm up your heart a bit. There you go.
So, Void stands for video object and interaction deletion. Solves the problem that every existing video editing tool gets wrong almost. When you remove an object from a video today, you get a clean hole where it was, but everything that object touched still behaves like it's there. A ball keeps bouncing after you erase the person who kicked it. Void fixes that. Remove the person and the ball just sits on the ground because a kick never happened. It understands cause and effect and that is the selling point here for this model in my humble opinion.
Not just what an object looks like, but what it does to the world around it.
The way it works is surprisingly elegant and I will tell you all about it in very simple words, but let's go back here to see what is happening. There you go. So, let me play the resultant video.
You see, the glass is no longer there.
How interesting is that? But still that lemon is bouncing off the table. Let me play the original again.
Can we play together? Yeah.
No.
Let me play the original again.
So, you see lemon got dropped and then it is the person is using those sticks.
Let's play it again.
It is bouncing off the table.
Now, these are the sticks which are still not perfect, but again, there is no sign of glass and the lemon drops onto the table instead of just, you know, going in the void and that is the name of the model. Let me quickly show you how you can get it installed to like I have done it and then we will check out the architecture.
So, this is my Ubuntu system. I am using this GPU card Nvidia RTX A6000 with 48 GB of VRAM.
And if you're looking to rent a GPU on very good price, you can also find the link to Mass Compute in video's description with a discount coupon code of 50% for range of GPUs.
Okay, so let's go here. First step, let's get clone the repo and I will drop the link to it in video's description.
Let's install all the prerequisites.
Everything is installed. Now, what I have done, I have just put a Gradio interface on top of their demo. Just a very lightweight one. So, I'm just going to run this and this is what is going to provision this on our local system.
And this is what it looks like as it runs on our local host at port 7860.
Okay, in the next example, what I'm going to do, I'm just going to give it this video where a ball is rolling off the table hitting that duck and then you see that it just rolls off there in that direction.
So, the mask is already there and we need to provide this a ball rolls off the table. So, the prompt which is describing the scene.
And there are a few advanced parameters like inference steps, CFG, which means that how much your prompt the model follows the prompt, but I'm just going to give it the default values. Let's run the Void. It is running. While it runs, let's try to understand what exactly is happening here in this one.
Now, the way Void works is surprisingly elegant. You click on the object you want gone. A vision language model then reasons about everything in the scene that was causally connected to it. What would fall, stop, or change direction without that object present. It encodes all of this into a special mask that guides the video diffusion model telling it exactly what to erase, what to rewrite, and what to leave untouched. The model then generates a physically plausible version of the scene as if the object was never there. And if the result has any shape instability in moving objects, a second refinement pass smooths that out using motion flow from the first result. The whole thing runs on a 5 billion parameter model that Netflix has released completely open source as you can see here and they also have shown some past two refinements for example, look at this. There's a basketball.
There you go.
How good is that? And look at this ukulele one.
You see it falls off and there you go.
Pretty cool.
So, this is a pass two.
Now, I think they have done pretty well. It's not perfect, far from perfect, but a very interesting uh I think model which they have produced. I'm not sure if they are using it any in any of their own series or not.
It is still running and by the way, you can see that it takes a bit of a time.
There you go.
So, around 2 minutes I would say and if I show you uh the terminal, hopefully it is still there. Let me quickly just check out it.
There you go.
2 minutes it has taken.
And I'm not sure if I mentioned it is based on that Cog video model, the base one which we already have covered on the channel, which is another amazing amazing model.
Anyway, let me again first play the original.
There you go.
And now, let me play this resultant one. There you go. So, duck is gone. The ball just rolls off.
And that is what we asked the model to do that there is no duck it removed.
It has kept everything as is. Just look at um the sofa behind it, this table, sorry, [clears throat] the surface.
Now, if you have to nitpick, you can say that, you know, the grain and the texture on the table's surface is not exactly the same.
The model has tried its best.
Um it is a bit smoother than the original one, but still I think pretty good result I would say.
Let's do Let's do one final test.
In the final test, I'm going to give it this video of a marshmallow desert.
It is being flamed.
Let's wait for this original to finish and let me play the resultant one.
So, you see the flame is gone.
But still, you see the marshmallows are being uh you know, they are changing the color from that side.
Pretty good. Of course, you know, there is a this angle is not exactly the same and we can't see that, but still the effort is good.
It's not 100% there yet, but I think already the one this first pass and this first release is quite good. Now, I have really thought hard where where Net Why Netflix built this. Maybe think about it and please share your opinion in the comments, too.
I think every scene with a visible crew member, a stray prop, an unwanted logo, or a continuity mistake that would otherwise cost thousands in reshoots, maybe a model like this can just make it disappear.
The most expensive part of post-production might have just become a few mouse clicks for Netflix.
Uh but let me know what do you think what could be the where Netflix might be using it. That's it. Um please like the video and subscribe and consider becoming a member and please follow me on X if you're looking for AI updates.
Thank you for all the support.
Related Videos
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 views•2026-05-29
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
5 Mind Blowing Omni Uses Cases
PaulJLipsky
1K views•2026-06-02
This computer is made from real human brain cells. And you can buy it.
Talktmsmedia
3K views•2026-05-28
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 views•2026-05-29











