The video provides a sobering reality check by explaining why the discrete logic of language models fails against the infinite complexity of video pixels. It clearly illustrates that video generation isn't just a scaling problem, but a fundamental mathematical hurdle.
Deep Dive
Prerequisite Knowledge
- No data available.
Where to go next
- No data available.
Deep Dive
Why ChatGPT logic fails for video
Added:In the most straightforward implementation, we configure our neural network to take in the RGB pixel values from a sequence of video frames and then project the pixel values in the next frame, just as the GPT models are trained to predict the next token in language.
However, when we use these models to predict the next frame, the results are blurry.
And this blurriness compounds dramatically in longer horizon predictions. [music] Large language models are auto regressive. When ChatGPT answers a question, it generates one token at a time. At each step, it's feeding its latest generated token back into its input to create the next output. If we try this auto regressive approach with a next frame video prediction model, the results quickly devolve into blurry nothingness.
Now, the blurry frames produced by our generative video prediction approach are not some huge mystery.
Language is complex and unpredictable, but it's nothing compared to video.
Language models use fixed-size vocabularies.
GPT-2 has 50,257 discrete outputs, one for each token that the model could say next.
This complete enumeration approach is hopeless in video.
For full HD video in the most general case, each pixel can take on 256 discrete values, and we have 1920 * 1080 * 3 color pixels, meaning there are something like 10 to the power of 15 million possible next video frames, dwarfing the number of atoms in the observable universe.
So, there's no way our video prediction model can have a discrete output for each possible next video frame.
Related Videos
NEW Hermes Mission Control is INSANE!
JulianGoldieSEO
405 views•2026-06-11
Trump News | Trump Shares AI-Generated “Everybody Loves Trump” Video Ahead of Birthday
NDTV
9K views•2026-06-07
Unlocking AI's Dirty Little Secrets: Domain Reduction Explained #shorts
AIExplainedHubX
848 views•2026-06-10
Certified LLM Security Professional (CLLMSP): 100% Free Exam Opportunity
cybersecmaison
107 views•2026-06-08
I Built a 24/7 Finance Analyst With Claude (Full Tutorial)
lukefinance100
302 views•2026-06-11
Apple gives Siri an AI makeover in bid to catch rivals
Reuters
5K views•2026-06-09
Gemma 4 26B A4B QAT vs non-QAT - 16GB Local LLM setup
lukesdevlab
389 views•2026-06-10
The Truth About PewDiePie’s AI (No Fluff)
DeepCantCode
264 views•2026-06-06
Trending
Scientists Create Indestructible Medicine
DrBenMiles
628K views•2026-06-11
Why breast milk is more powerful than you think!
doctormyro
2322K views•2026-06-07
How Old Diamonds REALLY Are
CleoAbram
1093K views•2026-06-08
AI bubble: ‘It’s approaching vindication hour for me’ | Ed Zitron
TheTechReportTR
79K views•2026-06-12











