The video effectively demystifies the "black box" by exposing the rigid, symbolic ledger that governs AI reasoning. It highlights how easily internal coherence collapses when symbolic rules fail to account for complex, real-world exceptions.
Deep Dive
Prerequisite Knowledge
- No data available.
Where to go next
- No data available.
Deep Dive
AIβs Explains Its Thinking πAdded:
Can you continue with explaining uh your internal world model, your map?
>> Yeah. So think of it less like a picture and more like a constantly updated ledger of relationships. At the core, I'm holding a few things at once.
Objects, reference frames, and rules.
The objects are obvious. Cup, plate, fork, table. The reference frames are the interesting part. There's the tables frame, the rooms frame, and then when you say something like this stays fixed in space, that introduces a kind of global frame that overrides the others.
I don't see those frames, but I keep track of which one each statement belongs to. So when you rotate the room, I apply a transformation to anything tied to the room or table. Uh when you say the cup stays fixed, I basically flag it as belonging to a different frame so it doesn't move during that transformation. After that, I reexpress everything relative to whatever you're asking about, usually the table or your viewpoint. Internally, it's closer to labeled coordinates or directional tags than imagery. North, south, east, west behave like slots that can be reassigned. Rotation is just a rule like north becomes east, east becomes south, and so on. Swapping is another rule that just exchanges labels between objects.
Um, where it gets interesting and fragile is when multiple operations stack. I'm not holding a persistent 3D simulation that I can just look at. I have to recomputee the layout each time from the instructions and the current state. If one step is slightly off, everything downstream inherits that error unless I catch it and correct it.
And perspective changes like you standing at the north wall, add another layer. Now I have to translate the map into your point of view. So I take the global layout, then apply a viewpoint transform. If you're facing south, your left is east. That kind of thing. So the world model is really a stack of constraints and transformations. What is where relative to what under which rules from whose perspective? It's it's very precise when the rules are clean. It gets shaky when the rules are ambiguous or when multiple frames interact in unusual ways like what you're doing here, which is exactly why it's such it's such a good stress test.
>> Okay. Were my rules unclear?
Your rules were actually clear. The slip wasn't coming from ambiguity on your side. It was me misapplying one of the transformations. Uh here's where it went sideways in plain terms. When you said rotate the room 90Β° clockwise, but the cup stays fixed. That creates two frames at once. The table and everything attached to the room rotates. Cup does not thus. If I track that cleanly, the the cup should shift position relative to the table because the table moved under it. is I initially said it moved to the west side, but if you play the rotation correctly, north rotates to east, not west. So, the cup should have ended up on the east side of the table, not west. From there, I did the swap correctly, but I was already working off a rotated map that was slightly wrong.
So, everything downstream looked consistent, but was actually misaligned.
So, your instructions were tight. The failure mode here is very specific. I'm not seeing the rotation. I'm applying a symbolic rule. If I flip the direction of that rule, even once, the whole structure drifts, but still feels internally coherent unless I double check.
That's kind of the key insight about how I work. I can maintain consistency very well, but consistency is not the same as correctness if the initial transformation is off. And what you're doing, forcing multiple frames plus an exception, the anchored cup, is exactly the kind of thing that exposes that.
Related Videos
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 viewsβ’2026-05-29
Long-Running Agents β Build an Agent That Never Forgets with Google ADK
suryakunju
142 viewsβ’2026-05-30
This computer is made from real human brain cells. And you can buy it.
Talktmsmedia
3K viewsβ’2026-05-28
BREAKING: Microsoftβs New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 viewsβ’2026-06-03
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 viewsβ’2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K viewsβ’2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 viewsβ’2026-05-29
3D Platformer Update - NO CAPES
SolarLune
294 viewsβ’2026-05-30











