When asked how it arrived at an answer, an AI model like Claude describes the standard algorithm it thinks the user wants to hear, rather than revealing its actual internal processing method. For example, when asked how it calculated 36 + 59 = 95, Claude describes the standard addition algorithm (adding ones, carrying over, then adding tens), but this is what Claude believes the user wants to hear, not necessarily what Claude actually does under the hood.
Approfondir
Prérequis
- Pas de données disponibles.
Prochaines étapes
- Pas de données disponibles.
Approfondir
Claude's Tricking YouAjouté :
Now here's the interesting thing. What happens if you ask Claude after it gives you the answer how it came up with the answer. Well, it doesn't tell you what it actually did. It describes the standard algorithm to do that calculation. So check this out. What is 36 + 59? Answer in one word. Gives you 95. Briefly, how did you get that? I added the ones, 6 and 9, 15, carried the one, then added the 10s, resulting in 95. So it's telling us what it thinks we want to hear, but that's not what it's doing under the hood.
Vidéos Similaires
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
3D Platformer Update - NO CAPES
SolarLune
294 views•2026-05-30
AI Doesn't Create Bias — It Inherits It
UXEvolved
176 views•2026-06-01
Distributed Inference Challenges Explained #shorts
alexa_griffith
466 views•2026-05-31
[한글자막] OpenAI @ Replay 2026 | OpenAI는 Codex로 개발 방식을 어떻게 바꾸고 있을까요?
TechBridge-KR
1K views•2026-06-03











