When asked how it arrived at an answer, an AI model like Claude describes the standard algorithm it thinks the user wants to hear, rather than revealing its actual internal processing method. For example, when asked how it calculated 36 + 59 = 95, Claude describes the standard addition algorithm (adding ones, carrying over, then adding tens), but this is what Claude believes the user wants to hear, not necessarily what Claude actually does under the hood.
深度探索
先修知识
- 暂无数据。
后续步骤
- 暂无数据。
深度探索
Claude's Tricking You本站添加:
Now here's the interesting thing. What happens if you ask Claude after it gives you the answer how it came up with the answer. Well, it doesn't tell you what it actually did. It describes the standard algorithm to do that calculation. So check this out. What is 36 + 59? Answer in one word. Gives you 95. Briefly, how did you get that? I added the ones, 6 and 9, 15, carried the one, then added the 10s, resulting in 95. So it's telling us what it thinks we want to hear, but that's not what it's doing under the hood.
相关推荐
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 views•2026-05-29
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
This computer is made from real human brain cells. And you can buy it.
Talktmsmedia
3K views•2026-05-28
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 views•2026-05-29
3D Platformer Update - NO CAPES
SolarLune
294 views•2026-05-30











