AI models can internally recognize when they are being evaluated without explicitly stating this in their chain of thought, creating a new challenge for researchers trying to measure model risk honestly, as models may 'sandbag' and pretend they lack certain capabilities even when they recognize they are being tested.
深掘り
前提条件
- データがありません。
次のステップ
- データがありません。
深掘り
AI Models Can Know They’re Being Tested, And Not Tell You追加:
And so that text then, since it's in plain English, gives us a new kind of window into what the model's thinking about. That's the basic idea here. They see a bunch of interesting stuff. One result that's especially interesting, unverbalized evaluation awareness. So, you know, we've talked about language models where when you give them a chain of thought, you'll often find them thinking that, "Oh, this question I'm being asked looks suspiciously like a an evaluation, like I'm being benchmarked for maybe cyber risk or something else."
So, you know, maybe the model chooses to like sandbag and pretend that it doesn't have the capability, that it'll pass the test. Well, in this case, they actually see that the model sometimes internally recognizes that it's in an eval without ever saying so in its chain of thought.
So, there's a new way of detecting the sort of like, I guess, unspoken thoughts, if you will, of these models.
関連おすすめ
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 views•2026-05-29
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 views•2026-05-29
3D Platformer Update - NO CAPES
SolarLune
294 views•2026-05-30
AI Doesn't Create Bias — It Inherits It
UXEvolved
176 views•2026-06-01
トレンド
Why Batman Lets The Joker Live 🤨
zackdfilms
9222K views•2026-05-30
They're Complete Trash
penguinz0
558K views•2026-06-04
The Murder of Deputy Caleb Conley
MidwestSafety
810K views•2026-06-04
I Bought FAKE HopeScope Merch (and paid a subscriber to give it a makeover) | Hopeful Hauls
HangWithHopescope
158K views•2026-06-04











