Install our extension to search inside any video instantly.

Mind the Gap (In your Agent Observability) — Amy Boyd & Nitya Narasimhan, Microsoft
Added:

2,569 views59likes1:20:07aiDotEngineerOriginal Release: 2026-05-14

Agent observability requires continuous evaluation and monitoring to address the gap between agent requirements and actual performance, as agents drift over time due to model changes, prompt modifications, and accumulating edge cases. This gap manifests in three key areas: (1) the drift gap, where agents diverge from original requirements; (2) the detection gap, where issues go unnoticed; and (3) the diagnosis gap, where root causes remain unidentified. Effective observability combines tracing (to understand agent execution paths), built-in evaluators (for quality, safety, and agentic metrics like intent resolution and task adherence), and red teaming (adversarial testing to uncover vulnerabilities). The observe skill demonstrates how coding agents can automate this entire loop by generating evaluation datasets, running batch evaluations, optimizing prompts, comparing versions, and rolling back to optimal configurations—all while surfacing failures that developers may not anticipate.

Related Videos

OpenHuman VS Hermes AI: Who Wins?

JulianGoldieSEO

285 views2026-05-29

Long-Running Agents — Build an Agent That Never Forgets with Google ADK

suryakunju

142 views2026-05-30

This computer is made from real human brain cells. And you can buy it.

Talktmsmedia

3K views2026-05-28

BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2

aimmediahouse

122 views2026-06-03

I Made the Same Anime Fight Scene in Every AI Video Generator

NobleGooseAnime

295 views2026-05-30

Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S

cnnnews18

3K views2026-06-01

I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)

AICodingDaily

298 views2026-05-29

3D Platformer Update - NO CAPES

SolarLune

294 views2026-05-30

Trending

The Casino Had Us Guessing All Day

VegasMatt

157K views2026-06-03

The Dancing Plague...

HoodieGuyStories

1730K views2026-05-30

The Fastest Way To Board A Plane 😮

zackdfilms

6504K views2026-05-29

DOOM Runs On Everything...except Neo Geo

ModernVintageGamer

143K views2026-06-01