Instala nuestra extensión para buscar dentro de cualquier video al instante

Before we ship a Claude model, these teams try to break it.
Añadido: 2026-05-29

2,092 vistas953:13claudeLanzamiento original: 2026-05-28

Anthropic’s "Frontier" initiative effectively bridges the gap between laboratory safety and real-world utility through rigorous, customer-led stress testing. This structured feedback loop transforms unpredictable model behavior into a refined, enterprise-ready tool.

[00:00:01]Before a new Claude model ships, a small group of customers is already testing it, breaking it, and shaping what ships with it.

[00:00:11]We sat down to see what they're learning.

[00:00:20]When you get something new from Anthropic, what is the energy like?

[00:00:23]We know a storm's ahead, but there's something exciting about a storm because it's all hands on deck.

[00:00:28]Yeah, it feels like we're moving at the speed of light.

[00:00:31]That's like getting the call and jumping from whatever you're working on.

[00:00:33]We have something new, let's figure out what it's like.

[00:00:36]The moment we get a new model from Anthropic, we realize the grounding has changed.

[00:00:43]What's it like to work at a company that's helping to shape the frontier?

[00:00:47]It's insanely fun.

[00:00:48]All of us are just in learning mode.

[00:00:50]This moment just feels like a generational opportunity for anyone in this industry.

[00:00:54]I feel very lucky and also very responsible.

[00:00:58]We need to continue to push the envelope, continue innovating, being more secure, and making things easier to build with.

[00:01:04]In a way, I love that I can unlock a new class of developers and builders.

[00:01:11]What's the first thing you throw at a new model?

[00:01:13]The very first thing is we will start automated evals just so that they start running in the background.

[00:01:19]One use case that is a pipe dream that's easy to point to as a particularly complex legal task is drafting an S1.

[00:01:26]Now with agentic capabilities where these models can go out and find information that they need, synthesize it, edit documents, we're getting to larger and larger chunks of the S1 that you can just send the model on its way to do.

[00:01:40]Just by swapping in that one model, every question I ever wanted to ask it started getting answered.

[00:01:45]It went from this agent can sometimes answer questions, sometimes get stuck, to, oh, my God, it is answering every question quickly and accurately.

[00:01:54]The dashboard of the testing agent success rate has just increased by, I think it's 20%.

[00:02:01]Things that don't work today are the best sign for, here's what the next models are going to be way better at.

[00:02:07]Seeing evals that have never worked start working and then start working consistently, this model is going to be something special.

[00:02:15]What's it like working with Anthropic?

[00:02:17]It feels like I have a conversation with you almost every other day.

[00:02:21]The engineers on the team, I feel like, are almost on the same team.

[00:02:23]It's less like we're just buying something from you, and more like we build with you.

[00:02:29]We have a very high trust bar that anything you publish is not going to be slop.

[00:02:35]What is one word or phrase that characterizes what it feels like to actually be building at the frontier?

[00:02:40]Dazzling, if that makes sense.

[00:02:43]It can be blinding at times.

[00:02:44]Just the brightness, opportunity, excitement.

[00:02:46]Compounding, we get the latest tools, which leads to our customers getting a better product, which leads to us getting better products.

[00:02:53]You have a big wave under you that is changing the way your user is working and changing the way you are working.

[00:03:02]And you have to keep your balance.

[00:03:04]And you know there are bigger waves coming.

Videos Relacionados

Inteligencia Artificial

OpenHuman VS Hermes AI: Who Wins?

JulianGoldieSEO

285 views•2026-05-29

Inteligencia Artificial

Long-Running Agents — Build an Agent That Never Forgets with Google ADK

suryakunju

142 views•2026-05-30

Inteligencia Artificial

This computer is made from real human brain cells. And you can buy it.

Talktmsmedia

3K views•2026-05-28

Inteligencia Artificial

BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2

aimmediahouse

122 views•2026-06-03

Inteligencia Artificial

I Made the Same Anime Fight Scene in Every AI Video Generator

NobleGooseAnime

295 views•2026-05-30

Inteligencia Artificial

Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S

cnnnews18

3K views•2026-06-01

Inteligencia Artificial

I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)

AICodingDaily

298 views•2026-05-29

Inteligencia Artificial

3D Platformer Update - NO CAPES

SolarLune

294 views•2026-05-30

Tendencias

The Meta AI Hack Is a DISASTER

LowLevelTV

141K views•2026-06-03

Paris is in SHAMBLES right now 😭

H1T1

4053K views•2026-05-31

The Casino Had Us Guessing All Day

VegasMatt

157K views•2026-06-03

The Dancing Plague...

HoodieGuyStories

1730K views•2026-05-30