Install our extension to search inside any video instantly.

How AI Gets Trained to Sound Human #AI #LLM #DeepLearning
Added:

101 views4likes2:15max-techieOriginal Release: 2026-05-11

Large Language Models (LLMs) are trained through two key stages: first, supervised fine-tuning (SFT) uses human-written question-answer pairs to teach the model what helpful responses look like, and second, reinforcement learning from human feedback (RLHF) uses human rankings of outputs combined with PPO optimization to shape the model's behavior based on human preferences, enabling the model to predict text one word at a time while appearing to understand human intent.

Related Videos

Agentforce NOW AMA: Build with React and Salesforce Multi-Framework

SalesforceDevs

490 viewsβ€’2026-05-28

How agent o11y differs from traditional o11y β€” Phil Hetzel, Braintrust

aiDotEngineer

450 viewsβ€’2026-05-28

WEB TECHNOLOGIES UNIT-2 | Degree 4th sem BCOM Computers web technologies unit-2 full explanationπŸ’―βœ…

LearnwithSahera

1K viewsβ€’2026-05-29

More tests are always better? How to use AI to identify tests that bring little value

Alliance4Qualification

335 viewsβ€’2026-05-29

Search Algorithms Explained in 60 Seconds! πŸ€–πŸ’¨

samarthtuliofficial

218 viewsβ€’2026-06-01

People of Game of Thrones using JavaScript DOM

AltCampus

296 viewsβ€’2026-05-30

Introduction to Problem Solving Part - 1 | Lecture 1 | Intermediate DSA

ascensionix

107 viewsβ€’2026-05-29

So What's Odin Lang Even Good For

TechOverTea

131 viewsβ€’2026-06-01

Trending

Revisiting The Cat Cafe For The Final Time

BenGtalks

3195K viewsβ€’2026-05-29

Lil bro is a menace 🀣

NotAirJordan

2037K viewsβ€’2026-05-31

The Casino Had Us Guessing All Day

VegasMatt

157K viewsβ€’2026-06-03

My response to the Police

RecklessBen

1496K viewsβ€’2026-06-01