In gradient descent, the cost function is minimized by subtracting the gradient of the cost with respect to the weights (dJ/dW), multiplied by a scalar, from the current weights. Each such update moves the weights downhill in the cost landscape.
Stop Guessing: Backpropagation to Code (Part 4)
So, how should we change our W's to decrease our cost?
We can now compute dJ/dW, which tells us which way is uphill in our nine-dimensional optimization space.
If we move this way, by adding a scalar times our derivative to all of our weights, our cost will increase.
And if we do the opposite, subtract our gradient from our weights, we will move downhill and reduce our cost.
This simple step downhill is the core of gradient descent and a key part ...
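The uphill and downhill steps described above can be sketched in a few lines of Python. This is a minimal illustration, not the network from the video: the quadratic cost, the `target` values, the scalar of 0.1, and the helper names `cost`, `grad`, and `step` are all assumptions chosen so the example is self-contained. Only the nine weights and the update rule itself (subtract a scalar times dJ/dW) come from the transcript.

```python
# Toy cost, assumed for illustration only: J(w) = sum((w_i - target_i)^2).
# It stands in for the network's real cost so the update rule can be run.
def cost(w, target):
    return sum((wi - ti) ** 2 for wi, ti in zip(w, target))


def grad(w, target):
    # Analytic gradient dJ/dW of the toy cost above.
    return [2.0 * (wi - ti) for wi, ti in zip(w, target)]


def step(w, dJdW, scalar):
    # Gradient descent update: subtract scalar * gradient from every weight.
    # Passing a negative scalar adds the gradient instead (the uphill move).
    return [wi - scalar * gi for wi, gi in zip(w, dJdW)]


target = [0.5] * 9   # hypothetical minimum of the toy cost
w = [0.0] * 9        # nine weights, matching the nine-dimensional space
scalar = 0.1         # assumed step size

dJdW = grad(w, target)
J_start = cost(w, target)
J_uphill = cost(step(w, dJdW, -scalar), target)    # adding the gradient
J_downhill = cost(step(w, dJdW, scalar), target)   # subtracting the gradient

# Repeating the downhill step is gradient descent.
for _ in range(50):
    w = step(w, grad(w, target), scalar)
J_final = cost(w, target)

print(J_uphill > J_start > J_downhill)
print(J_final < J_downhill)
```

With these assumed values, adding the gradient raises the cost above its starting value and subtracting it lowers the cost; repeating the subtraction keeps driving the cost down. On a real cost surface the scalar needs to be small enough for this to hold, since too large a step can overshoot the minimum.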