This tutorial provides a compelling case for the democratization of specialized LLMs, showing how precision-engineered data can extract remarkable utility from a mere 1 billion parameters. It is a pragmatic blueprint for those who value architectural efficiency over the brute force of massive scale.
深掘り
前提条件
- データがありません。
次のステップ
- データがありません。
深掘り
HRM-Text 101 Tutorial追加:
Hey everyone, this is Yasin from Sapion Intelligence. Welcome to HRM text 101.
HRM text is a 1 billion parameter hierarchal reasoning language model. The text variant of HRM built on the same two time scale recurrence that made HRM effective on symbolic reasoning. In this video, I will walk you through downloading the base checkpoint, fine-tuning it on a real task, and evaluating the results end to end on a single GPU. So, let's get started. The code lives on the this GitHub repository.
Uh, everything you need, the training script, the HRM backbone, data tooling, and a fully worked demo config.
The Redmi has the full quick start with environmental setup. I assume your CUDA and PyTorch are ready and skip straight to the interesting parts. Download the checkpoint, prepare the data, fine-tune and evaluate. The pre-trained HRM text base model leaves on hugging face. One command pulls it down.
The checkpoint is in FSTP2 format. You will see FSTP2 epoch directory with the shorted weights and a couple of config files. We'll point pre-train.py at this in the next step. Now let's prepare the data. You'll find tune on spider a text tossql benchmark with a thousand depth questions across 200 databases.
This script does three things. First, it converts the spider training split into few shot JSONL. Each example carries three in context demos from other databases. So, the model learns to ground SQL in whatever schema you hand it.
Second, it tokenizes the JSON with our EP tokenizer about 8.7 million training tokens across 7,000 examples. Third, it packs the tokens into a box. This packing is what lets us hit high GPU utilization with variable length sequences.
And that's it. Data leaves in shared memory ready for training. So now time to finetune one command.
The base checkpoint loads and training begins. We are doing full supervised finetuning on a single H00 global batch size of 4096 5 epox and 7,000 examples that takes around 15 minutes.
Done. The final checkpoint is in / CKPTs/ SFT/ So let's see what we got.
The AIS script runs the fine tuned model on spider defet and scores execution accuracy. Does the generated skill return the right answer when you actually run it against the database?
Across the full def set, the finetune model hits around 62% execution accuracy, up from around 8% for the base checkpoint. So what does it mean in practice? Before finetuning, the base model sometimes doesn't even return SQL.
Here is an answer with the number 254 to what is the total number of singers on harder queries? it falls into schema token loops or outputs JSON L JSON like noise where SQL should be. Sometimes the failure is subtle unnecessary joins that dilute the count or the classic hallucinated column name. After fine-tuning all five of these are correct. You get a model that has actually learned to read schemas and grounding in them. And that's it. You have gone from a fresh checkpoint to a fine-tuned hierarchal reasoning model.
If you want to follow what we are working on or share what we have built with HRM, come find us. Links are all in the description. See you there.
関連おすすめ
Agentforce NOW AMA: Build with React and Salesforce Multi-Framework
SalesforceDevs
490 views•2026-05-28
How agent o11y differs from traditional o11y — Phil Hetzel, Braintrust
aiDotEngineer
450 views•2026-05-28
Re: 🗣️📍theprophedu📍2026 GST 103 CLASS (E-EXAM REVISION)
theprophedu
636 views•2026-06-04
WEB TECHNOLOGIES UNIT-2 | Degree 4th sem BCOM Computers web technologies unit-2 full explanation💯✅
LearnwithSahera
1K views•2026-05-29
More tests are always better? How to use AI to identify tests that bring little value
Alliance4Qualification
335 views•2026-05-29
Search Algorithms Explained in 60 Seconds! 🤖💨
samarthtuliofficial
218 views•2026-06-01
People of Game of Thrones using JavaScript DOM
AltCampus
296 views•2026-05-30
Instagram accounts got PWNed
EricParker
13K views•2026-06-03











