Spaces:
Running
Running
Update app/src/content/chapters/folding/09-learnings.mdx
#4
by nepyope - opened
app/src/content/chapters/folding/09-learnings.mdx
CHANGED
|
@@ -3,11 +3,7 @@ import Sidenote from "../../../components/Sidenote.astro";
|
|
| 3 |
|
| 4 |
## Learnings
|
| 5 |
|
| 6 |
-
|
| 7 |
-
|
| 8 |
-
### What mattered most
|
| 9 |
-
|
| 10 |
-
Beyond the experiment findings above, several practical insights stood out:
|
| 11 |
|
| 12 |
- **Train a reward model.** [SARM](https://huggingface.co/docs/lerobot/sarm) gave us data scoring, advantage conditioning, and curation in one package. We recommend it even for tasks where you think manual filtering would suffice.
|
| 13 |
- **Invest in recording quality early.** More time upfront on clean, consistent recordings pays off more than extra volume.
|
|
@@ -16,7 +12,7 @@ Beyond the experiment findings above, several practical insights stood out:
|
|
| 16 |
|
| 17 |
### For the community: the order of operations
|
| 18 |
|
| 19 |
-
If you're training a policy for a new manipulation task with LeRobot, here's
|
| 20 |
|
| 21 |
1. **Define your task protocol first.** Before collecting a single episode, agree on exactly how the task should be performed.
|
| 22 |
2. **Collect 50–100 clean demonstrations.** Quality over volume. Consistent technique, good camera angles, deliberate motions. This is your foundation, everything else builds on it.
|
|
|
|
| 3 |
|
| 4 |
## Learnings
|
| 5 |
|
| 6 |
+
We also learned a lot from running all these experiments. Several practical insights stood out:
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
|
| 8 |
- **Train a reward model.** [SARM](https://huggingface.co/docs/lerobot/sarm) gave us data scoring, advantage conditioning, and curation in one package. We recommend it even for tasks where you think manual filtering would suffice.
|
| 9 |
- **Invest in recording quality early.** More time upfront on clean, consistent recordings pays off more than extra volume.
|
|
|
|
| 12 |
|
| 13 |
### For the community: the order of operations
|
| 14 |
|
| 15 |
+
If you're training a policy for a new manipulation task with LeRobot, here's what we recommend:
|
| 16 |
|
| 17 |
1. **Define your task protocol first.** Before collecting a single episode, agree on exactly how the task should be performed.
|
| 18 |
2. **Collect 50–100 clean demonstrations.** Quality over volume. Consistent technique, good camera angles, deliberate motions. This is your foundation, everything else builds on it.
|