Robotics
LeRobot
Safetensors
imitation-learning
aloha
diffusion-policy
baseline
LeTau committed on
Commit dfd8420 · verified · 1 Parent(s): ee8ee5d

Upload folder using huggingface_hub

Files changed (1)
  1. README.md +14 -5
README.md CHANGED
@@ -97,12 +97,21 @@ lerobot-eval \
 
 ## Results
 
-| Steps | Episodes | Success Rate | Avg Sum Reward |
-|-------|----------|--------------|----------------|
-| 100K | 10 | 10% | 23.7 |
-| 200K | 10 | 10% | 23.3 |
+| Evaluation | Episodes | Success Rate | Avg Sum Reward |
+|------------|----------|--------------|----------------|
+| Training (100K) | 10 | 10% | 23.7 |
+| Training (200K) | 10 | 10% | 23.3 |
+| Independent | 20 | 10% | 28.3 |
 
-**No improvement from 100K to 200K steps.**
+**Expected success rate: ~10%**
+
+## Detailed Evaluation Results (Independent)
+```
+Sum Rewards: [0.0, 0.0, 253.0, 4.0, 0.0, 0.0, 0.0, 81.0, 21.0, 0.0,
+0.0, 0.0, 0.0, 0.0, 0.0, 207.0, 0.0, 0.0, 0.0, 0.0]
+
+Successes: 2/20 episodes
+```
 
 ## Why Does Diffusion Policy Underperform?
 
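
The "Independent" row added in this commit can be cross-checked against the per-episode values listed in the README. The snippet below is a minimal sketch, not part of the commit or of LeRobot: the variable names are illustrative, and the success count is taken as reported in the README because success cannot be derived from sum rewards alone.

```python
# Per-episode sum rewards copied from the "Detailed Evaluation Results
# (Independent)" block in the README diff.
sum_rewards = [
    0.0, 0.0, 253.0, 4.0, 0.0, 0.0, 0.0, 81.0, 21.0, 0.0,
    0.0, 0.0, 0.0, 0.0, 0.0, 207.0, 0.0, 0.0, 0.0, 0.0,
]

# Success count as reported in the README ("Successes: 2/20 episodes").
# Assumption: it is used here as given, not recomputed from the rewards.
reported_successes = 2

episodes = len(sum_rewards)
avg_sum_reward = sum(sum_rewards) / episodes
success_rate = reported_successes / episodes
zero_reward_episodes = sum(1 for r in sum_rewards if r == 0.0)

print(f"Episodes: {episodes}")
print(f"Avg sum reward: {avg_sum_reward:.1f}")
print(f"Success rate: {success_rate:.0%}")
print(f"Episodes with zero reward: {zero_reward_episodes}")
```

The computed average (28.3) and success rate (10%) match the table row. Most of the average comes from a handful of episodes: 15 of the 20 episodes scored zero.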