|
|
--- |
|
|
title: Trajectory Reviewer |
|
|
emoji: π― |
|
|
colorFrom: blue |
|
|
colorTo: purple |
|
|
sdk: gradio |
|
|
sdk_version: 6.1.0 |
|
|
app_file: app.py |
|
|
pinned: false |
|
|
short_description: Review and filter training data trajectories |
|
|
--- |
|
|
|
|
|
# π― Trajectory Reviewer |
|
|
|
|
|
Quick dataset curation tool for assessing trajectory quality. Designed for rapid keep/remove decisions on training data. |
|
|
|
|
|
## Features |
|
|
|
|
|
- **Quick Decisions**: One-click Keep/Remove/Review buttons |
|
|
- **Issue Tracking**: Flag task mismatches, already completed tasks, and custom issues |
|
|
- **Smart Ledger**: CSV-based tracking with live updates |
|
|
- **Shared Dataset**: Collect evaluations from all users into centralized HF Dataset |
|
|
- **HF Space Sync**: Auto-commits evaluations to Space repository |
|
|
- **Playback Speed**: Adjustable 1x/2x/4x speed for efficient review |
|
|
- **Auto-advance**: Automatically loads next trajectory after save |
|
|
|
|
|
## Usage |
|
|
|
|
|
1. **Load Dataset**: Enter HuggingFace dataset repo and number of samples |
|
|
2. **Review**: Watch video (use 2-3x playback speed for efficiency) |
|
|
3. **Decide**: Click Keep/Remove/Review button |
|
|
4. **Add Details** (optional): Flag issues and add notes for downstream analysis |
|
|
5. **Continue**: App auto-advances to next trajectory |
|
|
|
|
|
## Workflow |
|
|
|
|
|
``` |
|
|
Load 20 samples β Review video β Quick decision β Auto-advance |
|
|
``` |
|
|
|
|
|
**Speed**: ~20-30 seconds per trajectory |
|
|
|
|
|
## Output |
|
|
|
|
|
All evaluations saved to `evaluations.csv`: |
|
|
|
|
|
```csv |
|
|
dataset_repo,config_name,trajectory_id,task,decision,issue_type,notes,timestamp |
|
|
``` |
|
|
|
|
|
Use the **Download** button in the table to export your evaluations. |
|
|
|
|
|
## Tips |
|
|
|
|
|
- Use quick buttons (Keep/Remove/Review) for most trajectories |
|
|
- Add detailed notes only when needed for issue aggregation |
|
|
- Use 2x/4x playback speed buttons for faster review |
|
|
|
|
|
## Data Persistence |
|
|
|
|
|
### Shared HF Dataset |
|
|
|
|
|
**Collect all users' evaluations into one centralized dataset**: |
|
|
|
|
|
1. Create dataset: `huggingface-cli repo create traj-evaluations --type dataset` |
|
|
2. Add to Space secrets: |
|
|
- `HF_TOKEN`: Your HF write token |
|
|
- `EVAL_DATASET_REPO`: `username/traj-evaluations` |
|
|
3. Deploy Space |
|
|
|
|
|
All evaluations automatically append to the shared dataset. |
|
|
|
|
|
### CSV Auto-Commit |
|
|
|
|
|
**On HF Space**: Evaluations CSV automatically commits to the Space's git repository (zero config!) |
|
|
|
|
|
Check the Files tab in your Space to download accumulated evaluations. |
|
|
|
|
|
## Related Tools |
|
|
|
|
|
- **Trajectory Endpoint Labeler**: For precise frame-level endpoint labeling |
|
|
- Use this reviewer first for quality filtering |
|
|
|
|
|
--- |
|
|
|
|
|
Built with Gradio, OpenCV, and HuggingFace Datasets |
|
|
|