traj-eval / README.md
KaushikSid
Rename from Trajectory Evaluator to Trajectory Reviewer
7aa7eba
---
title: Trajectory Reviewer
emoji: 🎯
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 6.1.0
app_file: app.py
pinned: false
short_description: Review and filter training data trajectories
---
# 🎯 Trajectory Reviewer
Quick dataset curation tool for assessing trajectory quality. Designed for rapid keep/remove decisions on training data.
## Features
- **Quick Decisions**: One-click Keep/Remove/Review buttons
- **Issue Tracking**: Flag task mismatches, already completed tasks, and custom issues
- **Smart Ledger**: CSV-based tracking with live updates
- **Shared Dataset**: Collect evaluations from all users into centralized HF Dataset
- **HF Space Sync**: Auto-commits evaluations to Space repository
- **Playback Speed**: Adjustable 1x/2x/4x speed for efficient review
- **Auto-advance**: Automatically loads next trajectory after save
## Usage
1. **Load Dataset**: Enter HuggingFace dataset repo and number of samples
2. **Review**: Watch video (use 2-3x playback speed for efficiency)
3. **Decide**: Click Keep/Remove/Review button
4. **Add Details** (optional): Flag issues and add notes for downstream analysis
5. **Continue**: App auto-advances to next trajectory
## Workflow
```
Load 20 samples β†’ Review video β†’ Quick decision β†’ Auto-advance
```
**Speed**: ~20-30 seconds per trajectory
## Output
All evaluations saved to `evaluations.csv`:
```csv
dataset_repo,config_name,trajectory_id,task,decision,issue_type,notes,timestamp
```
Use the **Download** button in the table to export your evaluations.
## Tips
- Use quick buttons (Keep/Remove/Review) for most trajectories
- Add detailed notes only when needed for issue aggregation
- Use 2x/4x playback speed buttons for faster review
## Data Persistence
### Shared HF Dataset
**Collect all users' evaluations into one centralized dataset**:
1. Create dataset: `huggingface-cli repo create traj-evaluations --type dataset`
2. Add to Space secrets:
- `HF_TOKEN`: Your HF write token
- `EVAL_DATASET_REPO`: `username/traj-evaluations`
3. Deploy Space
All evaluations automatically append to the shared dataset.
### CSV Auto-Commit
**On HF Space**: Evaluations CSV automatically commits to the Space's git repository (zero config!)
Check the Files tab in your Space to download accumulated evaluations.
## Related Tools
- **Trajectory Endpoint Labeler**: For precise frame-level endpoint labeling
- Use this reviewer first for quality filtering
---
Built with Gradio, OpenCV, and HuggingFace Datasets