Spaces:

rewardfm
/

traj-eval

Sleeping

App Files Files Community

traj-eval / README.md

KaushikSid

Rename from Trajectory Evaluator to Trajectory Reviewer

7aa7eba about 1 month ago

preview code

raw

history blame contribute delete

2.51 kB

	---
	title: Trajectory Reviewer
	emoji: 🎯
	colorFrom: blue
	colorTo: purple
	sdk: gradio
	sdk_version: 6.1.0
	app_file: app.py
	pinned: false
	short_description: Review and filter training data trajectories
	---

	# 🎯 Trajectory Reviewer

	Quick dataset curation tool for assessing trajectory quality. Designed for rapid keep/remove decisions on training data.

	## Features

	- Quick Decisions: One-click Keep/Remove/Review buttons
	- Issue Tracking: Flag task mismatches, already completed tasks, and custom issues
	- Smart Ledger: CSV-based tracking with live updates
	- Shared Dataset: Collect evaluations from all users into centralized HF Dataset
	- HF Space Sync: Auto-commits evaluations to Space repository
	- Playback Speed: Adjustable 1x/2x/4x speed for efficient review
	- Auto-advance: Automatically loads next trajectory after save

	## Usage

	1. Load Dataset: Enter HuggingFace dataset repo and number of samples
	2. Review: Watch video (use 2-3x playback speed for efficiency)
	3. Decide: Click Keep/Remove/Review button
	4. Add Details (optional): Flag issues and add notes for downstream analysis
	5. Continue: App auto-advances to next trajectory

	## Workflow

	```
	Load 20 samples → Review video → Quick decision → Auto-advance
	```

	Speed: ~20-30 seconds per trajectory

	## Output

	All evaluations saved to `evaluations.csv`:

	```csv
	dataset_repo,config_name,trajectory_id,task,decision,issue_type,notes,timestamp
	```

	Use the Download button in the table to export your evaluations.

	## Tips

	- Use quick buttons (Keep/Remove/Review) for most trajectories
	- Add detailed notes only when needed for issue aggregation
	- Use 2x/4x playback speed buttons for faster review

	## Data Persistence

	### Shared HF Dataset

	Collect all users' evaluations into one centralized dataset:

	1. Create dataset: `huggingface-cli repo create traj-evaluations --type dataset`
	2. Add to Space secrets:
	- `HF_TOKEN`: Your HF write token
	- `EVAL_DATASET_REPO`: `username/traj-evaluations`
	3. Deploy Space

	All evaluations automatically append to the shared dataset.

	### CSV Auto-Commit

	On HF Space: Evaluations CSV automatically commits to the Space's git repository (zero config!)

	Check the Files tab in your Space to download accumulated evaluations.

	## Related Tools

	- Trajectory Endpoint Labeler: For precise frame-level endpoint labeling
	- Use this reviewer first for quality filtering

	---

	Built with Gradio, OpenCV, and HuggingFace Datasets