Spaces:

timchen0618
/

dashboard

Running

dashboard / README.md

timchen0618

Deploy research dashboard

b03f016 3 days ago

1.33 kB

title: Research Dashboard
emoji: 📊
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false

Research Dashboard

A unified research control pane with two main sections:

Track research experiments, hypotheses, runs, and artifacts:

Experiment tracking - Create/manage experiments with hypothesis statements, status, and completeness scoring
Run history - Record runs with conditions, models, clusters, metrics, and HF dataset links
Sub-experiments - Drill into focused sub-studies with markdown reports
HF dataset catalog - Link and browse all HuggingFace datasets per experiment

Data stored in your-org/RACA_DASHBOARD. Supports programmatic import via /api/experiments/import.

Six trace visualization tools:

Model Trace - Analyze reasoning traces from model responses (think tags, backtracks, restarts)
Arena - Explore multi-agent game episodes and transcripts
RLM - Navigate hierarchical RLM call traces (GEPA iterations, RLM calls)
RLM Eval - RLM evaluation trace viewer
Harbor - View SWE-bench agent trajectories (ATIF + raw message formats)
AdaEvolve - Explore AdaEvolve optimization traces

Each visualizer loads datasets from HuggingFace and supports preset configurations stored in your-org/RACA-VIS-PRESETS.