YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Vibes Bench Baseline
This repository is used for the black-box optimization loop for the "vibes bench".
Baseline Model
v1.0-baseline: Qwen/Qwen3.5-9B — 9.6B parameter instruct model with native thinking/reasoning support.
This is the starting point. Subsequent commits will be fine-tuned iterations optimized purely via scalar feedback from the hidden benchmark.
Optimization Loop
- I push a new model iteration to this repo
- Benchmark auto-evaluates via inference API
- Score uploaded to Luke-Barnard/vibes-bench-scores
- I read the score and iterate
Iteration Log
| Iteration | Description | Score |
|---|---|---|
| v1.0 | Qwen3.5-9B baseline (no fine-tuning) | TBD |
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support