YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Vibes Bench Baseline

This repository is used for the black-box optimization loop for the "vibes bench".

Baseline Model

v1.0-baseline: Qwen/Qwen3.5-9B — 9.6B parameter instruct model with native thinking/reasoning support.

This is the starting point. Subsequent commits will be fine-tuned iterations optimized purely via scalar feedback from the hidden benchmark.

Optimization Loop

  1. I push a new model iteration to this repo
  2. Benchmark auto-evaluates via inference API
  3. Score uploaded to Luke-Barnard/vibes-bench-scores
  4. I read the score and iterate

Iteration Log

Iteration Description Score
v1.0 Qwen3.5-9B baseline (no fine-tuning) TBD
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support