---
title: VLIW Kernel Optimizer
emoji: "⚡"
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 5.0.0
app_file: app.py
pinned: false
license: mit
---

# VLIW Kernel Optimization via Reinforcement Learning

Train a language model to generate optimized VLIW/SIMD kernels using test-time RL training.

## Goal
- **Baseline:** 147,734 cycles
- **Target:** 1,363 cycles (108x speedup)

## How it works
1. Model generates kernel code
2. Simulator evaluates cycle count
3. RL training improves the model based on rewards

## Usage
1. Select a model (Qwen2.5-Coder-7B recommended)
2. Configure training steps (50 recommended)
3. Click "Start Training"
4. Monitor progress - training continues even if you close the browser

## Hardware
Requires A10G GPU (HF Spaces Pro)