A100-CUDA-RL / README.md
William Chen
feat: KernelForge OpenEnv demo β€” Streamlit on Docker
072d3e6
metadata
title: A100 CUDA RL - KernelForge OpenEnv
emoji: πŸ”₯
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
license: mit

KernelForge-OpenEnv: A100 CUDA Kernel RL

Interactive demo for the KernelForge reinforcement learning environment that trains LLMs to write optimized CUDA kernels targeting NVIDIA A100 GPUs.

Features

  • Live Optimization β€” CUDA kernel editor with real-time metrics (compilation, correctness, speedup)
  • Training Monitor β€” Reward progress, success rates, and model performance tracking
  • Hardware Telemetry β€” A100/H100 SM utilization, memory throughput, and feature availability
  • PAC Verification β€” Mathematical correctness verification for graph kernels (WCC)

Architecture

This demo showcases the OpenEnv-compatible RL environment:

  • Discrete milestone reward: {-1, 1, 2, 3} based on correctness and speedup tiers
  • Multi-turn episodes: 3 turns per episode (model sees errors, iterates)
  • Anti-hack checks: 5 Dr. Kernel-inspired runtime checks
  • Task pool: Ops-6K operators + doubleGraph topology-aware tasks

Links