ben-dlwlrma's picture
Update README.md
912195c verified

A newer version of the Gradio SDK is available: 6.15.1

Upgrade
metadata
title: Representation Over Routing Demo
emoji: 🚀
colorFrom: blue
colorTo: indigo
sdk: gradio
sdk_version: 6.14.0
python_version: '3.13'
app_file: app.py
pinned: false
license: cc-by-4.0
short_description: Interactive LunarLander demo for Representation over Routing
models:
  - ben-dlwlrma/Representation-Over-Routing

Representation over Routing Demo

Interactive demo for the preprint "Representation over Routing: Overcoming Surrogate Hacking in Multi-Timescale PPO".

This Space visualizes the four pretrained ablation-stage agents from the associated model repository. The default selection shows Stage 4: Target Decoupling, the proposed method, performing a LunarLander rollout with a fixed deterministic demo seed.

Links

Notes

The paper experiments were conducted on LunarLander-v2. This hosted demo uses LunarLander-v3 for compatibility with current Gymnasium releases, while keeping the same actor architecture and pretrained weights.