Spaces:

Jellyfish042
/

Compression-Lens

Running

App Files Files Community

Compression-Lens / README.md

Jellyfish042

Update README.md

ddd27a5 verified 12 days ago

preview code

raw

history blame contribute delete

2.05 kB

A newer version of the Gradio SDK is available: 6.5.1

Upgrade

metadata

title: Compression-Lens
emoji: 🔬
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
license: mit

UncheatableEval: Qwen3 vs RWKV7 Byte-Level Comparison

Compare the byte-level prediction performance between models.

Features

Byte-level analysis: See exactly where each model performs better or worse
Interactive visualization: Hover over tokens to see detailed predictions
Color-coded comparison:
- 🟢 Green = Qwen3 predicts better (lower loss)
- 🔴 Red = RWKV7 predicts better (lower loss)
Top-10 predictions: View each model's top predictions for every token
Word occurrence linking: See how repeated words are predicted differently

How to Use

Enter or paste your text (max 4000 characters)
Click "Run Comparison"
Explore the interactive visualization
Download the HTML file for offline viewing

Models

Model	Type	Parameters	Architecture
Qwen3-1.7B-Base	Transformer	1.7B	Dense attention
RWKV7-G1C-1.5B	RWKV	1.5B	Linear attention

Technical Details

This tool uses the UncheatableEval framework to:

Tokenize input text with each model's tokenizer
Calculate per-token cross-entropy loss
Map token losses to byte-level losses
Generate interactive HTML visualization

Local Development

# Clone the repository
git clone https://huggingface.co/spaces/YOUR_USERNAME/UncheatableEval-Visualization

# Install dependencies
pip install -r requirements.txt

# Run locally
python app.py

Requirements

CUDA-capable GPU (16GB+ VRAM recommended)
Python 3.10+
See requirements.txt for package dependencies

License

MIT License

Acknowledgments

UncheatableEval - Original evaluation framework
Qwen - Qwen model family
RWKV - RWKV model family