Spaces:

Jellyfish042
/

Compression-Lens

Running

App Files Files Community

Compression-Lens / README.md

Jellyfish042

Update README.md

ddd27a5 verified 12 days ago

preview code

raw

history blame contribute delete

2.05 kB

	---
	title: Compression-Lens
	emoji: 🔬
	colorFrom: blue
	colorTo: green
	sdk: gradio
	sdk_version: 4.44.0
	app_file: app.py
	pinned: false
	license: mit
	---

	# UncheatableEval: Qwen3 vs RWKV7 Byte-Level Comparison

	Compare the byte-level prediction performance between models.

	## Features

	- Byte-level analysis: See exactly where each model performs better or worse
	- Interactive visualization: Hover over tokens to see detailed predictions
	- Color-coded comparison:
	- 🟢 Green = Qwen3 predicts better (lower loss)
	- 🔴 Red = RWKV7 predicts better (lower loss)
	- Top-10 predictions: View each model's top predictions for every token
	- Word occurrence linking: See how repeated words are predicted differently

	## How to Use

	1. Enter or paste your text (max 4000 characters)
	2. Click "Run Comparison"
	3. Explore the interactive visualization
	4. Download the HTML file for offline viewing

	## Models

	\| Model \| Type \| Parameters \| Architecture \|
	\|-------\|------\|------------\|--------------\|
	\| Qwen3-1.7B-Base \| Transformer \| 1.7B \| Dense attention \|
	\| RWKV7-G1C-1.5B \| RWKV \| 1.5B \| Linear attention \|

	## Technical Details

	This tool uses the [UncheatableEval](https://github.com/Jellyfish042/UncheatableEval) framework to:

	1. Tokenize input text with each model's tokenizer
	2. Calculate per-token cross-entropy loss
	3. Map token losses to byte-level losses
	4. Generate interactive HTML visualization

	## Local Development

	```bash
	# Clone the repository
	git clone https://huggingface.co/spaces/YOUR_USERNAME/UncheatableEval-Visualization

	# Install dependencies
	pip install -r requirements.txt

	# Run locally
	python app.py
	```

	## Requirements

	- CUDA-capable GPU (16GB+ VRAM recommended)
	- Python 3.10+
	- See `requirements.txt` for package dependencies

	## License

	MIT License

	## Acknowledgments

	- [UncheatableEval](https://github.com/Jellyfish042/UncheatableEval) - Original evaluation framework
	- [Qwen](https://github.com/QwenLM/Qwen) - Qwen model family
	- [RWKV](https://github.com/BlinkDL/RWKV-LM) - RWKV model family