	---
	title: Formula Engine Chatbot
	emoji: 🧮
	colorFrom: purple
	colorTo: blue
	sdk: gradio
	sdk_version: 5.32.0
	app_file: app.py
	pinned: false
	license: apache-2.0
	tags:
	- ml-intern
	---

	# 🧮 Formula Engine Chatbot

	## AI-Powered Weight Compression: Qwen 0.5B from Mathematical Formulas

	This Space demonstrates the Formula Engine concept:
	- Instead of storing the full 942 MB Qwen 0.5B model, the Space stores compact mathematical formulas (~474 MB) that reconstruct the weights on the fly.
	- This saves 49.7% of disk space while maintaining 99.99% accuracy.
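The savings figure follows from the two sizes quoted above. A quick check, using only the numbers stated in this README (the variable names are illustrative):

```python
# Sizes quoted in this README, in megabytes.
full_model_mb = 942      # full Qwen 0.5B model
formula_store_mb = 474   # approximate size of the stored formulas

# Fraction of disk space no longer needed once the formulas replace the model.
saved_fraction = 1 - formula_store_mb / full_model_mb
print(f"Disk space saved: {saved_fraction:.1%}")  # prints "Disk space saved: 49.7%"
```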

	### How it works:
	1. Formula Discovery: Analyzes each weight matrix to find the most compact representation
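	2. 4-bit Quantization: `W ≈ scale × W_q + zero_point`, where `W_q` holds 4-bit integer codes (75% compression per layer, which corresponds to a 16-bit baseline)
	3. SVD Factorization: `W ≈ U_r × S_r × V_r^T`, keeping only the top `r` singular values (for rectangular matrices)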
	4. On-the-fly Reconstruction: Formulas regenerate weights at startup
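Steps 2 to 4 can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not the Space's actual code: the function names are hypothetical, `zero_point` is taken to be a float offset (the minimum weight) so that it matches the formula above, and the SVD factors are written by hand to keep the sketch dependency-free. In practice they would come from an SVD routine such as `numpy.linalg.svd`.

```python
# Hypothetical helpers illustrating steps 2-4; names are not from the Space.

def quantize_4bit(weights):
    """Map floats to integer codes 0..15 plus a (scale, zero_point) pair."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 15 or 1.0  # 4 bits give 16 levels; guard against scale == 0
    zero_point = lo                # assumed float offset: W ≈ scale × W_q + zero_point
    codes = [round((w - zero_point) / scale) for w in weights]
    return codes, scale, zero_point

def dequantize_4bit(codes, scale, zero_point):
    """Reconstruct approximate weights from the 4-bit codes."""
    return [scale * q + zero_point for q in codes]

def low_rank_reconstruct(U, S, V):
    """Compute U_r × diag(S_r) × V_r^T for U (m×r), S (length r), V (n×r)."""
    m, n, r = len(U), len(V), len(S)
    return [[sum(U[i][k] * S[k] * V[j][k] for k in range(r)) for j in range(n)]
            for i in range(m)]

def low_rank_param_count(m, n, r):
    """Numbers stored for a rank-r factorization, versus m * n for the full matrix."""
    return r * (m + n + 1)

# Quantization round trip on a toy weight vector.
weights = [-0.30, -0.10, 0.00, 0.15, 0.45]
codes, scale, zero_point = quantize_4bit(weights)
restored = dequantize_4bit(codes, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(weights, restored))  # at most scale / 2

# Rank-1 truncation of the 2×3 matrix [[3, 0, 0], [0, 1, 0]]:
# only the largest singular value (3) and its singular vectors are kept.
U_r = [[1.0], [0.0]]
S_r = [3.0]
V_r = [[1.0], [0.0], [0.0]]
approx = low_rank_reconstruct(U_r, S_r, V_r)  # [[3.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
```

The round-trip error of the quantizer is bounded by half a quantization step (`scale / 2`), and the low-rank form only pays off when `r * (m + n + 1)` is well below `m * n`. For a hypothetical 1000 × 4000 matrix at rank 100, that is 500,100 stored numbers instead of 4,000,000.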

	### Links:
	- 📦 [Formula Weights Repository](https://huggingface.co/arudradey/qwen-formula-engine)
	- 🤖 [Base Model: Qwen2.5-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-0.5B-Instruct)

	### Note
	This Space requires ~2 GB of RAM to reconstruct and run the model. Inference runs on CPU.