Revise README for improved clarity and structure

Updated README to enhance clarity and structure, including detailed descriptions of operational modes, validation results, quick start guide, technical details, installation instructions, citation, and contact information.

Files changed (1) hide show

README.md +136 -2

README.md CHANGED Viewed

@@ -1,2 +1,136 @@
-# Unified-LoRa
-Adaptive Single/Multi/Mirror LoRA framework with automatic mode switching driven by synaptic stress signal φ(t). Validated on Llama-3.2-1B (Tinker) and GLUE MRPC with full stress-recovery cycle demonstration.

+# Unified LoRA
+**Adaptive Single/Multi/Mirror LoRA framework with automatic mode switching driven by synaptic stress signals.**
+[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
+## Overview
+Unified LoRA is a dynamic parameter-efficient fine-tuning system that automatically switches between three operational modes based on training stress:
+- **Mode 0 (Single)**: Shared adapter for low-conflict scenarios
+- **Mode 1 (Multi)**: Task-specific LoRA adapters for moderate stress
+- **Mode 2 (Mirror)**: Stability snapshots for catastrophic forgetting prevention
+The system uses a synaptic control parameter **φ(t)** derived from:
+- **C**: Task conflict (weight space variance)
+- **E**: Multi-task error
+- **S**: Memory stability
+```
+φ(t) = f(C, E, S, ΔC, ΔE, ΔS)
+```
+## Validation Results
+### 🧪 Production LLM (Tinker + Llama-3.2-1B)
+Full stress→recovery cycle demonstrated on cloud GPU:
+```
+[250] Mode=1 φ=0.333 E_s=0.500 lr=5e-4  (Multi stable)
+>>> SHOCK @ step 300
+[350] Mode=2 φ=0.827 E_s=4.979 lr=1e-4  (Mirror activated)
+>>> RECOVERY @ step 500
+[550] Mode=1 φ=0.371 E_s=0.521 lr=5e-4  (Multi return)
+[700] Mode=1 φ=0.333 E_s=0.500 lr=5e-4  (baseline restored)
+```
+**Key finding**: Complete reversibility (φ: 0.33 → 0.83 → 0.33)
+### 📊 Standard Benchmark (GLUE MRPC + DistilBERT)
+Performance parity with baseline LoRA:
+| Method | F1 | Accuracy | φ final | Mode |
+|--------|-----|----------|---------|------|
+| Baseline LoRA | 0.785 | 0.646 | - | - |
+| Unified LoRA | 0.785 | 0.646 | 0.367 | 1 |
+**Key finding**: Zero degradation with adaptive control active
+## Quick Start
+```python
+from unified_lora import UnifiedController
+# Initialize controller
+controller = UnifiedController(
+    alpha=0.1,      # φ(t) learning rate
+    beta=0.9,       # EMA smoothing
+    theta0=0.3,     # Single/Multi threshold
+    theta1=0.7      # Multi/Mirror threshold
+)
+# Training loop
+for step, batch in enumerate(train_loader):
+    outputs = model(**batch)
+    loss = outputs.loss
+    # Update controller and get adaptive LR
+    new_lr = controller.update(loss.item())
+    # Apply new learning rate
+    for g in optimizer.param_groups:
+        g['lr'] = new_lr
+    # Standard backprop
+    loss.backward()
+    optimizer.step()
+    optimizer.zero_grad()
+```
+## Technical Details
+### FSM State Transitions
+```
+φ < 0.3  → Mode 0 (Single)   LR = 5e-5
+φ < 0.7  → Mode 1 (Multi)    LR = 3e-5
+φ ≥ 0.7  → Mode 2 (Mirror)   LR = 1e-5
+```
+### Stress Signal Computation
+The metrics **C** (conflict), **E** (error), and **S** (stability) are normalized to [0,1] to maintain φ(t) well-conditioned:
+```python
+E_smooth = β * E_smooth + (1 - β) * loss
+D = E_smooth / (1 + E_smooth)  # Normalize to [0,1]
+φ = (1 - α) * φ + α * D         # EMA update
+```
+This normalization ensures stable FSM transitions and prevents numerical instabilities during training.
+## Installation
+```bash
+pip install transformers peft torch
+```
+## Citation
+If you use Unified LoRA in your research, please cite:
+```bibtex
+@software{unified_lora_2025,
+  author = {Simona Vargiu},
+  title = {Unified LoRA: Adaptive Parameter-Efficient Fine-Tuning},
+  year = {2025},
+  url = {https://github.com/Sva76/Unified-LoRA}
+}
+```
+## License
+Apache License 2.0 - see [LICENSE](LICENSE) for details.
+## Contact
+**Simona Vargiu** (Independent Researcher)
+For collaboration inquiries: [simona.vargiu.malta@gmail.com](mailto:simona.vargiu.malta@gmail.com)
+---
+**Status**: Validated on production LLM (Tinker/Llama-3.2-1B) and standard benchmarks (GLUE MRPC). Ready for research collaboration and testing.