Text Generation
Transformers
PyTorch
English
veronica
polymorphic-mlp
mixture-of-branches
entropy-regularized-routing
decoder-only
causal-lm
rope
expandable-architecture
research
Instructions to use MhaWay/Veronica with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use MhaWay/Veronica with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="MhaWay/Veronica")# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("MhaWay/Veronica", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use MhaWay/Veronica with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "MhaWay/Veronica" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MhaWay/Veronica", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/MhaWay/Veronica
- SGLang
How to use MhaWay/Veronica with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "MhaWay/Veronica" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MhaWay/Veronica", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "MhaWay/Veronica" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "MhaWay/Veronica", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use MhaWay/Veronica with Docker Model Runner:
docker model run hf.co/MhaWay/Veronica
Update README.md
Browse files
README.md
CHANGED
|
@@ -337,6 +337,20 @@ if num_funcs >= 4:
|
|
| 337 |
|
| 338 |
---
|
| 339 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 340 |
## Roadmap
|
| 341 |
| Version | Goal |
|
| 342 |
|---------|------|
|
|
|
|
| 337 |
|
| 338 |
---
|
| 339 |
|
| 340 |
+
## Router Stability (Important)
|
| 341 |
+
|
| 342 |
+
Dynamic soft‑routing is powerful but sensitive. The training methodology is under active refinement to ensure healthy branch growth without premature specialization.
|
| 343 |
+
|
| 344 |
+
- Instability: routing entropy can drop early, leading to branch collapse. This is highly sensitive to temperature (τ) scheduling, entropy auxiliary weight (λ), and any warmup forcing.
|
| 345 |
+
- Current safeguards: high initial τ, extended freeze period, entropy‑max regularization, and selective forcing during early steps.
|
| 346 |
+
- Expectations: training curves may show transient oscillations in branch usage while the router and branches co‑adapt.
|
| 347 |
+
- What to monitor: `entropy_norm ≥ 0.75` in the first 3–5k steps; no branch persistently < 15%.
|
| 348 |
+
- Intervention playbook: increase `router_aux_weight`, extend `router_tau_freeze_steps`, temporarily raise `router_tau_start`, or apply targeted forcing to the weakest branch.
|
| 349 |
+
- Fine‑tuning note: if using the standard HF Trainer, consider `router_aux_weight=0` (or use `scripts/train_veronica.py`, which handles entropy‑max correctly).
|
| 350 |
+
- Status: ongoing refinement. Default τ/λ schedules may evolve; core API will remain stable.
|
| 351 |
+
|
| 352 |
+
---
|
| 353 |
+
|
| 354 |
## Roadmap
|
| 355 |
| Version | Goal |
|
| 356 |
|---------|------|
|