
madiedgar committed on Mar 14

Commit f48842c (verified) · 1 Parent(s): b51e5c6

init: create README.md

README.md +118 -3

@@ -1,3 +1,118 @@
----
-license: apache-2.0
----

+---
+license: cc-by-nc-4.0
+language:
+  - multilingual
+tags:
+  - lora
+  - aya
+  - tiny-aya
+  - multilingual
+  - code
+  - legesher
+  - tiny-aya-expedition
+  - language-decoded
+library_name: transformers
+base_model:
+  - CohereLabs/tiny-aya-global
+  - CohereLabs/tiny-aya-fire
+  - CohereLabs/tiny-aya-earth
+  - CohereLabs/tiny-aya-water
+pipeline_tag: text-generation
+---
+# Language Decoded LoRA
+LoRA adapters fine-tuned under the multilingual-code experimental conditions of the **Language Decoded** project (part of Cohere's Tiny Aya Expedition).
+## Research Question
+> Does fine-tuning on non-English code improve multilingual reasoning — and is the benefit language-dependent or structure-dependent?
+## Base Models
+All adapters are trained on [Tiny Aya](https://huggingface.co/collections/CohereLabs/tiny-aya) (3.35B parameters), a multilingual model optimized for 70+ languages.
+| Model | HF ID | Regional Strength |
+|---|---|---|
+| **Global** | `CohereLabs/tiny-aya-global` | Balanced across all languages |
+| **Fire** | `CohereLabs/tiny-aya-fire` | South Asian (Urdu) |
+| **Earth** | `CohereLabs/tiny-aya-earth` | West Asian & African (Amharic) |
+| **Water** | `CohereLabs/tiny-aya-water` | European & Asia Pacific (Chinese) |
+## Model Structure
+This repo contains LoRA adapters organized by experimental condition and base model variant:
+| Subdirectory | Condition | Training Data |
+|---|---|---|
+| `global/baseline/` | Condition 1 | No code augmentation |
+| `global/english-code/` | Condition 2 | English-keyword Python code |
+| `global/multilingual-code/` | Condition 3 | Python transpiled to Urdu, Amharic, Chinese keywords |
+| `global/multilingual-text/` | Condition 4 | Non-code multilingual text |
+| `fire/multilingual-code/` | Regional | Urdu-keyword Python on Fire variant |
+| `earth/multilingual-code/` | Regional | Amharic-keyword Python on Earth variant |
+| `water/multilingual-code/` | Regional | Chinese-keyword Python on Water variant |
+## Usage
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+from peft import PeftModel
+# Load base model (Global variant)
+base_model = AutoModelForCausalLM.from_pretrained("CohereLabs/tiny-aya-global")
+tokenizer = AutoTokenizer.from_pretrained("CohereLabs/tiny-aya-global")
+# Load a LoRA adapter (e.g., multilingual code on Global)
+model = PeftModel.from_pretrained(base_model, "Legesher/language-decoded-lora", subfolder="global/multilingual-code")
+# Or load a regional variant (e.g., Urdu code on Fire)
+base_fire = AutoModelForCausalLM.from_pretrained("CohereLabs/tiny-aya-fire")
+model_fire = PeftModel.from_pretrained(base_fire, "Legesher/language-decoded-lora", subfolder="fire/multilingual-code")
+```
+## Training Details
+- **Base models**: Tiny Aya 3.35B — Global, Fire, Earth, Water ([CohereLabs](https://huggingface.co/CohereLabs))
+- **Method**: QLoRA (Quantized Low-Rank Adaptation)
+- **Training data**: [Legesher/language-decoded-data](https://huggingface.co/datasets/Legesher/language-decoded-data)
+- **Parameters**: 3.35B base, ~0.1% trainable via LoRA
+*Detailed hyperparameters and training configs will be added as training completes.*
+## Evaluation
+Models are evaluated on multilingual reasoning benchmarks:
+| Benchmark | Task | Languages |
+|---|---|---|
+| XNLI | Natural language inference | 15 |
+| XStoryCloze | Story completion | 11 |
+| TyDi QA | Question answering | 11 |
+| MMLU | Knowledge | Multilingual |
+*Results will be added as evaluation completes.*
+## Related Resources
+- **Base models**: [Tiny Aya Collection](https://huggingface.co/collections/CohereLabs/tiny-aya)
+- **Training data**: [Legesher/language-decoded-data](https://huggingface.co/datasets/Legesher/language-decoded-data)
+- **Community code**: [Legesher/language-decoded-community](https://huggingface.co/datasets/Legesher/language-decoded-community)
+- **Experiments**: [Legesher/language-decoded-experiments](https://huggingface.co/datasets/Legesher/language-decoded-experiments)
+- **Transpilation tool**: [Legesher](https://github.com/Legesher/legesher)
+## Citation
+```bibtex
+@misc{language-decoded-2026,
+  title={Language Decoded: Investigating Language-Dependent vs. Structure-Dependent Reasoning Benefits of Code},
+  author={Madison Edgar and Saad Bazaz and Rafay Mustafa and Sarah Jawaid and Rashik Shahjahan and Khojasteh Mirza and Sohaib Bazaz},
+  year={2026},
+  publisher={Hugging Face},
+  url={https://huggingface.co/Legesher/language-decoded-lora}
+}
+```
+## License
+CC-BY-NC 4.0 (inherits from Tiny Aya base models)
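
Review note: the Model Structure table in the README above pairs each base-model variant with a set of adapter subdirectories. A minimal sketch of that layout as a lookup is given below. The helper `resolve_adapter` and the constants `BASE_MODELS`, `ADAPTER_SUBFOLDERS`, and `ADAPTER_REPO` are hypothetical names introduced for illustration and are not part of the repository; the model IDs and paths are copied from the README tables, and the sketch assumes the subdirectories exist exactly as listed.

```python
from typing import Tuple

# Hypothetical constants: names are illustrative, values are copied from the README tables.
ADAPTER_REPO = "Legesher/language-decoded-lora"

BASE_MODELS = {
    "global": "CohereLabs/tiny-aya-global",
    "fire": "CohereLabs/tiny-aya-fire",
    "earth": "CohereLabs/tiny-aya-earth",
    "water": "CohereLabs/tiny-aya-water",
}

# Conditions listed per variant in the Model Structure table.
ADAPTER_SUBFOLDERS = {
    "global": ("baseline", "english-code", "multilingual-code", "multilingual-text"),
    "fire": ("multilingual-code",),
    "earth": ("multilingual-code",),
    "water": ("multilingual-code",),
}


def resolve_adapter(variant: str, condition: str) -> Tuple[str, str]:
    """Return (base_model_id, subfolder) for a variant/condition pair listed in the README."""
    if variant not in BASE_MODELS:
        raise ValueError(f"unknown variant: {variant!r}")
    if condition not in ADAPTER_SUBFOLDERS[variant]:
        raise ValueError(f"no {condition!r} adapter listed for variant {variant!r}")
    return BASE_MODELS[variant], f"{variant}/{condition}"
```

The returned pair lines up with the calls in the README's Usage section: the first element is the ID passed to `AutoModelForCausalLM.from_pretrained`, and the second is the `subfolder` argument passed to `PeftModel.from_pretrained` together with `ADAPTER_REPO`. Rejecting unlisted pairs early (for example `fire` with `baseline`) avoids a failed download for a subdirectory the table does not mention.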