MrMoeeee committed on
Commit
d050d18
·
verified ·
1 Parent(s): 57279d5

Add model card with training details and graphs

Files changed (1)
  1. README.md +74 -43
README.md CHANGED
@@ -1,67 +1,98 @@
  ---
  license: apache-2.0
  tags:
- - lamp
- - iot
- - pixel-art
- - light-control
- - fine-tuned
- - gguf
- - ollama
  ---

- # LAMP Fine-Tuned Models

- Fine-tuned language models for the **LAMP** (Moonside Lamp Emulator) project. These models generate JSON light programs from natural language descriptions.

  ## Models

- | Model | Base | Parameters | GGUF Size | Final Eval Loss |
- |-------|------|-----------|-----------|-----------------|
- | `lamp-llama-3b.Q8_0.gguf` | Llama 3.2 3B Instruct | 3.2B | 3.2 GB | 0.0294 |
- | `lamp-gemma-4b.Q8_0.gguf` | Gemma 3 4B IT | 4.3B | 3.9 GB | 0.0247 |

  ## Training Details

- - **Method**: Full fine-tune (100% parameters trainable, not LoRA)
- - **Precision**: bf16
- - **Hardware**: NVIDIA H200 (140GB VRAM)
- - **Framework**: Unsloth + HuggingFace TRL
- - **Dataset**: 2,268 train / 253 validation examples
- - **Epochs**: 3
- - **Batch size**: 16 (4 per device x 4 gradient accumulation)
- - **Learning rate**: 2e-5 with cosine decay
- - **Optimizer**: AdamW

- ## Training Results

- ![Training Results](training_results.png)

- Both models converged well:
- - **Llama 3.2 3B**: Loss 1.38 -> 0.018, eval loss 0.0294
- - **Gemma 3 4B**: Loss 1.37 -> 0.018, eval loss 0.0247

- ## Usage with Ollama

- 1. Download the GGUF file
- 2. Create a Modelfile:

- ```
- # Modelfile.lamp-llama-3b
- FROM ./lamp-llama-3b.Q8_0.gguf
- PARAMETER temperature 0.3
- PARAMETER num_predict 4096
- PARAMETER stop <|eot_id|>
- ```

- 3. Create and run:
  ```bash
- ollama create lamp-llama-3b -f Modelfile.lamp-llama-3b
- ollama run lamp-llama-3b "warm and cozy"
  ```

- ## Example

- **Input**: "Create a light program for: warm and cozy"

- **Output**: A JSON program controlling LED pixels with colors, animations, and timing for a warm ambient effect.
  ---
  license: apache-2.0
+ base_model: google/gemma-3-4b-it
  tags:
+ - gemma3
+ - gguf
+ - fine-tuned
+ - lamp
+ - lighting
+ - smart-home
+ - json
+ datasets:
+ - custom
+ pipeline_tag: text-generation
  ---

+ # LAMP Models — Fine-tuned for Smart Lighting Control

+ Fine-tuned language models that generate JSON lighting programs from natural language descriptions.

  ## Models

+ | Model | Base | Params | GGUF Size | Final Eval Loss |
+ |-------|------|--------|-----------|-----------------|
+ | **lamp-gemma-4b-v2** | Gemma 3 4B IT | 4.3B | ~4.1 GB (Q8_0) | 0.0288 |

  ## Training Details

+ - **Fine-tune Type:** Full parameter (no LoRA) — all 4,300,079,472 parameters trained
+ - **Precision:** bf16 (bfloat16)
+ - **Dataset:** 6,567 training examples + 730 validation examples
+ - **Epochs:** 2
+ - **Effective Batch Size:** 16 (8 per device × 2 gradient accumulation)
+ - **Learning Rate:** 2e-5 with cosine schedule
+ - **Optimizer:** AdamW (weight decay 0.01)
+ - **Training Time:** 38.1 minutes on NVIDIA H200
+ - **Peak VRAM:** 24.3 GB
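The hyperparameter list above can be restated as a config sketch. The field names below follow HuggingFace `TrainingArguments` conventions but are assumptions, since the actual training script is not part of this commit:

```python
# Illustrative config mirroring the hyperparameters listed above.
# Field names follow HuggingFace TrainingArguments conventions but are
# assumptions -- the real training script is not included in this repo.
train_config = {
    "bf16": True,                        # bfloat16 precision
    "num_train_epochs": 2,
    "per_device_train_batch_size": 8,
    "gradient_accumulation_steps": 2,
    "learning_rate": 2e-5,
    "lr_scheduler_type": "cosine",
    "optim": "adamw_torch",
    "weight_decay": 0.01,
}

# Effective batch size = per-device batch x gradient accumulation (single GPU).
effective_batch = (train_config["per_device_train_batch_size"]
                   * train_config["gradient_accumulation_steps"])
print(effective_batch)  # 16
```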
 
+ ## Training Loss

+ ![Training Loss](lamp-gemma-4b-v2/graphs/training_loss.png)

+ ## Training Details Graph

+ ![Training Details](lamp-gemma-4b-v2/graphs/training_details.png)

+ ## Training Summary

+ ![Training Summary](lamp-gemma-4b-v2/graphs/training_summary.png)
+
+ ## Usage
+
+ ### With Ollama (GGUF)

  ```bash
+ # Download the GGUF file and Modelfile from lamp-gemma-4b-v2-gguf/
+ ollama create lamp-gemma -f Modelfile
+ ollama run lamp-gemma "warm and cozy lighting"
+ ```
+
+ ### With Transformers (HuggingFace)
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ model = AutoModelForCausalLM.from_pretrained("MrMoeeee/lamp-models", subfolder="lamp-gemma-4b-v2")
+ tokenizer = AutoTokenizer.from_pretrained("MrMoeeee/lamp-models", subfolder="lamp-gemma-4b-v2")
  ```
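In practice the generated text still has to be decoded into a program. A minimal sketch of pulling the JSON out of raw model output, assuming generation may wrap the program in extra text (the `colors`/`animation`/`speed` fields here are hypothetical examples, not the repo's documented schema):

```python
import json

def parse_lamp_program(generated: str) -> dict:
    """Extract the first JSON object from the model's raw output.

    The model is trained to emit a JSON lighting program, but the raw text may
    include surrounding prose or a trailing stop token, so we locate the
    outermost braces before decoding.
    """
    start = generated.find("{")
    end = generated.rfind("}")
    if start == -1 or end <= start:
        raise ValueError("no JSON object found in model output")
    return json.loads(generated[start:end + 1])

# Example with a hypothetical model response (fields are illustrative only):
raw = 'Here is your program: {"colors": ["#FFB347"], "animation": "breathing", "speed": 0.5}'
program = parse_lamp_program(raw)
print(program["animation"])  # breathing
```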

+ ## Files
+
+ ```
+ lamp-gemma-4b-v2/                  # Full model weights + training logs
+ ├── model-00001-of-00002.safetensors
+ ├── model-00002-of-00002.safetensors
+ ├── config.json
+ ├── tokenizer.json
+ ├── training_config.json
+ ├── training_log.json
+ ├── training_metrics.csv
+ ├── metrics_detailed.json
+ └── graphs/
+     ├── training_loss.png
+     ├── training_details.png
+     └── training_summary.png
+
+ lamp-gemma-4b-v2-gguf/             # Quantized GGUF for inference
+ ├── lamp-gemma-4b-v2-Q8_0.gguf
+ └── Modelfile
+ ```
 
+ ## Dataset

+ The LAMP dataset consists of natural language lighting requests paired with JSON lighting programs. Each program controls RGB LEDs with support for:
+ - Static colors and gradients
+ - Animations (breathing, rainbow, chase, etc.)
+ - Multi-step sequences with timing
+ - Brightness and speed control
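To make those capabilities concrete, a hypothetical multi-step program could look like the sketch below; every field name is invented for illustration and may differ from the dataset's real schema:

```python
# Hypothetical multi-step lighting program illustrating the features above.
# The schema is illustrative only -- field names are not taken from the dataset.
program = {
    "brightness": 0.8,
    "steps": [
        {"type": "static",    "color": "#FF8C00", "duration_ms": 2000},
        {"type": "animation", "name": "breathing", "speed": 0.4, "duration_ms": 5000},
        {"type": "gradient",  "colors": ["#FF8C00", "#8B0000"], "duration_ms": 3000},
    ],
}

# Total runtime of the sequence is the sum of the per-step durations.
total_ms = sum(step["duration_ms"] for step in program["steps"])
print(total_ms)  # 10000
```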