Instructions to use ekunish/exp_p1_alpha128 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use ekunish/exp_p1_alpha128 with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit") model = PeftModel.from_pretrained(base_model, "ekunish/exp_p1_alpha128") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Unsloth Studio new
How to use ekunish/exp_p1_alpha128 with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ekunish/exp_p1_alpha128 to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for ekunish/exp_p1_alpha128 to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for ekunish/exp_p1_alpha128 to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="ekunish/exp_p1_alpha128", max_seq_length=2048, )
exp_p1_alpha128
Model ID: ekunish/exp_p1_alpha128
Phase 1 alpha sweep: lora_alpha=128, alpha/r=2.0 (base=exp008a reproduction)
Training Configuration
| Parameter | Value |
|---|---|
| Base model | Qwen/Qwen3-4B-Instruct-2507 |
| Method | QLoRA (4-bit) |
| Max sequence length | 512 |
| Epochs | 1 |
| Learning rate | 1e-06 |
| LoRA r | 64 |
| LoRA alpha | 128 |
| Batch size | 2 × 8 = 16 |
Usage
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
import torch
base = "Qwen/Qwen3-4B-Instruct-2507"
adapter = "ekunish/exp_p1_alpha128"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(
base,
torch_dtype=torch.float16,
device_map="auto",
)
model = PeftModel.from_pretrained(model, adapter)
Training Data
- Dataset:
data/sft_u10bei_only - License: CC-BY-4.0 (where applicable)
Sources & License
- Training Data: u-10bei/structured_data_with_cot_dataset_512_v2, daichira/structured-3k-mix-sft, etc.
- Dataset License: Creative Commons Attribution (CC-BY-4.0)
- Compliance: Users must comply with both the dataset's attribution requirements and the base model's original terms of use.
Competition
松尾研LLMコミュニティ 2025年度講座 メインコンペ (StructEval-T)
- Downloads last month
- 2
Model tree for ekunish/exp_p1_alpha128
Base model
Qwen/Qwen3-4B-Instruct-2507