chem-Phi-Mini / README.md
rish3on3AI's picture
better readme
4ce551a verified
|
Raw
History Blame Contribute Delete
2.66 kB
---
base_model: unsloth/phi-4-mini-reasoning-unsloth-bnb-4bit
library_name: peft
model_name: ChemPhi-Mini
license: apache-2.0
pipeline_tag: text-generation
tags:
- chemistry
- educational-ai
- lora
- qlora
- sft
- transformers
- trl
- unsloth
- reasoning
- peft
- local-llm
---
# ChemPhi-Mini
ChemPhi-Mini is a lightweight chemistry-focused reasoning model fine-tuned from `unsloth/phi-4-mini-reasoning-unsloth-bnb-4bit` using supervised fine-tuning (SFT).
This project explores efficient domain adaptation for educational AI systems under constrained hardware environments. The goal was to build a compact chemistry tutoring and reasoning assistant capable of running locally with minimal GPU resources while maintaining useful scientific explanation capabilities.
---
# Project Goals
This project was built to explore:
- Parameter-efficient fine-tuning (PEFT)
- Low-resource LLM training workflows
- Chemistry-focused educational reasoning
- Lightweight local AI systems
- Quantized inference and deployment
- Linux-based AI experimentation
The model is part of a broader self-hosted AI and systems engineering learning workflow involving:
- Linux infrastructure
- Local inference pipelines
- GPU-constrained experimentation
- Open-source AI tooling
---
# Base Model
Base model used:
`unsloth/phi-4-mini-reasoning-unsloth-bnb-4bit`
Core characteristics:
- Phi-4 Mini Reasoning architecture
- 4-bit quantized
- Optimized using the Unsloth ecosystem
- Designed for efficient fine-tuning and inference
---
# Training Method
This model was fine-tuned using:
- LoRA (Low-Rank Adaptation)
- PEFT
- TRL SFTTrainer
- 4-bit quantization
- Supervised Fine-Tuning (SFT)
Training focused on:
- Chemistry explanations
- Conceptual reasoning
- Educational QA
- Scientific response formatting
---
# Hardware & Environment
Training environment:
- Google Colab
- NVIDIA T4 GPU
- CUDA-enabled PyTorch stack
This project specifically explored practical AI development under limited VRAM conditions.
---
# Tech Stack
- Transformers
- TRL
- PEFT
- Unsloth
- PyTorch
- Hugging Face ecosystem
Framework versions:
- PEFT 0.19.1
- TRL 0.24.0
- Transformers 5.5.0
- PyTorch 2.10.0+cu128
- Datasets 4.3.0
- Tokenizers 0.22.2
---
# Example Usage
```python
from transformers import pipeline
generator = pipeline(
"text-generation",
model="rish3on3AI/ChemPhi-Mini",
device="cuda"
)
messages = [
{
"role": "user",
"content": "Explain why increasing temperature favors endothermic reactions."
}
]
output = generator(
messages,
max_new_tokens=256,
return_full_text=False
)
print(output[0]["generated_text"])