Instructions to use shiv207/gpt_oss_AGENTBOI with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use shiv207/gpt_oss_AGENTBOI with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="shiv207/gpt_oss_AGENTBOI")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("shiv207/gpt_oss_AGENTBOI", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use shiv207/gpt_oss_AGENTBOI with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "shiv207/gpt_oss_AGENTBOI" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "shiv207/gpt_oss_AGENTBOI", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/shiv207/gpt_oss_AGENTBOI
- SGLang
How to use shiv207/gpt_oss_AGENTBOI with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "shiv207/gpt_oss_AGENTBOI" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "shiv207/gpt_oss_AGENTBOI", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "shiv207/gpt_oss_AGENTBOI" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "shiv207/gpt_oss_AGENTBOI", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Unsloth Studio
How to use shiv207/gpt_oss_AGENTBOI with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for shiv207/gpt_oss_AGENTBOI to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for shiv207/gpt_oss_AGENTBOI to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for shiv207/gpt_oss_AGENTBOI to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="shiv207/gpt_oss_AGENTBOI", max_seq_length=2048, ) - Docker Model Runner
How to use shiv207/gpt_oss_AGENTBOI with Docker Model Runner:
docker model run hf.co/shiv207/gpt_oss_AGENTBOI
GPT-OSS AgentBoi
A parameter-efficient fine-tuning of GPT-OSS-20B focused on improving agentic reasoning, structured tool use, and ReAct-style instruction following.
This model was fine-tuned using LoRA adapters on the ReAct subset of Agent-FLAN with the goal of making GPT-OSS more reliable at multi-step reasoning, tool selection, action-observation workflows, and structured agent behavior.
Overview
Large language models are often strong conversationalists but can struggle with:
- Multi-step planning
- Tool selection and invocation
- ReAct-style reasoning workflows
- Structured action generation
- Separating reasoning from final responses
GPT-OSS AgentBoi adapts GPT-OSS-20B toward these agent-oriented tasks while remaining trainable on consumer hardware through parameter-efficient fine-tuning.
Model Details
| Item | Value |
|---|---|
| Model Name | GPT-OSS AgentBoi |
| Author | shiv207 |
| Base Model | unsloth/gpt-oss-20b-unsloth-bnb-4bit |
| Training Method | LoRA |
| Framework | Unsloth |
| Dataset | Agent-FLAN (ReAct subset) |
| Primary Task | Agentic Tool Use |
| Language | English |
| License | Apache 2.0 |
Training Data
The model was fine-tuned using examples from the Agent-FLAN dataset, specifically the ReAct-style instruction trajectories.
These examples teach the model to:
- Break complex tasks into intermediate steps
- Decide when tool usage is appropriate
- Generate structured actions
- Follow action-observation loops
- Produce concise final responses
Training Setup
Training was performed using:
- GPT-OSS-20B
- Unsloth
- TRL
- LoRA adapters
- Google Colab Tesla T4 GPU
The objective was to improve agentic behavior while keeping training accessible on limited hardware.
Intended Use
This model is intended for:
- AI agents
- Tool-calling systems
- Research assistants
- Retrieval-augmented generation workflows
- Multi-step planning tasks
- Agentic reasoning experiments
Potential applications include:
- Search agents
- Knowledge retrieval systems
- Function-calling assistants
- Research copilots
- Workflow automation agents
Example
User
Search for the latest SpaceX launch and summarize it.
Expected Agent Behavior
- Analyze the request.
- Determine that external information is required.
- Generate a structured search action.
- Process retrieved information.
- Produce a concise final answer.
The fine-tuning objective is to increase consistency in these workflows compared to the base model.
Loading the Model
from unsloth import FastLanguageModel
model, tokenizer = FastLanguageModel.from_pretrained(
"shiv207/gpt_oss_AGENTBOI"
)
Limitations
- Evaluated primarily through qualitative testing.
- No formal benchmark suite was used.
- Training utilized only a subset of Agent-FLAN.
- Performance may vary on unseen tool schemas.
- Not optimized for general-purpose instruction tuning beyond agent-oriented tasks.
Acknowledgments
This project builds upon the work of:
- OpenAI for GPT-OSS and the Harmony conversation format.
- Unsloth for efficient GPT-OSS fine-tuning support.
- InternLM for the Agent-FLAN dataset.
Repository
Source code and training notebook:
GitHub: https://github.com/shiv207
Author
shiv207
If you find this project useful, feel free to open issues, share feedback, or build on top of it.
Model tree for shiv207/gpt_oss_AGENTBOI
Base model
openai/gpt-oss-20b