Instructions to use shiv207/gpt_oss_AGENTBOI with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use shiv207/gpt_oss_AGENTBOI with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="shiv207/gpt_oss_AGENTBOI")

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("shiv207/gpt_oss_AGENTBOI", dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use shiv207/gpt_oss_AGENTBOI with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "shiv207/gpt_oss_AGENTBOI"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "shiv207/gpt_oss_AGENTBOI",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/shiv207/gpt_oss_AGENTBOI

SGLang

How to use shiv207/gpt_oss_AGENTBOI with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "shiv207/gpt_oss_AGENTBOI" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "shiv207/gpt_oss_AGENTBOI",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "shiv207/gpt_oss_AGENTBOI" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "shiv207/gpt_oss_AGENTBOI",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Unsloth Studio

How to use shiv207/gpt_oss_AGENTBOI with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for shiv207/gpt_oss_AGENTBOI to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for shiv207/gpt_oss_AGENTBOI to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for shiv207/gpt_oss_AGENTBOI to start chatting

Load model with FastModel

pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
    model_name="shiv207/gpt_oss_AGENTBOI",
    max_seq_length=2048,
)

Docker Model Runner
How to use shiv207/gpt_oss_AGENTBOI with Docker Model Runner:
```
docker model run hf.co/shiv207/gpt_oss_AGENTBOI
```

GPT-OSS AgentBoi

A parameter-efficient fine-tuning of GPT-OSS-20B focused on improving agentic reasoning, structured tool use, and ReAct-style instruction following.

This model was fine-tuned using LoRA adapters on the ReAct subset of Agent-FLAN with the goal of making GPT-OSS more reliable at multi-step reasoning, tool selection, action-observation workflows, and structured agent behavior.

Overview

Large language models are often strong conversationalists but can struggle with:

Multi-step planning
Tool selection and invocation
ReAct-style reasoning workflows
Structured action generation
Separating reasoning from final responses

GPT-OSS AgentBoi adapts GPT-OSS-20B toward these agent-oriented tasks while remaining trainable on consumer hardware through parameter-efficient fine-tuning.

Model Details

Item	Value
Model Name	GPT-OSS AgentBoi
Author	shiv207
Base Model	unsloth/gpt-oss-20b-unsloth-bnb-4bit
Training Method	LoRA
Framework	Unsloth
Dataset	Agent-FLAN (ReAct subset)
Primary Task	Agentic Tool Use
Language	English
License	Apache 2.0

Training Data

The model was fine-tuned using examples from the Agent-FLAN dataset, specifically the ReAct-style instruction trajectories.

These examples teach the model to:

Break complex tasks into intermediate steps
Decide when tool usage is appropriate
Generate structured actions
Follow action-observation loops
Produce concise final responses

Training Setup

Training was performed using:

GPT-OSS-20B
Unsloth
TRL
LoRA adapters
Google Colab Tesla T4 GPU

The objective was to improve agentic behavior while keeping training accessible on limited hardware.

Intended Use

This model is intended for:

AI agents
Tool-calling systems
Research assistants
Retrieval-augmented generation workflows
Multi-step planning tasks
Agentic reasoning experiments

Potential applications include:

Search agents
Knowledge retrieval systems
Function-calling assistants
Research copilots
Workflow automation agents

Example

User

Search for the latest SpaceX launch and summarize it.

Expected Agent Behavior

Analyze the request.
Determine that external information is required.
Generate a structured search action.
Process retrieved information.
Produce a concise final answer.

The fine-tuning objective is to increase consistency in these workflows compared to the base model.

Loading the Model

from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    "shiv207/gpt_oss_AGENTBOI"
)

Limitations

Evaluated primarily through qualitative testing.
No formal benchmark suite was used.
Training utilized only a subset of Agent-FLAN.
Performance may vary on unseen tool schemas.
Not optimized for general-purpose instruction tuning beyond agent-oriented tasks.

Acknowledgments

This project builds upon the work of:

OpenAI for GPT-OSS and the Harmony conversation format.
Unsloth for efficient GPT-OSS fine-tuning support.
InternLM for the Agent-FLAN dataset.

Repository

Source code and training notebook:

GitHub: https://github.com/shiv207

Author

shiv207

If you find this project useful, feel free to open issues, share feedback, or build on top of it.

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for shiv207/gpt_oss_AGENTBOI

Base model

openai/gpt-oss-20b

Quantized

unsloth/gpt-oss-20b-unsloth-bnb-4bit

Adapter

(72)

this model