The Pivot — RL-trained Startup Advisor (LoRA)

Fine-tuned with GRPO on the The Pivot OpenEnv environment.

Trained to navigate hidden market phase shifts across 5 startup scenarios over 150 episodes.

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-1.5B-Instruct")
model = PeftModel.from_pretrained(base, "Harshit-Makraria/the-pivot-lora-quick")
tokenizer = AutoTokenizer.from_pretrained("Harshit-Makraria/the-pivot-lora-quick")

Built for the Meta PyTorch OpenEnv Hackathon 2026.

Downloads last month: 36

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Harshit-Makraria/the-pivot-lora-quick

Base model

Qwen/Qwen2.5-1.5B

Finetuned

Qwen/Qwen2.5-1.5B-Instruct

Adapter

(962)

this model