How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="rfvasile/LinAlgZero-GRPO")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("rfvasile/LinAlgZero-GRPO", dtype="auto")
Quick Links

Model Card for LinalgZero-GSPO

Information and code used to train this model is available on Github.

This model is a fine-tuned version of atomwalk12/LinalgZero-SFT on the atomwalk12/linalgzero-grpo dataset using the GSPO algorithm. It has been trained using ART.

Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for rfvasile/LinAlgZero-GRPO

Adapter
(2)
this model

Space using rfvasile/LinAlgZero-GRPO 1