How to use TheFloatingString/qwen2-0.5b-instruct-math-grpo with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TheFloatingString/qwen2-0.5b-instruct-math-grpo", dtype="auto")