Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Pierizvi
/
infused-reasoning-phi2
like
0
Text Generation
PEFT
Safetensors
gsm8k
English
reasoning
mathematics
grpo
reinforcement-learning
phi-2
step-by-step
mathematical-reasoning
rlhf
License:
mit
Model card
Files
Files and versions
xet
Community
Use this model
main
infused-reasoning-phi2
/
tokenizer.json
Commit History
epoch-639
c9d92fa
verified
Pierizvi
commited on
Jun 4, 2025