DatPySci / PreRLVR-Controlled

Safetensors
PreRLVR-Controlled / models
56.9 GB
  • 1 contributor
History: 48 commits
Latest commit: DatPySci, "Upload folder using huggingface_hub", 2af2ebc (verified), 10 days ago
  • EvoLM-1B-160BT-MixedFW8FM42-100k-evolm-GRPO
    Upload folder using huggingface_hub 10 days ago
  • EvoLM-1B-160BT-MixedFW8FM42-100k-omega-GRPO
    Upload folder using huggingface_hub 10 days ago
  • EvoLM-1B-160BT-MixedFW8FM42-100k-polaris-GRPO
    Upload folder using huggingface_hub 10 days ago
  • EvoLM-1B-160BT-MixedFW8FM42-400k-evolm-GRPO-step300
    Upload folder using huggingface_hub 14 days ago
  • EvoLM-1B-160BT-MixedFW8FM42-400k-omega-GRPO-step300
    Upload folder using huggingface_hub 14 days ago
  • Llama-3.2-3B-evolm-GRPO-step300
    Upload folder using huggingface_hub 10 days ago
  • Llama-3.2-3B-omega-GRPO-step300
    Upload folder using huggingface_hub 10 days ago
  • Llama-3.2-3B-polaris-GRPO-step300
    Upload folder using huggingface_hub 10 days ago
  • Qwen2.5-Math-1.5B-evolm-GRPO-step300
    Upload folder using huggingface_hub 13 days ago
  • Qwen2.5-Math-1.5B-omega-GRPO-step300
    Upload folder using huggingface_hub 13 days ago
  • Qwen2.5-Math-1.5B-polaris-GRPO
    Upload folder using huggingface_hub 10 days ago
  • SFT
    Upload folder using huggingface_hub 10 days ago
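Since the listing totals 56.9 GB, it may be preferable to fetch a single checkpoint folder rather than the whole repository. The following is a minimal sketch, not the author's documented workflow. It assumes the folders above sit under a top-level `models/` directory (as the path on this page suggests), that this is a standard model repository, and that `huggingface_hub` is installed. The helper names `checkpoint_patterns` and `download_checkpoint` are hypothetical.

```python
REPO_ID = "DatPySci/PreRLVR-Controlled"  # owner/name as shown on this page


def checkpoint_patterns(folder: str, root: str = "models") -> list[str]:
    """Build the glob patterns that select the files of one checkpoint folder.

    `root` is an assumption taken from the "PreRLVR-Controlled / models" path.
    """
    return [f"{root}/{folder}/*"]


def download_checkpoint(folder: str) -> str:
    """Download one checkpoint folder and return the local snapshot directory."""
    # Imported lazily so checkpoint_patterns() works without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=checkpoint_patterns(folder),
    )
```

For example, `download_checkpoint("Llama-3.2-3B-omega-GRPO-step300")` should return a local snapshot directory, with the checkpoint files expected under `models/Llama-3.2-3B-omega-GRPO-step300/` inside it if the layout assumption holds.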