Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

DatPySci
/
PreRLVR-Controlled

Safetensors
Model card Files Files and versions
xet
Community
PreRLVR-Controlled
1.06 GB
  • 1 contributor
History: 15 commits
DatPySci's picture
DatPySci
Delete models/EvoLM-1B-160BT-MixedFW8FM42-GRPO-alpha-0.0-step300
681ba7c verified 23 days ago
  • Data
    Upload folder using huggingface_hub 27 days ago
  • EvoLM-1B-160BT-Warmup-LoRA-RL-step100
    Upload folder using huggingface_hub 27 days ago
  • EvoLM-1B-160BT-Warmup-LoRA-RL-step150
    Upload folder using huggingface_hub 27 days ago
  • EvoLM-1B-160BT-Warmup-LoRA-RL-step200
    Upload folder using huggingface_hub 27 days ago
  • EvoLM-1B-160BT-Warmup-LoRA-RL-step250
    Upload folder using huggingface_hub 27 days ago
  • EvoLM-1B-160BT-Warmup-LoRA-RL-step300
    Upload folder using huggingface_hub 27 days ago
  • EvoLM-1B-160BT-Warmup-LoRA-RL-step50
    Upload folder using huggingface_hub 27 days ago
  • models
    Delete models/EvoLM-1B-160BT-MixedFW8FM42-GRPO-alpha-0.0-step300 23 days ago
  • .gitattributes
    1.52 kB
    initial commit 27 days ago
  • verl.zip
    21.6 MB
    xet
    Upload verl.zip with huggingface_hub 24 days ago