Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

zz1358m
/
SofT-GRPO-master

TensorBoard
Safetensors
Model card Files Files and versions
xet
Metrics Training metrics Community
1
SofT-GRPO-master
52.2 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 18 commits
zz1358m's picture
zz1358m
Update README.md
128d4db verified 6 months ago
  • Results
    Delete Results/training curves/tensorboard_show-1.5B/run.txt 7 months ago
  • Soft-Thinking+noise+loss-main
    Upload 4 files 7 months ago
  • assets
    Upload mainprocess.png 7 months ago
  • saved_weight
    Upload folder using huggingface_hub 7 months ago
  • verl-0.4.x
    Upload folder using huggingface_hub 7 months ago
  • .gitattributes
    3.26 kB
    Upload mainprocess.png 7 months ago
  • .gitignore
    39 Bytes
    Upload folder using huggingface_hub 7 months ago
  • README.md
    5.57 kB
    Update README.md 6 months ago
  • SofT-GRPO-deepscaler-8k-dir.sh
    3.18 kB
    Upload 5 files 7 months ago
  • SofT-GRPO-deepscaler-8k-gau.sh
    3.12 kB
    Upload 5 files 7 months ago
  • SofT-GRPO-deepscaler-8k-llama3.sh
    3.28 kB
    Update SofT-GRPO-deepscaler-8k-llama3.sh 7 months ago
  • SofT-GRPO-deepscaler-8k-qwen7.sh
    3.3 kB
    Update SofT-GRPO-deepscaler-8k-qwen7.sh 7 months ago
  • SofT-GRPO-deepscaler-8k.sh
    3.28 kB
    Update SofT-GRPO-deepscaler-8k.sh 7 months ago
  • requirements.txt
    4.01 kB
    Upload 2 files 7 months ago