Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

BechusRantus
/
injected_thinking

Safetensors
Model card Files Files and versions
xet
Community
injected_thinking / third_party /ms-swift /docs /source /Instruction /GRPO /DeveloperGuide
43.7 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
BechusRantus's picture
BechusRantus
Upload folder using huggingface_hub
7134ce7 verified about 2 months ago
  • gym_env.md
    7.85 kB
    Upload folder using huggingface_hub about 2 months ago
  • index.rst
    179 Bytes
    Upload folder using huggingface_hub about 2 months ago
  • loss_types.md
    3.97 kB
    Upload folder using huggingface_hub about 2 months ago
  • multi_task.md
    2.1 kB
    Upload folder using huggingface_hub about 2 months ago
  • multi_turn.md
    14.6 kB
    Upload folder using huggingface_hub about 2 months ago
  • reward_function.md
    8.2 kB
    Upload folder using huggingface_hub about 2 months ago
  • reward_model.md
    6.73 kB
    Upload folder using huggingface_hub about 2 months ago