BechusRantus/injected_thinking
Safetensors
Branch: main
Path: injected_thinking/third_party/ms-swift/docs/source/Instruction/GRPO/DeveloperGuide
43.7 kB
1 contributor
History: 1 commit
Latest commit: BechusRantus, "Upload folder using huggingface_hub" (7134ce7, verified), about 2 months ago
File                  Size        Last commit message                    Updated
gym_env.md            7.85 kB     Upload folder using huggingface_hub    about 2 months ago
index.rst             179 Bytes   Upload folder using huggingface_hub    about 2 months ago
loss_types.md         3.97 kB     Upload folder using huggingface_hub    about 2 months ago
multi_task.md         2.1 kB      Upload folder using huggingface_hub    about 2 months ago
multi_turn.md         14.6 kB     Upload folder using huggingface_hub    about 2 months ago
reward_function.md    8.2 kB      Upload folder using huggingface_hub    about 2 months ago
reward_model.md       6.73 kB     Upload folder using huggingface_hub    about 2 months ago