Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
BechusRantus
/
injected_thinking
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
injected_thinking
/
third_party
/
ms-swift
/
docs
/
source
/
Instruction
/
GRPO
/
AdvancedResearch
44.5 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
BechusRantus
Upload folder using huggingface_hub
7134ce7
verified
about 2 months ago
CHORD.md
2.82 kB
Upload folder using huggingface_hub
about 2 months ago
CISPO.md
2.88 kB
Upload folder using huggingface_hub
about 2 months ago
DAPO.md
3.78 kB
Upload folder using huggingface_hub
about 2 months ago
GSPO.md
3.54 kB
Upload folder using huggingface_hub
about 2 months ago
REINFORCEPP.md
3.36 kB
Upload folder using huggingface_hub
about 2 months ago
RLOO.md
3.89 kB
Upload folder using huggingface_hub
about 2 months ago
SAPO.md
3.74 kB
Upload folder using huggingface_hub
about 2 months ago
deepeyes.md
4.73 kB
Upload folder using huggingface_hub
about 2 months ago
entropy_mask.md
2.32 kB
Upload folder using huggingface_hub
about 2 months ago
index.rst
247 Bytes
Upload folder using huggingface_hub
about 2 months ago
training_inference_mismatch.md
9.5 kB
Upload folder using huggingface_hub
about 2 months ago
treepo.md
3.68 kB
Upload folder using huggingface_hub
about 2 months ago