PahaII/or_v2_rl_v0.1
Tags: Reinforcement Learning, Safetensors, qwen3_5_moe_text, openresearcher, rloo, multi-turn-agent, tool-use
License: apache-2.0
Branch: main (69.3 GB)
1 contributor
History: 3 commits
Latest commit: 3ee588c, "Upload global_step_62_fused" (PahaII, verified, 13 days ago)
File                               Size       Badges     Last commit                             Updated
.gitattributes                     1.57 kB    Safe       Upload global_step_62_fused             13 days ago
README.md                          1.37 kB    Safe       Upload README.md with huggingface_hub   13 days ago
chat_template.jinja                7.76 kB    Safe       Upload global_step_62_fused             13 days ago
config.json                        2.34 kB               Upload global_step_62_fused             13 days ago
generation_config.json             199 Bytes             Upload global_step_62_fused             13 days ago
model-00001-of-00002.safetensors   49.7 GB    xet        Upload global_step_62_fused             13 days ago
model-00002-of-00002.safetensors   19.6 GB    xet        Upload global_step_62_fused             13 days ago
model.safetensors.index.json       3.31 MB    Safe       Upload global_step_62_fused             13 days ago
tokenizer.json                     20 MB      Safe, xet  Upload global_step_62_fused             13 days ago
tokenizer_config.json              1.12 kB    Safe       Upload global_step_62_fused             13 days ago
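
The two model-0000X-of-00002.safetensors shards are tied together by model.safetensors.index.json. As a rough sketch, assuming that file follows the usual sharded-checkpoint layout (a top-level "weight_map" object mapping each tensor name to the shard file that stores it), the tensors can be grouped per shard as below. The helper name tensors_per_shard and the tensor names in the miniature index are hypothetical and are not read from this repository.

```python
import json
from collections import defaultdict


def tensors_per_shard(index_text: str) -> dict[str, list[str]]:
    """Group tensor names by the shard file listed in "weight_map"."""
    index = json.loads(index_text)
    shards: dict[str, list[str]] = defaultdict(list)
    for tensor_name, shard_file in index["weight_map"].items():
        shards[shard_file].append(tensor_name)
    # Sort the names so the result is deterministic.
    return {shard: sorted(names) for shard, names in shards.items()}


# Hypothetical miniature index; the real file in this repo is 3.31 MB.
example_index = json.dumps({
    "metadata": {"total_size": 12},
    "weight_map": {
        "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
        "model.norm.weight": "model-00002-of-00002.safetensors",
        "lm_head.weight": "model-00002-of-00002.safetensors",
    },
})

shard_map = tensors_per_shard(example_index)
for shard, names in sorted(shard_map.items()):
    print(shard, len(names))
```

To run this against the real index, pass the text of a downloaded model.safetensors.index.json in place of example_index.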