---
license: apache-2.0
language:
- en
base_model:
- Qwen/Qwen3-4B-Thinking-2507
pipeline_tag: text-generation
---
These are the checkpoints saved during training of OpenRLHF-Agent/Search-R1.