Add model card and metadata
#1
by nielsr HF Staff - opened
This PR adds a model card for the Composition-RL-8B model.
It includes:
- Metadata for the
text-generationpipeline andtransformerslibrary. - A link to the paper: Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models.
- A link to the official GitHub repository.
- Details regarding the model's initialization from Qwen3-8B-Base and its training on the MATH-Composition-199K dataset.
- The corresponding BibTeX citation.
xx18 changed pull request status to merged