Improve model card for TRAAC Qwen3-4B

#1
by nielsr HF Staff - opened

This PR significantly updates the model card for the joykirat/Qwen3-4B-TRAAC model.

Changes include:

  • Updating metadata: adding pipeline_tag: text-generation, license: mit, base_model: Qwen/Qwen3-4B, and relevant tags (qwen, qwen3, reasoning, rl, adaptive-inference, llm).
  • Replacing generic placeholder content with a detailed description of the model based on its paper and GitHub repository.
  • Adding direct links to the arXiv paper and the official GitHub repository, including a visual overview image.
  • Incorporating information about the developers, model type, language, and finetuning details.
  • Providing a clear "How to Get Started" section with installation and model download instructions. For comprehensive usage, evaluation, and training scripts, the card directs users to the GitHub repository, in keeping with the guideline of not creating custom code snippets.
  • Adding details about training and evaluation, summarizing key results.
  • Including the official BibTeX citation.
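
For reference, the metadata item in the list above would correspond to YAML front matter at the top of the README roughly like the following. This is a sketch assembled only from the values named in this description; the key order and list layout in the merged card are assumptions.

```yaml
---
# Values taken from the PR description; ordering and layout are assumed.
pipeline_tag: text-generation
license: mit
base_model: Qwen/Qwen3-4B
tags:
  - qwen
  - qwen3
  - reasoning
  - rl
  - adaptive-inference
  - llm
---
```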

These changes aim to provide a comprehensive and user-friendly model card for the Hugging Face Hub community.

joykirat changed pull request status to merged
