zjunlp
/

KnowRL-DeepSeek-R1-Distill-Qwen-7B

Add library_name and pipeline_tag metadata

by nielsr HF Staff - opened Apr 17

←

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,5 +1,10 @@
 ---
 license: mit
 ---
 <div align="center">
@@ -7,9 +12,9 @@ license: mit
 <h3 align="center"> Exploring Knowledgeable Reinforcement Learning for Factuality </h3>
 <p align="center">
-  <a href="https://arxiv.org/abs/2506.19807">📄arXiv</a> •
-  <a href="https://github.com/zjunlp/KnowRL">💻GitHub Repo</a> •
-  <a href="https://huggingface.co/datasets/zjunlp/KnowRL-Train-Data">📖Dataset</a>
 </p>
 </div>
@@ -67,8 +72,7 @@ huggingface-cli download zjunlp/KnowRL-DeepSeek-R1-Distill-Qwen-7B --local-dir K
 The model is trained using Knowledgeable Reinforcement Learning (RL) (specifically GRPO) using data from the `zjunlp/KnowRL-Train-Data`.
-For complete details on the training configuration and hyperparameters, please refer to our [GitHub repository](https://github.com/zjunlp/KnowRL
-).
 ---
@@ -81,7 +85,4 @@ If you find this model useful in your research, please consider citing our paper
   journal={arXiv preprint arXiv:2506.19807},
   year={2025}
 }
-```

 ---
 license: mit
+library_name: transformers
+pipeline_tag: text-generation
+base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
+datasets:
+- zjunlp/KnowRL-Train-Data
 ---
 <div align="center">
 <h3 align="center"> Exploring Knowledgeable Reinforcement Learning for Factuality </h3>
 <p align="center">
+  <a href="https://arxiv.org/abs/2506.19807">📄arXiv</a> •
+  <a href="https://github.com/zjunlp/KnowRL">💻GitHub Repo</a> •
+  <a href="https://huggingface.co/datasets/zjunlp/KnowRL-Train-Data">📖Dataset</a>
 </p>
 </div>
 The model is trained using Knowledgeable Reinforcement Learning (RL) (specifically GRPO) using data from the `zjunlp/KnowRL-Train-Data`.
+For complete details on the training configuration and hyperparameters, please refer to our [GitHub repository](https://github.com/zjunlp/KnowRL).
 ---
   journal={arXiv preprint arXiv:2506.19807},
   year={2025}
 }
+```