Improve model card: Add paper URL, pipeline tag and GitHub URL
This PR improves the model card by adding the paper URL, the pipeline tag, and the GitHub URL. It also adds the blog post URLs to the description.
README.md CHANGED
````diff
@@ -1,14 +1,15 @@
 ---
-library_name: transformers
+base_model:
+- Qwen/Qwen2.5-32B-Instruct
+- NovaSky-AI/Sky-T1-32B-Preview
 datasets:
 - BAAI/TACO
 - tasksource/PRM800K
 language:
 - en
-base_model:
-- Qwen/Qwen2.5-32B-Instruct
-- NovaSky-AI/Sky-T1-32B-Preview
+library_name: transformers
 license: apache-2.0
+pipeline_tag: text-generation
 ---
 
 ## Model Details
@@ -18,9 +19,11 @@ license: apache-2.0
 <!-- Provide a longer summary of what this model is. -->
 
 This is a 32B reasoning model preference optimized on top of Sky-T1-32B-Preview to significantly reduce generation lengths while maintaining accuracy. The performance is on par with o1-preview model in both math and coding, while reducing generation lengths by up to 57% relative to Sky-T1-32B-Preview.
-Please see our [blog post](https://novasky-ai.github.io/posts/reduce-overthinking/) for more details.
+Please see our [blog post](https://novasky-ai.github.io/posts/reduce-overthinking/) and [Sky-T1 blog post](https://novasky-ai.github.io/posts/sky-t1/) for more details.
 
 - **Developed by:** NovaSky Team from Sky Computing Lab at UC Berkeley.
+- **Paper:** [LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!](https://hf.co/papers/2502.07374)
+- **Code:** [https://github.com/NovaSky-AI/SkyThought](https://github.com/NovaSky-AI/SkyThought)
 
 ## Training Details
@@ -71,3 +74,4 @@ Please considering citing our blog post if you found it useful for your research
 note = {Accessed: 2025-01-23},
 year = {2025}
 }
+```
````
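For context, the `pipeline_tag: text-generation` and `library_name: transformers` fields added above are what let the Hub route this model through the standard `transformers` text-generation path. Below is a minimal sketch of what that enables; it is illustrative only. The repo id is an assumption (this diff never names the repository it belongs to), as are the prompt and generation settings.

```python
# Minimal sketch of what the new metadata enables; not part of this PR.
# ASSUMPTION: the repo id below is a guess based on the card's description.
# Substitute the actual repository this README belongs to.
from transformers import pipeline

# `library_name: transformers` tells the Hub which library loads the
# weights; `pipeline_tag: text-generation` declares the default task.
generator = pipeline(
    "text-generation",
    model="NovaSky-AI/Sky-T1-32B-Flash",  # assumed repo id
    torch_dtype="auto",   # take bf16/fp16 from the checkpoint config
    device_map="auto",    # 32B weights: shard across available GPUs
)

result = generator(
    "Prove that the product of two odd integers is odd.",
    max_new_tokens=2048,
    return_full_text=False,
)
print(result[0]["generated_text"])
```

Setting `pipeline_tag` explicitly also determines which task the Hub's inference widget and "Use this model" snippets target, rather than leaving the Hub to infer it from the model architecture.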