JingyaoLi committed (verified) · Commit e462a06 · 1 Parent(s): 62be290

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +15 -10
README.md CHANGED
@@ -15,21 +15,26 @@ model-index:
  should probably proofread and complete it, then remove this comment. -->
 
  # ScienceLLaMA-3B
- This model is a fine-tuned version of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) on the OpenMathInstruct-2-1M and the metamath_gsm8k datasets.
- 
- ## Model description
- 
- More information needed
- 
- ## Intended uses & limitations
- 
- More information needed
- 
- ## Training and evaluation data
- 
- More information needed
+ <p align="center">
+ • 🤗 <a href="https://huggingface.co/datasets/JingyaoLi/Science-Logits-1.2M" target="_blank">Data</a>
+ • 🤗 <a href="https://huggingface.co/JingyaoLi/ScienceLLaMA-3b" target="_blank">ScienceLLaMA-3B</a>
+ • 🤗 <a href="https://huggingface.co/JingyaoLi/ScienceLLaMA-1b" target="_blank">ScienceLLaMA-1B</a>
+ • 🐱 <a href="Logits-based Finetuning" target="_blank">Code</a>
+ • 📃 Paper (to be released) <br>
+ </p>
+ 
+ This model is fine-tuned with **Logits-Based Finetuning** on [JingyaoLi/Science-Logits-1.2M](https://huggingface.co/datasets/JingyaoLi/Science-Logits-1.2M). The method integrates the strengths of supervised learning and knowledge distillation by combining teacher logits with ground-truth labels, preserving both correctness and linguistic diversity.
+ 
+ <div style="text-align: center;">
+ <img src="./images/example.png" alt="example" />
+ </div>
 
  ## Training procedure
+ <div style="text-align: center;">
+ <img src="./images/performance.png" alt="performance" />
+ </div>
+ 
 
  ### Training hyperparameters
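
The card describes Logits-Based Finetuning as blending teacher logits with ground-truth labels. A minimal sketch of such a combined loss is below — this is an illustrative reading of that description, not the authors' released code; the function name and the `alpha`/`temperature` hyperparameters are assumptions for the example.

```python
import torch
import torch.nn.functional as F

def logits_based_loss(student_logits, teacher_logits, labels,
                      alpha=0.5, temperature=1.0):
    """Sketch of a combined objective: cross-entropy on ground-truth labels
    plus a KL term toward cached teacher logits.

    student_logits, teacher_logits: (batch, vocab); labels: (batch,).
    """
    # Supervised term: standard cross-entropy against the ground-truth labels.
    ce = F.cross_entropy(student_logits, labels)
    # Distillation term: KL(teacher || student) on temperature-softened logits.
    kl = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.log_softmax(teacher_logits / temperature, dim=-1),
        log_target=True,
        reduction="batchmean",
    ) * temperature ** 2
    # alpha trades label correctness against the teacher's output diversity.
    return alpha * ce + (1 - alpha) * kl
```

With `alpha=1.0` the loss reduces to plain supervised cross-entropy; with `alpha=0.0` it is pure distillation toward the teacher distribution.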