Update README.md
Browse files
README.md
CHANGED
|
@@ -16,7 +16,7 @@ tags:
|
|
| 16 |
---
|
| 17 |
# **Phi4-Super**
|
| 18 |
|
| 19 |
-
[Phi-4-Super finetuned] from Microsoft's Phi-4 is a state-of-the-art open model developed with a focus on responsible problem solving and advanced reasoning capabilities. Built upon a diverse blend of synthetic datasets, carefully filtered public domain websites, and high-quality academic books and Q&A datasets, Phi-4-
|
| 20 |
|
| 21 |
Phi-4-Super adopts a robust safety post-training approach using open-source and in-house synthetic datasets. This involves a combination of SFT (Supervised Fine-Tuning) and iterative DPO (Direct Preference Optimization) techniques, ensuring helpful and harmless outputs across various safety categories.
|
| 22 |
|
|
|
|
| 16 |
---
|
| 17 |
# **Phi4-Super**
|
| 18 |
|
| 19 |
+
[Phi-4-Super finetuned] from Microsoft's Phi-4 is a state-of-the-art open model developed with a focus on responsible problem solving and advanced reasoning capabilities. Built upon a diverse blend of synthetic datasets, carefully filtered public domain websites, and high-quality academic books and Q&A datasets, Phi-4-Super ensures that small, capable models are trained with datasets of exceptional depth and precision.
|
| 20 |
|
| 21 |
Phi-4-Super adopts a robust safety post-training approach using open-source and in-house synthetic datasets. This involves a combination of SFT (Supervised Fine-Tuning) and iterative DPO (Direct Preference Optimization) techniques, ensuring helpful and harmless outputs across various safety categories.
|
| 22 |
|