Update README.md
README.md
CHANGED
@@ -15,7 +15,7 @@ tags:
 <img src="https://huggingface.co/EpistemeAI/Fireball-Mistral-Nemo-Base-2407-v1-DPO2/resolve/main/fireball.JPG" width="200"/>
 
 
-# Fireball-
+# Fireball-12B
 This model is super fine-tune to provide better coding and better response(from first fine-tune) than Llama-3.1-8B and Google Gemma 2 9B.
 Further fine tuned with ORPO method with dataset
 - reciperesearch/dolphin-sft-v0.1-preference
@@ -46,7 +46,7 @@ This mistral model was trained 2x faster with [Unsloth](https://github.com/unslo
 
 
 
-# Model Card for
+# Model Card for Fireball-12B
 
 The Mistral-Nemo-Base-2407 Large Language Model (LLM) is a pretrained generative text model of 12B parameters trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.
 