Commit
·
189873c
1
Parent(s):
3ff1322
Update README.md
Browse files
README.md
CHANGED
|
@@ -12,13 +12,17 @@ tags:
|
|
| 12 |
## Model Architecture
|
| 13 |
The Mistral-7B-v0.1 model is a transformer-based model that can handle a variety of tasks including but not limited to translation, summarization, and text completion. It's particularly designed for the Italian language and can be fine-tuned for specific tasks.
|
| 14 |
|
| 15 |
-
|
| 16 |
[Leaderboard Ita LLM](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard)
|
| 17 |
|
| 18 |
| hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|
| 19 |
|:----------------------| :--------------- | :-------------------- | :------- |
|
| 20 |
| 0.6734 | 0.5466 | 0.5334 | 0,5844 |
|
| 21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
## How to Use
|
| 23 |
How to utilize my Mistral for Italian text generation
|
| 24 |
|
|
|
|
| 12 |
## Model Architecture
|
| 13 |
The Mistral-7B-v0.1 model is a transformer-based model that can handle a variety of tasks including but not limited to translation, summarization, and text completion. It's particularly designed for the Italian language and can be fine-tuned for specific tasks.
|
| 14 |
|
| 15 |
+
## Evaluation
|
| 16 |
[Leaderboard Ita LLM](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard)
|
| 17 |
|
| 18 |
| hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|
| 19 |
|:----------------------| :--------------- | :-------------------- | :------- |
|
| 20 |
| 0.6734 | 0.5466 | 0.5334 | 0,5844 |
|
| 21 |
|
| 22 |
+
## Quantized Version
|
| 23 |
+
|
| 24 |
+
[4_bit_version](https://huggingface.co/DeepMount00/Mistral-Ita-7b-GGUF)
|
| 25 |
+
|
| 26 |
## How to Use
|
| 27 |
How to utilize my Mistral for Italian text generation
|
| 28 |
|