#### Conclusion
Llama 2 understands the question and gives the user a very specific and overall better answer compared to the one given by the fine-tuned model. However, the fine-tuned model answers with a sentence written in perfect Italian, which is what we were trying to achieve with this fine-tuning process.
### Training Data and Details
The dataset used is [seeweb/Seeweb-it-292-forLLM](https://huggingface.co/datasets/seeweb/Seeweb-it-292-forLLM), a dataset containing approximately 300 Italian prompt-answer conversations.
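Prompt-answer pairs like these are typically flattened into a single training string before fine-tuning. A minimal sketch of that step, assuming `prompt`/`answer` field names (check the actual columns of the dataset before use) and Llama 2's `[INST]` instruction format:

```python
# Minimal sketch: turn one prompt-answer record into a Llama 2-style
# training string. The field names "prompt" and "answer" are assumptions,
# not confirmed columns of seeweb/Seeweb-it-292-forLLM.
def format_example(record: dict) -> str:
    # Llama 2 chat models expect the instruction wrapped in [INST] ... [/INST]
    return f"<s>[INST] {record['prompt']} [/INST] {record['answer']} </s>"

example = {
    "prompt": "Che cos'e' un Cloud Server GPU?",
    "answer": "Un server virtuale dotato di una GPU dedicata.",
}
print(format_example(example))
```

A function like this would be mapped over the whole dataset to produce the text fed to the tokenizer.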
The training was performed on an RTX A6000, inside [Seeweb's Cloud Server GPU](https://www.seeweb.it/prodotti/cloud-server-gpu).
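As a rough sanity check on that hardware choice (the 7B parameter count and fp16 precision are assumptions, since the README does not state which Llama 2 variant was used), the base weights alone fit comfortably in the A6000's 48 GB of VRAM:

```python
# Back-of-the-envelope VRAM estimate. Assumptions: Llama 2 7B
# (7e9 parameters) loaded in fp16 (2 bytes per parameter).
params = 7_000_000_000
bytes_per_param = 2  # fp16
weights_gb = params * bytes_per_param / 1024**3

print(f"weights: {weights_gb:.1f} GB")  # ~13 GB of the A6000's 48 GB
```

Note this counts only the frozen weights; gradients and optimizer states add substantially more, which is why parameter-efficient methods such as LoRA are common on a single GPU.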
### What next?
The model still needs improvement: a much larger dataset has to be built so that the model can learn many more ways to answer.