This is a LoRA fine-tune of GPT-J-6B - https://huggingface.co/EleutherAI/gpt-j-6B

The dataset is the cleaned version of the Alpaca dataset - https://github.com/gururise/AlpacaDataCleaned
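
As a rough illustration of what the setup looks like (a sketch, not the exact code used for this model; the LoRA hyperparameters below are assumptions), GPT-J can be wrapped with `peft` like so:

```python
# Sketch of a LoRA setup for GPT-J-6B with peft.
# The LoRA hyperparameters here are illustrative assumptions,
# not the values used to train this model.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base_model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B",
    torch_dtype=torch.float16,
    device_map="auto",
)

lora_config = LoraConfig(
    r=8,                                  # assumed low-rank dimension
    lora_alpha=16,                        # assumed scaling factor
    target_modules=["q_proj", "v_proj"],  # GPT-J attention projections
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)

# Freeze the base weights and train only the small LoRA matrices.
model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # a tiny fraction of the 6B parameters
```

Training then proceeds as a normal causal-LM fine-tune (e.g. with `transformers.Trainer`) over Alpaca-style instruction/response pairs.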
A model similar to this has been discussed before.

The performance is good, but not as good as the original Alpaca, which was trained from a LLaMA base model.

This is mostly because the LLaMA 7B model was pretrained on 1T tokens, while GPT-J-6B was trained on roughly 400B tokens.
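
A minimal inference sketch using `transformers` and `peft` (the adapter id below is a placeholder for this repository's id or a local path to the LoRA weights):

```python
# Minimal inference sketch; "ADAPTER_REPO_OR_PATH" is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-j-6B")
base_model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/gpt-j-6B",
    torch_dtype=torch.float16,
    device_map="auto",
)

# Load the LoRA adapter weights on top of the frozen base model.
model = PeftModel.from_pretrained(base_model, "ADAPTER_REPO_OR_PATH")

# Alpaca-style instruction prompt, matching the training data format.
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\nExplain LoRA in one sentence.\n\n"
    "### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```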