Mini-Llama
Collection
The Mini-Llama series has been created to provide a modern interpretation on the classic text-only Llama experience, based on Ministral 3.
•
22 items
•
Updated
My base pretrain model has undergone full fine-tuning on an additional 350M tokens using portions of Tulu 3 and Nvidia Nemotron instruct sets. It is rough but functionsl, and still needs DPO training to align it with human preferences.
For the base pretrain, see: Nabbers1999/Mini-Llama-8B-Base-0124