---
license: apache-2.0
inference: false
---
# Info
This is the model [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) with the intermediate (feed-forward) size cut from 14336 down to 3072, resulting in a ~2.81B-parameter model.
This model needs to be pre-trained before use, because at the moment it generates only gibberish.
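A minimal sketch of the kind of truncation described above (this is not the exact script used for this model): shrinking a Llama/Mistral-style MLP block by keeping only the first `new_inter` intermediate units of `gate_proj`, `up_proj`, and `down_proj`. Tiny dimensions are used so it runs instantly; for real Mistral-7B the hidden size is 4096 and the intermediate size goes 14336 → 3072.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy dimensions for illustration; real Mistral-7B: hidden=4096,
# intermediate 14336 truncated to 3072.
hidden_size, old_inter, new_inter = 32, 56, 12

gate_proj = nn.Linear(hidden_size, old_inter, bias=False)
up_proj = nn.Linear(hidden_size, old_inter, bias=False)
down_proj = nn.Linear(old_inter, hidden_size, bias=False)

def truncate(linear: nn.Linear, out_dim=None, in_dim=None) -> nn.Linear:
    """Return a new Linear keeping only the first out_dim rows / in_dim cols."""
    w = linear.weight.data
    if out_dim is not None:
        w = w[:out_dim, :]
    if in_dim is not None:
        w = w[:, :in_dim]
    new = nn.Linear(w.shape[1], w.shape[0], bias=False)
    new.weight.data = w.clone()
    return new

# gate/up project hidden -> intermediate, so cut their output dim;
# down projects intermediate -> hidden, so cut its input dim.
gate_proj = truncate(gate_proj, out_dim=new_inter)
up_proj = truncate(up_proj, out_dim=new_inter)
down_proj = truncate(down_proj, in_dim=new_inter)

# SwiGLU forward pass still works, output shape is unchanged.
x = torch.randn(1, hidden_size)
h = down_proj(F.silu(gate_proj(x)) * up_proj(x))
print(h.shape)  # torch.Size([1, 32])
```

Since the dropped intermediate units carried real signal, the truncated network no longer computes anything meaningful, which is why further pre-training is required.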