| license: apache-2.0 | |
| EDIT: Base Mistral, but with some minor head trauma. Promising in theory, needs finetuning, but only really outperforms the 14b in size. | |
| A 11b Mistral base model, based on the NeverSleep recipe. | |
| ### Recipe | |
| slices | |
| - sources: | |
| - | |
| - model: mistralai/Mistral-7B-v0.1 | |
| - | |
| layer_range: [0, 24] | |
| - sources: | |
| - | |
| - model: mistralai/Mistral-7B-v0.1 | |
| - | |
| layer_range: [8, 32] | |
| merge_method: passthrough | |
| dtype: bfloat16 |