This is not a 7B. It's a ~9B. Please label appropriately.

by georgewalker - opened Jan 10, 2024

Jan 10, 2024

Like several of the top '7B' models on the leaderboard, this is actually a 9B, downstream of https://huggingface.co/zyh3826/GML-Mistral-merged-v1, a merge that combined the first 32 layers (ie all of them) of one Mistral-7B finetune with the last 8 layers of another Mistral finetune, creating a model that is about 9B parameters.

It is helpful to label model sizes appropriately. Better would be if Huggingface labeled models based on their file size and bpw, instead of allowing for these sorts of mistakes to occur and proliferate, as one mislabeled model begets others derived from it.

Ont

Jan 14, 2024

•

edited Jan 14, 2024

Some responses from this model appear better considered than those of some other 7B models, but this model employs 25% more layers to achieve its winning performance. I agree that this model best be labeled as a 9B model.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment