Joseph717171
/

Genstruct-10.7B

Text Generation

text-generation-inference

Model card Files Files and versions

Joseph717171 commited on Mar 30, 2024

Commit

398ba37

·

verified ·

1 Parent(s): 2acb90c

Update README.md

Files changed (1) hide show

README.md +18 -3

README.md CHANGED Viewed

@@ -1,13 +1,28 @@
 ---
-base_model: []
-library_name: transformers
 tags:
 - mergekit
 - merge
 ---
 # Genstruct-10.7B
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details

 ---
+base_model: mistralai/Mistral-7B-v0.1
 tags:
+- Mistral
+- instruct
+- finetune
+- synthetic
 - mergekit
 - merge
+license: apache-2.0
+language:
+- en
+library_name: transformers
 ---
+# Credit for the model card's description goes to ddh0, mergekit, and, migtissera
+# Inspired by ddh0/Starling-LM-10.7B-beta and ddh0/Mistral-10.7B-Instruct-v0.2
 # Genstruct-10.7B
+This is Genstruct-10.7B, a depth-upscaled version of [NousResearch/Genstruct-7B](https://huggingface.co/NousResearch/Genstruct-7B).
+This model is intended to be used as a basis for further fine-tuning, or as a drop-in upgrade from the original 7 billion parameter model.
+Paper detailing how Depth-Up Scaling works:  [SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling](https://arxiv.org/abs/2312.15166)
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details