VictorDCh
/

Llama-3-8B-Instruct-MoE-2

Text Generation

text-generation-inference

Model card Files Files and versions

VictorDCh commited on Jun 3, 2024

Commit

4d5f9f4

·

verified ·

1 Parent(s): 5f6b9e8

Update README.md

Files changed (1) hide show

README.md +75 -1

README.md CHANGED Viewed

@@ -13,7 +13,81 @@ tags: []
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.

 ### Model Description
+MODEL_NAME = "Llama-3-8B-Instruct-MoE-2"
+yaml_config = """
+base_model: meta-llama/Meta-Llama-3-8B-Instruct
+experts_per_token: 2
+experts:
+  - source_model: meta-llama/Meta-Llama-3-8B-Instruct
+    positive_prompts:
+    - "What are the different"
+    - "what are the distinct"
+    - "Give me the unique"
+  - source_model: meta-llama/Meta-Llama-3-8B-Instruct
+    positive_prompts:
+    - "When"
+    - "when"
+    - "Where"
+    - "where"
+    - "Which"
+    - "which"
+    - "Who"
+    - "who"
+    - "What"
+    - "what"
+    - "Whom"
+    - "whom"
+    - "Whose"
+    - "whose"
+  - source_model: meta-llama/Meta-Llama-3-8B-Instruct
+    positive_prompts:
+    - "Larger"
+    - "larger"
+    - "Smaller"
+    - "smaller"
+    - "Bigger"
+    - "bigger"
+    - "Smallest"
+    - "smallest"
+    - "Largest"
+    - "largest"
+    - "Biggest"
+    - "biggest"
+    - "Most"
+    - "most"
+    - "Least"
+    - "least"
+    - "More"
+    - "more"
+    - "Less"
+    - "less"
+    - "Number"
+    - "number"
+    - "Numbers"
+    - "numbers"
+    - "Quantity"
+    - "quantity"
+    - "At least"
+    - "at least"
+    - "At most"
+    - "at most"
+    - "Greater"
+    - "greater"
+    - "Fewer"
+    - "fewer"
+    - "Than"
+    - "than"
+    - "Equal"
+    - "equal"
+    - "Same"
+    - "same"
+    - "Equal to"
+    - "equal to"
+  - source_model: meta-llama/Meta-Llama-3-8B-Instruct
+    positive_prompts:
+    - "that also"
+    - "who have the same"
+"""
 This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.