Update README.md
README.md
CHANGED
@@ -6,33 +6,58 @@ tags:
 - base_model:adapter:microsoft/Phi-3-mini-4k-instruct
 - lora
 - transformers
+license: apache-2.0
+datasets:
+- teknium/OpenHermes-2.5
+- Magpie-Align/Magpie-Phi3-Pro-300K-Filtered
+language:
+- en
 ---
 
 # Model Card for Model ID
 
-<!-- Provide a quick summary of what the model is/does. -->
+Phi-3-Mini-OpenHermes-Magpie-V1 is a general-purpose model trained on both the teknium/OpenHermes-2.5 and Magpie-Align/Magpie-Phi3-Pro-300K-Filtered datasets,
+and designed to provide speed, efficiency, and intelligence.
 
 
 
 ## Model Details
+OpenHermes dataset:
+- Epochs: 1
+- Batch size: 8
+- Gradient accumulation steps: 1
+- Learning rate: 5e-5
+- LoRA r: 16
+- LoRA alpha: 32
+- Warmup steps: 300
+- Eval steps: 500
+- Trained only on the attention layers.
+
+Magpie dataset:
+- Epochs: 1
+- Batch size: 16
+- Gradient accumulation steps: 1
+- Learning rate: 1e-4
+- LoRA r: 16
+- LoRA alpha: 32
+- Warmup steps: 150
+- Eval steps: 500
+- Trained on the gate, up, and down projection layers.
 
 ### Model Description
 
-<!-- Provide a longer summary of what this model is. -->
+This model excels at creating bullet-point formatting, while still maintaining
 
 
 
-- **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
+- **Developed by:** Turtle170 (anonymous)
+- **Language(s) (NLP):** English
+- **License:** apache-2.0
+- **Finetuned from model:** Phi-3-Mini-4k-Instruct with turtle170/Phi-3-Mini-OpenHermes-V1 adapters
 
 ### Model Sources [optional]
 
 <!-- Provide the basic links for the model. -->
 
 - **Repository:** [More Information Needed]
 - **Paper [optional]:** [More Information Needed]
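For readers who want to reproduce the setup, the hyperparameters added under "## Model Details" map directly onto the standard peft + transformers stack. The sketch below is an illustration of that mapping, not the author's actual training script: the output paths are invented, and the target-module names follow the Hugging Face Phi-3 implementation (which fuses the attention projections into `qkv_proj`/`o_proj` and the MLP gate/up projections into `gate_up_proj`), so they may differ from whatever module names the author actually targeted.

```python
# Sketch of the two LoRA stages described in "Model Details".
# Values mirror the card; structure and names are illustrative.
from peft import LoraConfig
from transformers import TrainingArguments

# Stage 1: teknium/OpenHermes-2.5 -- attention layers only.
# Module names assume the HF Phi-3 code, which fuses q/k/v into qkv_proj.
openhermes_lora = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["qkv_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
openhermes_args = TrainingArguments(
    output_dir="phi3-openhermes-stage",  # illustrative path
    num_train_epochs=1,
    per_device_train_batch_size=8,
    gradient_accumulation_steps=1,
    learning_rate=5e-5,
    warmup_steps=300,
    eval_strategy="steps",  # evaluation_strategy in older transformers
    eval_steps=500,
)

# Stage 2: Magpie-Align/Magpie-Phi3-Pro-300K-Filtered -- gate, up, and
# down projections, fused as gate_up_proj/down_proj in the HF Phi-3 code.
magpie_lora = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["gate_up_proj", "down_proj"],
    task_type="CAUSAL_LM",
)
magpie_args = TrainingArguments(
    output_dir="phi3-magpie-stage",  # illustrative path
    num_train_epochs=1,
    per_device_train_batch_size=16,
    gradient_accumulation_steps=1,
    learning_rate=1e-4,
    warmup_steps=150,
    eval_strategy="steps",
    eval_steps=500,
)
```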
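Since the card describes LoRA adapters over microsoft/Phi-3-mini-4k-instruct, inference presumably goes through peft's `PeftModel`. A minimal loading sketch follows, using the turtle170/Phi-3-Mini-OpenHermes-V1 adapter id that the card names (this model's own Magpie-stage adapter would load the same way with its repo id); the prompt and generation settings are illustrative.

```python
# Sketch: load the base model and attach a LoRA adapter with peft.
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE = "microsoft/Phi-3-mini-4k-instruct"

# Older transformers releases may additionally need trust_remote_code=True.
base = AutoModelForCausalLM.from_pretrained(BASE, torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(BASE)

# Adapter id taken from the card; swap in this model's repo id as needed.
model = PeftModel.from_pretrained(base, "turtle170/Phi-3-Mini-OpenHermes-V1")

# Illustrative prompt using the Phi-3 chat template.
input_ids = tokenizer.apply_chat_template(
    [{"role": "user", "content": "List three uses of LoRA."}],
    add_generation_prompt=True,
    return_tensors="pt",
)
output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```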