Junhoee
/

Qwen-Megumin

Text Generation

Model card Files Files and versions

Junhoee commited on Nov 26, 2024

Commit

4a8bf48

·

verified ·

1 Parent(s): d61e50e

Update README.md

Files changed (1) hide show

README.md +16 -6

README.md CHANGED Viewed

@@ -2,23 +2,33 @@
 base_model: Qwen/Qwen2.5-7B-Instruct
 library_name: peft
 pipeline_tag: text-generation
 ---
 # Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
 ### Model Description
-<!-- Provide a longer summary of what this model is. -->
-- **Developed by:** [More Information Needed]
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]

 base_model: Qwen/Qwen2.5-7B-Instruct
 library_name: peft
 pipeline_tag: text-generation
+language:
+- en
+tags:
+- persona
+- llm
 ---
 # Model Card for Model ID
+I developed a persona LLM, also known as a role-play LLM.
+The character is modeled after Megumin, a character from the novel Blessing of this Wonderful World.
 ## Model Details
 ### Model Description
+This model is fine-tuned using the Qwen/Qwen2.5-7B-Instruct model as a mother model.
+Due to the lack of GPU memory and resources, we used the QLoRA method to train only certain layer parts.
+The learning factors were as follows
+- learning_rate=5e-5
+- lr_scheduler_type="cosine"
+- warmup_steps=800
+- num_train_epochs=5
+- per_device_train_batch_size=8
+- **Developed by:** [Junhoee Ku](https://github.com/junhoeKu)
 - **Funded by [optional]:** [More Information Needed]
 - **Shared by [optional]:** [More Information Needed]
 - **Model type:** [More Information Needed]