amkyawdev committed · Commit c01e23f · verified · 1 Parent(s): c9e62d0

Upload model_card.yaml with huggingface_hub

  1. model_card.yaml +108 -0
model_card.yaml ADDED
@@ -0,0 +1,108 @@
+
+ ---
+ language:
+ - my
+ - en
+ tags:
+ - myanmar
+ - burmese
+ - llm
+ - code-generation
+ - fine-tuned
+ - lora
+ - phi-2
+ - custom_code
+ - transformers
+ - peft
+ - ai
+ - coding
+ - conversational-ai
+ - nlp
+ license: mit
+ datasets:
+ - amkyawdev/myanmar-llm-data
+ base_model: microsoft/phi-2
+ model-index:
+ - name: mm-llm-coder-lite-v1
+ results: []
+ ---
+
+ # Model Card: mm-llm-coder-lite-v1
+
+ ## Model Details
+
+ - **Model Name**: mm-llm-coder-lite-v1
+ - **Base Model**: microsoft/phi-2
+ - **Model Type**: Large Language Model (LLM)
+ - **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
+ - **Languages**: Burmese (Myanmar), English
+ - **License**: MIT
+
+ ## Training Details
+
+ - **Training Epochs**: 3
+ - **Learning Rate**: 2e-4
+ - **LoRA Rank (r)**: 16
+ - **LoRA Alpha**: 32
+ - **LoRA Dropout**: 0.05
+ - **Max Length**: 512
+ - **Batch Size**: 4
+ - **Gradient Accumulation**: 4
+ - **Training Framework**: Hugging Face Transformers + PEFT + TRL
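
The hyperparameters above imply a few derived quantities that are worth making explicit. The following is a minimal sketch in plain Python, not the author's training script. The `TrainingSetup` class and its method names are hypothetical, and it assumes that the batch size is per device, that training ran on a single device, and that the approximate train split size of 20,327 from the Dataset section applies. The step count is an estimate; the exact number depends on how the trainer handles the final partial batch.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingSetup:
    """Hyperparameters copied from the Training Details list (hypothetical container)."""

    epochs: int = 3
    learning_rate: float = 2e-4
    lora_rank: int = 16
    lora_alpha: int = 32
    lora_dropout: float = 0.05
    max_length: int = 512
    batch_size: int = 4
    gradient_accumulation: int = 4

    def effective_batch_size(self, num_devices: int = 1) -> int:
        # Samples contributing to one optimizer update.
        return self.batch_size * self.gradient_accumulation * num_devices

    def lora_scaling(self) -> float:
        # Standard LoRA scales the low-rank update by alpha / r.
        return self.lora_alpha / self.lora_rank

    def optimizer_steps(self, train_samples: int, num_devices: int = 1) -> int:
        # Rough estimate: one update per effective batch, rounded up per epoch.
        per_epoch = math.ceil(train_samples / self.effective_batch_size(num_devices))
        return per_epoch * self.epochs


if __name__ == "__main__":
    setup = TrainingSetup()
    print("effective batch size:", setup.effective_batch_size())
    print("LoRA scaling (alpha / r):", setup.lora_scaling())
    print("estimated optimizer steps:", setup.optimizer_steps(20_327))
```

Under these assumptions the effective batch size is 16, the LoRA scaling factor is 2.0, and three epochs come to roughly 3,800 optimizer steps.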
+
+ ## Dataset
+
+ - **Training Data**: amkyawdev/myanmar-llm-data
+ - **Train Samples**: ~20,327
+ - **Test Samples**: ~17,155
+ - **Validation Samples**: ~17,071
+
+ ### Data Distribution
+
+ | Tag | Description | Percentage |
+ |-----|-------------|------------|
+ | coding | Programming conversations | 90% |
+ | translation | English-Myanmar translation | 1% |
+ | general | General knowledge Q&A | 1% |
+ | greeting | Burmese greetings | 1% |
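
The percentages in the table can be converted into approximate sample counts. The sketch below assumes the distribution applies to the training split (~20,327 samples), which the card does not state; `DISTRIBUTION`, `TRAIN_SAMPLES`, and `summarize` are hypothetical names used only for illustration. It also reports how much of the data the listed rows account for.

```python
# Percentages as listed in the Data Distribution table.
DISTRIBUTION = {"coding": 90, "translation": 1, "general": 1, "greeting": 1}

# Approximate train split size from the Dataset section (assumed to be the
# split that the percentages describe).
TRAIN_SAMPLES = 20_327


def summarize(distribution: dict[str, int], total_samples: int):
    """Return (listed percent, unlisted percent, approximate count per tag)."""
    listed = sum(distribution.values())
    counts = {
        tag: round(total_samples * percent / 100)
        for tag, percent in distribution.items()
    }
    return listed, 100 - listed, counts


if __name__ == "__main__":
    listed, unlisted, counts = summarize(DISTRIBUTION, TRAIN_SAMPLES)
    print(f"listed tags cover {listed}% of the data; {unlisted}% is not attributed")
    for tag, count in counts.items():
        print(f"{tag}: ~{count} samples")
```

The listed rows sum to 93%, so about 7% of the data is not attributed to any tag in the table.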
+
+ ## Usage
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ # Load the fine-tuned model and its tokenizer from the Hugging Face Hub.
+ repo_id = "amkyawdev/mm-llm-coder-lite-v1"
+ model = AutoModelForCausalLM.from_pretrained(repo_id)
+ tokenizer = AutoTokenizer.from_pretrained(repo_id)
+ ```
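
Once the model and tokenizer are loaded, text can be generated with the standard Transformers calls. The helper below is a sketch rather than an official inference recipe for this model: `generate_text` is a hypothetical name, the default `max_new_tokens` value is an arbitrary assumption, and the function expects the `model` and `tokenizer` objects created by the loading snippet above.

```python
def generate_text(model, tokenizer, prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a completion for `prompt` with a causal language model.

    `model` and `tokenizer` are expected to be the objects returned by
    AutoModelForCausalLM.from_pretrained and AutoTokenizer.from_pretrained.
    """
    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")

    # Generate up to `max_new_tokens` tokens after the prompt.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # For a causal LM the returned sequence contains the prompt followed by
    # the newly generated tokens, so the decoded text repeats the prompt.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call such as `generate_text(model, tokenizer, prompt)` returns the prompt followed by the model's continuation; the prompt itself should follow the layout described in the Prompt Format section.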
+
+ ## Prompt Format
+
+ ```
+ System: <system_prompt>
+
+ User: <user_message>
+
+ Assistant: <assistant_response><eos>
+ ```
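
The template above can be assembled programmatically. The following sketch uses hypothetical helper names (`build_prompt`, `build_training_example`) and makes a few assumptions the card does not confirm: sections are separated by one blank line as shown in the template, the system section may be omitted when empty, an inference prompt ends with `Assistant:` so that the model writes the response, and `<eos>` stands for the tokenizer's end-of-sequence token (for example `tokenizer.eos_token`).

```python
def build_prompt(user_message: str, system_prompt: str = "") -> str:
    """Build an inference prompt that ends where the assistant should start."""
    parts = []
    if system_prompt:
        parts.append(f"System: {system_prompt}")
    parts.append(f"User: {user_message}")
    parts.append("Assistant:")
    # Sections are separated by a blank line, as in the template.
    return "\n\n".join(parts)


def build_training_example(
    user_message: str,
    assistant_response: str,
    system_prompt: str = "",
    eos_token: str = "<eos>",
) -> str:
    """Build a full training-style example, terminated by the EOS token."""
    prompt = build_prompt(user_message, system_prompt)
    return f"{prompt} {assistant_response}{eos_token}"


if __name__ == "__main__":
    print(build_prompt("Write a Python function that reverses a string.",
                       system_prompt="You are a helpful coding assistant."))
```

The string returned by `build_prompt` is what would be tokenized and passed to the model for generation.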
+
+ ## Limitations
+
+ - The model is specialized for Burmese (Myanmar) language and code generation
+ - Performance may vary for other languages
+ - Fine-tuned on a limited dataset
+
+ ## Ethical Considerations
+
+ - The model was trained on publicly available Myanmar language data
+ - No personally identifiable information (PII) is included
+ - Intended for educational and research purposes
+
+ ## Acknowledgments
+
+ - Microsoft for the phi-2 base model
+ - Hugging Face for Transformers and PEFT
+ - Myanmar NLP community
+
+ ---
+
+ *This model card was auto-generated for mm-llm-coder-lite-v1*