aslon1213 committed on
Commit c03838f · verified · 1 Parent(s): 2920dc8

Update README.md

Files changed (1)
  1. README.md +64 -0
README.md CHANGED
@@ -0,0 +1,64 @@
+ ---
+ datasets:
+ - aslon1213/orpheus_qa_dataset
+ - aslon1213/orpheus-tts-dataset
+ - DavronSherbaev/uzbekvoice-filtered
+ language:
+ - uz
+ base_model:
+ - meta-llama/Llama-3.2-3B-Instruct
+ tags:
+ - llama
+ - tts
+ - llm
+ - transformers
+ - uzbek
+ ---
+ # Uploaded model
+
+ - **Developed by:** aslon1213
+ - **License:** apache-2.0
+ - **Fine-tuned from model:** aslon1213/orpheus-checkpoints-87330
+
+
+
+
+ ### 🧠 Base Model
+ - **Base model**: meta-llama/Llama-3.2-3B-Instruct
+ - **Tokenizer**: meta-llama/Llama-3.2-3B-Instruct
+ - **Architecture**: LLaMA
+ - **License**: Apache-2.0
+
+
+ ### 🚀 Pretraining Configuration
+
+ Training Setup
+ - **Epochs**: 1
+ - **Batch size**: 1
+ - **Learning rate**: 5e-5
+ - **Training processes**: 8
+ - **Padding token ID**: 128263
+ - **Save steps**: 12,000
+ - **Data mixing ratio**: 0.05
+ - **Total batches**: 87,330
+ - **Time per epoch**: 8 hours on 1 GPU
+
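As a rough aid to reading the training setup above, the sketch below derives the checkpoint schedule and the average throughput implied by the listed values. It assumes that "Save steps" and "Total batches" count the same unit and that the 8-hour epoch covers all 87,330 batches; the README confirms neither, and the function names are hypothetical.

```python
def checkpoint_steps(total_batches: int, save_steps: int) -> list[int]:
    """Steps at which a periodic checkpoint would be written (assumed schedule)."""
    return list(range(save_steps, total_batches + 1, save_steps))


def batches_per_second(total_batches: int, epoch_hours: float) -> float:
    """Average throughput if the whole epoch takes `epoch_hours`."""
    return total_batches / (epoch_hours * 3600)


# Values copied from the training setup list.
TOTAL_BATCHES = 87_330
SAVE_STEPS = 12_000
EPOCH_HOURS = 8

steps = checkpoint_steps(TOTAL_BATCHES, SAVE_STEPS)
rate = batches_per_second(TOTAL_BATCHES, EPOCH_HOURS)

print(steps)
print(f"{rate:.2f} batches/s")
```

Under these assumptions the run would write seven periodic checkpoints, and the last 3,330 batches would only be captured by a final save at step 87,330, which would be consistent with the checkpoint name `orpheus-checkpoints-87330`.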
+ ### Hardware
+ - **GPU**: 1× NVIDIA H100
+
+ ### Pretraining Datasets
+ - **Text QA**: aslon1213/orpheus_qa_dataset - 26.5K samples
+ - **TTS**: aslon1213/orpheus-tts-dataset - 500K samples
+
+
+
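The sample counts above suggest one possible reading of the "Data mixing ratio: 0.05" setting: the share of text QA samples in the combined pretraining data. This is only an assumption, since the README does not define the ratio. The sketch below does the arithmetic with rounded counts taken from the list.

```python
# Rounded sample counts from the dataset list.
QA_SAMPLES = 26_500
TTS_SAMPLES = 500_000

# Assumed interpretation: ratio = text QA samples / all pretraining samples.
total_samples = QA_SAMPLES + TTS_SAMPLES
qa_share = QA_SAMPLES / total_samples

print(f"total samples: {total_samples}")
print(f"text QA share: {qa_share:.3f}")
```

The result is close to 0.05, but the match may be a coincidence; the training code would be the authority on what the ratio controls.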
+ # Metrics available at this link
+ [W&B Report](https://api.wandb.ai/links/roostamovic-smartbank/ixybtc2g)
+
+
+
+
+ # Acknowledgements
+ 1. [Llama](https://github.com/meta-llama/llama)
+ 2. [Orpheus-TTS](https://github.com/canopyai/Orpheus-TTS)
+ 3. [SNAC](https://github.com/hubertsiuzdak/snac)