nateshmbhat commited on
Commit
7f2164c
·
1 Parent(s): 4c4c95e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -1
README.md CHANGED
@@ -9,4 +9,21 @@ widget:
9
  # ISHA Call Center QA Model
10
 
11
  ## This model was trained on a finetuned version(StableBeluga2) of Llama2-13B from stabilityai : [(StableBeluga2 tops the LLM leaderboard currently)](https://huggingface.co/stabilityai/StableBeluga2)
12
- ### Dataset Used : https://huggingface.co/datasets/nateshmbhat/isha-qa-text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  # ISHA Call Center QA Model
10
 
11
  ## This model was trained on a finetuned version(StableBeluga2) of Llama2-13B from stabilityai : [(StableBeluga2 tops the LLM leaderboard currently)](https://huggingface.co/stabilityai/StableBeluga2)
12
+ ### Dataset Used : https://huggingface.co/datasets/nateshmbhat/isha-qa-text
13
+
14
+
15
+
16
+ #### Train Params used :
17
+ - Base model : stabilityai/StableBeluga-13B
18
+ - Quantization Used : 4 bit
19
+ - Learning rate : 2e-4
20
+ - Batch Size : 2
21
+ - Epochs : 3
22
+ - Trainer : sft
23
+ - Max token length : 2048 (capable of higher token length)
24
+
25
+
26
+ #### Full command :
27
+ ```
28
+ !autotrain llm --train --project_name project-isha-qa --model stabilityai/StableBeluga-13B --data_path nateshmbhat/isha-qa-text --use_peft --use_int4 --learning_rate 2e-4 --train_batch_size 2 --num_train_epochs 3 --trainer sft --model_max_length 2048 --push_to_hub --repo_id nateshmbhat/model-isha-qa
29
+ ```