utkarshshukla2912 commited on
Commit
c8474ab
·
verified ·
1 Parent(s): a0acba1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +95 -4
README.md CHANGED
@@ -6,10 +6,103 @@ tags:
6
  - unsloth
7
  - qwen3_5
8
  license: apache-2.0
9
- language:
10
- - en
11
  ---
12
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  # Uploaded finetuned model
14
 
15
  - **Developed by:** RinggAI
@@ -17,5 +110,3 @@ language:
17
  - **Finetuned from model :** Qwen/Qwen3.5-2B
18
 
19
  This qwen3_5 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
20
-
21
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
6
  - unsloth
7
  - qwen3_5
8
  license: apache-2.0
 
 
9
  ---
10
 
11
+
12
+ The model was finetuned on ~128,000 curated transcripts across different domanins and language preferences
13
+ - Expanded Training: Now optimized for CX Support, Healthcare, Loan Collection, Insurance, Ecommerce, and Concierge services.
14
+ - Feature Improvement: Significantly enhanced relative date-time extraction for more precise data processing.
15
+ - Training Overview
16
+ - You can plug it into your calling or voice AI stack to automatically extract:
17
+ - Enum-based classifications (e.g., call outcome, intent, disposition)
18
+ - Conversation summaries
19
+ - Action items / follow-ups
20
+ - Relative DateTime Artifacts
21
+
22
+ It’s built to handle real-world Hindi, English, Indic noisy transcripts.
23
+
24
+ ![Training Overview](logs.jpg)
25
+
26
+
27
+ Finetuning Parameters:
28
+ ```
29
+ rank = 64 # kept small to know change inherent model intelligence but to make sure structured ectraction is followed
30
+ trainer = SFTTrainer(
31
+ model = model,
32
+ tokenizer = tokenizer,
33
+ train_dataset = train_dataset,
34
+ eval_dataset = test_dataset,
35
+ args = SFTConfig(
36
+ dataset_text_field = "prompt",
37
+ max_seq_length = max_seq_length,
38
+ per_device_train_batch_size = 5,
39
+ gradient_accumulation_steps = 5,
40
+
41
+ warmup_steps = 10,
42
+ num_train_epochs = 2,
43
+ learning_rate = 2e-4,
44
+ lr_scheduler_type = "linear",
45
+
46
+ optim = "adamw_8bit",
47
+ weight_decay = 0.01, # Unsloth default (was 0.001)
48
+ seed = SEED,
49
+
50
+ logging_steps = 50,
51
+ report_to = "wandb",
52
+
53
+ eval_strategy = "steps",
54
+ eval_steps = 5000,
55
+ save_strategy = "steps",
56
+ save_steps = 5000,
57
+ load_best_model_at_end = True,
58
+ metric_for_best_model = "eval_loss",
59
+
60
+ output_dir = "outputs_qwen35_2b",
61
+ dataset_num_proc = 8,
62
+ fp16= not torch.cuda.is_bf16_supported(),
63
+ bf16= torch.cuda.is_bf16_supported()
64
+ ),
65
+ )
66
+ ```
67
+
68
+ Provide the below schema for best output:
69
+ ```
70
+ response_schema = {
71
+ "type": "object",
72
+ "properties": {
73
+ "key_points": {
74
+ "type": "array",
75
+ "items": {"type": "string"},
76
+ "nullable": True,
77
+ },
78
+ "action_items": {
79
+ "type": "array",
80
+ "items": {"type": "string"},
81
+ "nullable": True,
82
+ },
83
+ "summary": {"type": "string"},
84
+ "classification": classification_schema,
85
+ "callback_requested": {
86
+ "type": "boolean",
87
+ "nullable": False,
88
+ "description": "If the user requested a callback or mentiones currently he is busy then value is true otherwise false",
89
+ },
90
+ "callback_requested_time": {
91
+ "type": "string",
92
+ "nullable": True,
93
+ "description": "ISO 8601 datetime string (YYYY-MM-DDTHH:MM:SS) in the call's timezone, if user requested a callback",
94
+ },
95
+ },
96
+ "required": ["summary", "classification"],
97
+ }
98
+ ```
99
+
100
+
101
+
102
+ [<img style="border-radius: 20px;" src="https://storage.googleapis.com/desivocal-prod/desi-vocal/logo.png" width="200"/>](https://ringg.ai)
103
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
104
+
105
+
106
  # Uploaded finetuned model
107
 
108
  - **Developed by:** RinggAI
 
110
  - **Finetuned from model :** Qwen/Qwen3.5-2B
111
 
112
  This qwen3_5 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.