murataksit34 commited on
Commit
0e78352
·
verified ·
1 Parent(s): 39c2a67

Fix README with correct datasets/splits and curriculum notes

Browse files
Files changed (1) hide show
  1. README.md +11 -11
README.md CHANGED
@@ -15,19 +15,19 @@ datasets:
15
  - zero9tech/veri-bilimci-insight-diyalog-tr-16.2k
16
  ---
17
 
18
- # QWEN3-4B-Data-Science-V2-TR Türkçe Insight Ustası
19
 
20
- ## Eğitim Kurgusu
21
- 1. Türkçe düşünme adaptasyonu: ile yaklaşık ön eğitim/adaptasyon ( kayıt).
22
- 2. Alan uzmanlığı SFT: .
23
 
24
- ## Kullanılan Dataset Özeti
25
- - Toplam kayıt:
26
- - Split: · ·
27
- - Hard gate:
28
- - Soft gate:
29
- - Self-BLEU (final):
30
- - Distinct-2 (final):
31
 
32
  ## Copyright
33
  Copyright (c) Zero9 Tech
 
15
  - zero9tech/veri-bilimci-insight-diyalog-tr-16.2k
16
  ---
17
 
18
+ # QWEN3-4B-Data-Science-V2-TR - Turkce Insight Ustasi
19
 
20
+ ## Egitim Kurgusu
21
+ 1. Turkce dusunme adaptasyonu: wikimedia/wikipedia ile yaklasik %80 on egitim/adaptasyon (427,990 kayit).
22
+ 2. Alan uzmanligi SFT: zero9tech/veri-bilimci-insight-diyalog-tr-16.2k.
23
 
24
+ ## Kullanilan Dataset Ozeti
25
+ - Toplam kayit: 16,180
26
+ - Split: train 13,763 ; validation 814 ; test 1,603
27
+ - Hard gate: PASS
28
+ - Soft gate: PASS
29
+ - Self-BLEU (final): 0.5988
30
+ - Distinct-2 (final): 0.0806
31
 
32
  ## Copyright
33
  Copyright (c) Zero9 Tech