bluolightning commited on
Commit
dda240d
·
verified ·
1 Parent(s): c9326db

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -3
README.md CHANGED
@@ -1,3 +1,26 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - OmniAICreator/Japanese-Novels-23M
5
+ - NilanE/ParallelFiction-Ja_En-100k
6
+ - globis-university/aozorabunko-clean
7
+ - joujiboi/Galgame-VisualNovel-Reupload
8
+ - CC100
9
+ - AnimeText
10
+ language:
11
+ - ja
12
+ base_model:
13
+ - tohoku-nlp/bert-base-japanese-char-v2
14
+ pipeline_tag: fill-mask
15
+ library_name: transformers
16
+ ---
17
+
18
+ # Custom Japanese BERT (4-layer)
19
+
20
+ This model is a tiny Japanese BERT model with 4 layers, optimized for speed.
21
+
22
+ ## Model Background
23
+ - **Architecture:** BERT (4 layers, 256 hidden size, 4 heads, 1024 FFN)
24
+ - **Distillation:** Distilled from a fine-tuned version of `tohoku-nlp/bert-base-japanese-char-v2`.
25
+ - **Initialization:** The student model was randomly initialized.
26
+ - **Tokenizer:** Japanese Character-level tokenizer, shared with the teacher.