nebulette commited on
Commit
0b929d4
·
verified ·
1 Parent(s): 5acd51e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -3
README.md CHANGED
@@ -1,3 +1,36 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model_relation: finetune
4
+ base_model:
5
+ - sbintuitions/modernbert-ja-310m
6
+ tags:
7
+ - modernbert
8
+ ---
9
+
10
+ BERT for Anima
11
+
12
+ ![](images/bert.png)
13
+
14
+ *2girls, two girls are sitting at the table, the left girl has pink hair, the right girl has black hair. The left girl is wearing a yellow hair ribbon*
15
+
16
+ This modern architecture combines the LLM adapter with existing architectures.
17
+
18
+ Although the tokenization process is still done at the BERT level, the T5 ids are not entirely discarded. It can be called a text encoder with a dual tokenizer.
19
+
20
+ This implementation is also leaner than the fat, transformer-based adapters on the top of LLMs.
21
+
22
+ While ModernBERT's entire multilingual vocabulary can still be used, this is backward compatible with Anima's T5.
23
+
24
+ This repo also converges faster than [her lil sister](https://huggingface.co/nebulette/berry).
25
+
26
+ Source data
27
+
28
+ - anime themed, long natural sentences
29
+ - general tags
30
+ - character count tags
31
+ - colors
32
+ - negatives
33
+ - safety levels
34
+ - spatial relationships
35
+
36
+ No credits are provided; this is original work.