Hailay commited on
Commit
f7b63bf
·
verified ·
1 Parent(s): e878f53

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -21,7 +21,7 @@ This model is a fine-tuned version of [`xlm-roberta-base`](https://huggingface.c
21
 
22
  - **Base model**: `xlm-roberta-base`
23
  - **Language**: Amharic
24
- - **Tokenizer**: Custom BPE tokenizer (not morpheme-aware)
25
  - **Adaptation**: Embedding initialization via weighted average of pretrained XLM-R embeddings, guided by FastText word vectors for Amharic
26
  - **Training dataset**: Amharic portion of the [NLLB (No Language Left Behind)](https://github.com/facebookresearch/flores) parallel corpus
27
  - **Objective**: Masked Language Modeling (MLM)
 
21
 
22
  - **Base model**: `xlm-roberta-base`
23
  - **Language**: Amharic
24
+ - **Tokenizer**: Custom BPE tokenizer
25
  - **Adaptation**: Embedding initialization via weighted average of pretrained XLM-R embeddings, guided by FastText word vectors for Amharic
26
  - **Training dataset**: Amharic portion of the [NLLB (No Language Left Behind)](https://github.com/facebookresearch/flores) parallel corpus
27
  - **Objective**: Masked Language Modeling (MLM)