Vacaspati commited on
Commit
8796613
·
verified ·
1 Parent(s): e0182d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +60 -3
README.md CHANGED
@@ -1,3 +1,60 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ license: apache-2.0
3
+ language:
4
+ - bn
5
+ ---
6
+
7
+ # IV-Electra
8
+
9
+ **IV-Electra** is a 17 million-parameter model, trained on the Vācaspati literary dataset and IndicCorpv1.0 (Bangla subset).
10
+
11
+ ## Model Details
12
+
13
+ - **Architecture:** Electra-small (but reduced to 17 M parameters)
14
+ - **Pretraining Corpus:** Vācaspati — a curated Bangla literary corpus
15
+ - **Parameter Count:** 17 M (≈ 1/7th the size of BERT-base)
16
+ - **Tokenizer:** WordPiece, vocabulary size 50 K
17
+
18
+
19
+ ## Usage Example
20
+
21
+ ```python
22
+ from transformers import BertTokenizer, AutoModelForSequenceClassification
23
+
24
+ tokenizer = BertTokenizer.from_pretrained("Vacaspati/IV-Electra")
25
+ model = AutoModelForSequenceClassification.from_pretrained("Vacaspati/IV-Electra")
26
+ ```
27
+
28
+ ## Citation
29
+
30
+ If you are using this model please cite:
31
+
32
+ ```bibtex
33
+
34
+ @inproceedings{bhattacharyya-etal-2023-vacaspati,
35
+ title = "{VACASPATI}: A Diverse Corpus of {B}angla Literature",
36
+ author = "Bhattacharyya, Pramit and
37
+ Mondal, Joydeep and
38
+ Maji, Subhadip and
39
+ Bhattacharya, Arnab",
40
+ editor = "Park, Jong C. and
41
+ Arase, Yuki and
42
+ Hu, Baotian and
43
+ Lu, Wei and
44
+ Wijaya, Derry and
45
+ Purwarianti, Ayu and
46
+ Krisnadhi, Adila Alfa",
47
+ booktitle = "Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)",
48
+ month = nov,
49
+ year = "2023",
50
+ address = "Nusa Dua, Bali",
51
+ publisher = "Association for Computational Linguistics",
52
+ url = "https://aclanthology.org/2023.ijcnlp-main.72/",
53
+ doi = "10.18653/v1/2023.ijcnlp-main.72",
54
+ pages = "1118--1130"
55
+ }
56
+
57
+ ```
58
+
59
+
60
+