Badnyal commited on
Commit
6cab71f
·
verified ·
1 Parent(s): 5d19955

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -17,7 +17,7 @@ pipeline_tag: text-generation
17
 
18
  # Kren v1: Khasi Generative Language Model
19
 
20
- Kren v1 is the first *publicly documented* encoder→decoder conversion producing a generative language model for an Indian language (Khasi). The conversion was performed by transferring weights and adapting the architecture of MWirelabs/khasibert (RoBERTa-style encoder) into a GPT-2 style causal decoder, followed by progressive causal LM fine-tuning.
21
 
22
  ## Model Overview
23
 
@@ -142,7 +142,7 @@ Kren v1 may produce hallucinations, biased or culturally sensitive content, and
142
 
143
  ## Research Significance
144
 
145
- - **First**: Encoder-to-decoder conversion methodology for Indian languages
146
  - **Methodology**: Validates progressive training approach for low-resource languages
147
  - **Findings**: Demonstrates optimal training data volumes for indigenous language models
148
  - **Impact**: Establishes foundation for Northeast Indian language AI development
@@ -151,7 +151,7 @@ Kren v1 may produce hallucinations, biased or culturally sensitive content, and
151
 
152
  ```bibtex
153
  @misc{nyalang2024kren,
154
- title={Kren v1.0: The First Publicly Documented Encoder-to-Decoder Generative Language Model for an Indian Language (Khasi)},
155
  author={Badal Nyalang},
156
  year={2024},
157
  publisher={Zenodo},
 
17
 
18
  # Kren v1: Khasi Generative Language Model
19
 
20
+ Kren v1 is a *publicly documented* encoder→decoder conversion producing a generative language model for an Indian language (Khasi). The conversion was performed by transferring weights and adapting the architecture of MWirelabs/khasibert (RoBERTa-style encoder) into a GPT-2 style causal decoder, followed by progressive causal LM fine-tuning.
21
 
22
  ## Model Overview
23
 
 
142
 
143
  ## Research Significance
144
 
145
+ - **Process**: Encoder-to-decoder conversion methodology for Indian languages
146
  - **Methodology**: Validates progressive training approach for low-resource languages
147
  - **Findings**: Demonstrates optimal training data volumes for indigenous language models
148
  - **Impact**: Establishes foundation for Northeast Indian language AI development
 
151
 
152
  ```bibtex
153
  @misc{nyalang2024kren,
154
+ title={Kren v1.0: An Encoder-to-Decoder Generative Language Model for an Indian Language (Khasi)},
155
  author={Badal Nyalang},
156
  year={2024},
157
  publisher={Zenodo},