# sinhala-gpt2
This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2), trained on a dataset of Sinhala news articles gathered from various sources.

Even though it is quite simple to train, it is still capable of generating news snippets that read like the real thing. Take, for example, the following samples (some of them are hilarious, though :D):
- "ඔබ විසින් මෙම විරෝධතාව සංවිධානය කර තිබුණේ නැහැ කියලා හිටපු ජනාධිපති මහ" (roughly: "saying that this protest had not been organized by you, former President Mah…")
- "දුර්ලභ ගණයේ විශ්වවිද්යාල ප්රතිපාදන කොමිෂන් සභාවේ සභාපති මහාචාර්ය ජී එල්" (roughly: "chairman of the rare-class University Grants Commission, Professor G. L.…")

⚠️ Since the dataset used for this model is mostly composed of news articles, it is heavily biased toward generating news content. This bias may become apparent during the generation process.
## Training procedure
The model was trained for 12+ hours on Kaggle GPUs.
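The exact training script is not published here; below is a minimal sketch of how a comparable fine-tuning run could be set up with the Hugging Face `Trainer` API. The dataset file (`sinhala_news.txt`), sequence length, and hyperparameters are illustrative assumptions, not the author's actual settings.

```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 ships without a pad token
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Hypothetical corpus: one Sinhala news snippet per line in a plain-text file.
dataset = load_dataset("text", data_files={"train": "sinhala_news.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset["train"].map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="sinhala-gpt2",
        num_train_epochs=3,             # illustrative, not the author's setting
        per_device_train_batch_size=8,  # illustrative
        fp16=True,                      # mixed precision on Kaggle GPUs
    ),
    train_dataset=tokenized,
    # Causal LM objective: the collator copies inputs to labels (mlm=False)
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```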
```python
from transformers import AutoTokenizer, AutoModelForCausalLM, pipeline

tokenizer = AutoTokenizer.from_pretrained("Ransaka/sinhala-gpt2")
model = AutoModelForCausalLM.from_pretrained("Ransaka/sinhala-gpt2")

# Wrap the model and tokenizer in a text-generation pipeline
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
generator("දුර")
```
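For more control over the output, the standard generation keyword arguments can be passed straight through the pipeline call; the values below are illustrative, not tuned settings:

```python
# Sample two continuations from the same prompt
generator(
    "දුර",
    max_length=50,
    do_sample=True,
    top_k=50,
    num_return_sequences=2,
)
```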
Or using git:
```bash
# Clone the model repository from the Hugging Face Hub
# (weights are stored via Git LFS)
git lfs install
git clone https://huggingface.co/Ransaka/sinhala-gpt2
```
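Once cloned, the checkpoint can also be loaded from the local directory instead of the Hub ID (the path below assumes the default clone location):

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Load from the local clone instead of pulling from the Hub
tokenizer = AutoTokenizer.from_pretrained("./sinhala-gpt2")
model = AutoModelForCausalLM.from_pretrained("./sinhala-gpt2")
```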