AliMuhammad73 committed on
Commit 6f3ada8 · 1 Parent(s): 0b8acc5

Update README.md

Files changed (1)
  1. README.md +29 -0
README.md CHANGED
@@ -1,3 +1,32 @@
  ---
  license: apache-2.0
  ---
 
+ # Custom Urdu LLM
+
+ This is a custom transformer-based Large Language Model for Urdu.
+
+ ## Model Details
+ - **Architecture:** Transformer (GPT-based)
+ - **Framework:** PyTorch
+ - **Tokenizer:** SentencePiece
+ - **Hyperparameters:**
+   - Vocabulary Size: 20,000
+   - Embedding Size: 768
+   - Attention Heads: 12
+   - Layers: 12
+   - Dropout: 0.2
+
+ ## Usage
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ model = AutoModelForCausalLM.from_pretrained("AliMuhammad73/testing-model")  # assumes a causal-LM checkpoint; generate() needs the LM head
+ tokenizer = AutoTokenizer.from_pretrained("AliMuhammad73/testing-model")
+
+ prompt = "<prompt in Urdu>"  # placeholder: replace with your Urdu text
+ inputs = tokenizer(prompt, return_tensors="pt")
+ output = model.generate(inputs.input_ids, max_new_tokens=50)  # 50 is an example value; set the number of tokens to generate
+ print(tokenizer.decode(output[0]))
+ ```
+
  ---
  license: apache-2.0
  ---
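
The hyperparameters listed under Model Details can be collected into a small config object to sanity-check the model's shape. The sketch below is not the author's code: `UrduGPTConfig`, `head_dim`, and `estimate_parameters` are hypothetical names, and the context length (1024), the 4x MLP expansion, the bias terms, and the tied input/output embeddings are GPT-2-style assumptions that the README does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class UrduGPTConfig:
    """Hyperparameters from the README, plus two labeled assumptions."""
    vocab_size: int = 20_000   # from the README
    n_embd: int = 768          # embedding size, from the README
    n_head: int = 12           # attention heads, from the README
    n_layer: int = 12          # transformer layers, from the README
    dropout: float = 0.2       # from the README
    block_size: int = 1024     # ASSUMPTION: context length is not given in the README
    mlp_ratio: int = 4         # ASSUMPTION: GPT-2-style 4x feed-forward expansion


def head_dim(cfg: UrduGPTConfig) -> int:
    """Per-head dimension; the embedding size must divide evenly across heads."""
    if cfg.n_embd % cfg.n_head != 0:
        raise ValueError("n_embd must be divisible by n_head")
    return cfg.n_embd // cfg.n_head


def estimate_parameters(cfg: UrduGPTConfig) -> int:
    """Rough GPT-2-style parameter count (biases included, output head tied)."""
    d = cfg.n_embd
    # Token embeddings plus learned positional embeddings.
    embeddings = cfg.vocab_size * d + cfg.block_size * d
    # Fused QKV projection and the attention output projection.
    attention = (d * 3 * d + 3 * d) + (d * d + d)
    # Two-layer feed-forward network.
    hidden = cfg.mlp_ratio * d
    mlp = (d * hidden + hidden) + (hidden * d + d)
    # Two layer norms per block, each with a scale and a shift vector.
    layer_norms = 2 * (2 * d)
    per_layer = attention + mlp + layer_norms
    final_norm = 2 * d
    return embeddings + cfg.n_layer * per_layer + final_norm


cfg = UrduGPTConfig()
print("head dimension:", head_dim(cfg))
print(f"estimated parameters: {estimate_parameters(cfg):,}")
```

Under these assumptions each attention head is 64-dimensional and the estimate comes to roughly 101M parameters. A different context length, MLP width, or an untied output head would change the total, so treat the figure as a plausibility check and not as the model's actual size.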