AliMuhammad73 committed (verified)
Commit dbee900 · 1 Parent(s): 4912248

Upload model and tokenizer

Files changed (5)
  1. README.md +9 -32
  2. config.json +3 -0
  3. model.safetensors +3 -0
  4. tokenizer.model +3 -0
  5. tokenizer_config.json +4 -0
README.md CHANGED
@@ -1,32 +1,9 @@
- # Custom Urdu LLM
-
- This is a custom transformer-based Large Language Model for Urdu.
-
- ## Model Details
- - **Architecture:** Transformer (GPT-based)
- - **Framework:** PyTorch
- - **Tokenizer:** SentencePiece
- - **Hyperparameters:**
-   - Vocabulary Size: 20,000
-   - Embedding Size: 768
-   - Attention Heads: 12
-   - Layers: 12
-   - Dropout: 0.2
-
- ## Usage
-
- ```python
- from transformers import AutoModel, AutoTokenizer
-
- model = AutoModel.from_pretrained("AliMuhammad73/testing-model")
- tokenizer = AutoTokenizer.from_pretrained("AliMuhammad73/testing-model")
-
- prompt = <prompt in urdu>
- inputs = tokenizer(prompt, return_tensors="pt")
- output = model.generate(inputs.input_ids, max_new_tokens=tokens_to_generate)
- print(tokenizer.decode(output[0]))
- ```
-
- ---
- license: apache-2.0
- ---
+ ---
+ tags:
+ - model_hub_mixin
+ - pytorch_model_hub_mixin
+ ---
+
+ This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
+ - Library: [More Information Needed]
+ - Docs: [More Information Needed]
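The new card only states that the weights were pushed with the PyTorchModelHubMixin integration. Below is a minimal sketch of how such a checkpoint is typically reloaded; the `UrduLM` class name and its layers are assumptions, and only the repo id and the `vocab_size` stored in config.json come from this commit.

```python
# Minimal sketch of reloading a checkpoint pushed with PyTorchModelHubMixin.
# The class name and architecture are assumptions; loading only succeeds if
# the class matches the architecture that was actually pushed.
import torch.nn as nn
from huggingface_hub import PyTorchModelHubMixin

class UrduLM(nn.Module, PyTorchModelHubMixin):
    def __init__(self, vocab_size: int = 20000, embed_dim: int = 768):
        super().__init__()
        # The real model is a full GPT-style transformer; a single embedding
        # layer stands in for it in this sketch.
        self.embed = nn.Embedding(vocab_size, embed_dim)

    def forward(self, input_ids):
        return self.embed(input_ids)

# from_pretrained downloads config.json and model.safetensors from the repo
# and instantiates the class with the stored init kwargs (here, vocab_size).
model = UrduLM.from_pretrained("AliMuhammad73/testing-model")
```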
config.json ADDED
@@ -0,0 +1,3 @@
+ {
+   "vocab_size": 20000
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:66ee862a2d28074541cad0ee5b4ec9d9aa98b8ab1efcd4782e1e5bef64c7adae
+ size 404903720
tokenizer.model ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:45858257f60a18fca1e9c91f723a2ded05ebc8e7de3e8ba64af819b095728c50
+ size 395464
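tokenizer.model is the SentencePiece model that the previous README referenced. A hedged sketch of loading it directly with the sentencepiece library (the local file path and the sample text are placeholders):

```python
# Sketch: using tokenizer.model with the sentencepiece library directly,
# assuming it is a standard SentencePiece model as the old card stated.
import sentencepiece as spm

sp = spm.SentencePieceProcessor(model_file="tokenizer.model")
ids = sp.encode("یہ ایک مثال ہے", out_type=int)  # Urdu: "this is an example"
print(ids)
print(sp.decode(ids))
```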
tokenizer_config.json ADDED
@@ -0,0 +1,4 @@
+ {
+   "type": "llama",
+   "vocab_size": 0
+ }