neshkatrapati committed on Feb 17

Commit ebb32fd (verified)

1 Parent(s): e0ef6c3

Upload folder using huggingface_hub

Files changed (7)

README.md +6 -5
config.json +5 -2
generation_config.json +5 -2
model.safetensors +1 -1
special_tokens_map.json +9 -2
tokenizer_class.py +12 -0
tokenizer_config.json +11 -4

README.md CHANGED

@@ -40,6 +40,7 @@ Developed by **[Dvitva AI](https://dvitva.ai)**.
 | **Vocab size** | 86,075 (base + 4 chat tokens) |
 | **Tokenizer** | Morfessor + BPE (Telugu morpheme-aware) |
 | **Fine-tuning** | Full SFT on Telugu conversations |
+| **Best val loss** | 2.4830389234855454 |
 | **Developed by** | [Dvitva AI](https://dvitva.ai) |
 ## Chat Template
@@ -66,6 +67,8 @@ The model generates after `<|assistant|>` and stops at `<|end|>`.
 | Token | ID |
 |---|---|
+| `<bos>` | 2 |
+| `<eos>` | 3 |
 | `<|system|>` | 86071 |
 | `<|user|>` | 86072 |
 | `<|assistant|>` | 86073 |
@@ -126,7 +129,7 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=False))
 ### Using the CLI chat script
-For the best interactive experience, use the `chat.py` CLI from the [training repo](https://github.com/dvitvaai/telugu-lm):
+For the best experience, use the included `chat.py` CLI:
 ```bash
 # Interactive multi-turn chat
@@ -156,12 +159,10 @@ This model uses a **Morfessor + BPE hybrid tokenizer** designed for Telugu:
 ```python
 import morfessor, re
-from huggingface_hub import hf_hub_download
-# Load Morfessor model from the repo
-morf_path = hf_hub_download(repo_id="dvitvaai/pothana-chat-300M", filename="morfessor_telugu.bin")
+# Load Morfessor model
 io = morfessor.MorfessorIO()
-morf_model = io.read_binary_model_file(morf_path)
+morf_model = io.read_binary_model_file("morfessor_telugu.bin")
 TELUGU_RE = re.compile(r"[\u0C00-\u0C7F]+")
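
The README snippet in this diff loads the Morfessor model and compiles `TELUGU_RE`, but the hunk ends before any segmentation happens. A minimal sketch of how Telugu runs might be segmented and joined with the `@@` separator is shown below. `segment_telugu`, `toy_segmenter`, and `TOY_SPLITS` are hypothetical names used only for illustration; a real pipeline would presumably pass a callable backed by `morf_model` instead of the toy lookup, and the repository's actual preprocessing may differ.

```python
import re
from typing import Callable, List

# Same pattern as the README snippet: one or more Telugu-block code points.
TELUGU_RE = re.compile(r"[\u0C00-\u0C7F]+")


def segment_telugu(text: str, segmenter: Callable[[str], List[str]]) -> str:
    """Replace each Telugu run with its morphemes joined by '@@ '.

    Non-Telugu text (Latin words, digits, punctuation) passes through unchanged.
    """
    def join_morphemes(match):
        pieces = segmenter(match.group(0))
        # Every piece except the last gets the '@@' continuation marker.
        return "@@ ".join(pieces)

    return TELUGU_RE.sub(join_morphemes, text)


# Toy stand-in for a Morfessor-backed segmenter (assumption, illustration only).
TOY_SPLITS = {"రెడ్డిగారు": ["రెడ్డి", "గారు"]}


def toy_segmenter(word: str) -> List[str]:
    return TOY_SPLITS.get(word, [word])


segmented = segment_telugu("రెడ్డిగారు hello", toy_segmenter)
print(segmented)
```

Keeping the segmenter as a parameter makes the regex-and-join step testable without the `morfessor_telugu.bin` file.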

config.json CHANGED

@@ -21,10 +21,13 @@
   "tie_word_embeddings": true,
   "pad_token_id": 0,
   "bos_token_id": 2,
-  "eos_token_id": [3, 86074],
+  "eos_token_id": [
+    3,
+    86074
+  ],
   "attention_dropout": 0.0,
   "initializer_range": 0.02,
   "pretraining_tp": 1,
   "use_cache": true,
   "transformers_version": "4.40.0"
-}
+}
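
`eos_token_id` lists two IDs, so generation is meant to stop at whichever one appears first. The sketch below shows that stopping rule in plain Python. It assumes ID 86074 is `<|end|>`, which is inferred from the README token table (the listed chat tokens run up to 86073 for `<|assistant|>`) and is not confirmed by this diff. `truncate_at_eos` is a hypothetical helper, not a Transformers function.

```python
from typing import Iterable, List

# Values taken from config.json; 86074 is assumed to be <|end|>.
EOS_TOKEN_IDS = {3, 86074}


def truncate_at_eos(token_ids: Iterable[int], eos_ids=EOS_TOKEN_IDS) -> List[int]:
    """Return the tokens that precede the first EOS id (the EOS itself is dropped)."""
    kept = []
    for tok in token_ids:
        if tok in eos_ids:
            break
        kept.append(tok)
    return kept


# Made-up token ids for illustration: stops at 86074, ignoring everything after it.
trimmed = truncate_at_eos([86073, 512, 977, 86074, 41, 3])
print(trimmed)
```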

generation_config.json CHANGED

@@ -1,7 +1,10 @@
 {
   "_from_model_config": true,
   "bos_token_id": 2,
-  "eos_token_id": [3, 86074],
+  "eos_token_id": [
+    3,
+    86074
+  ],
   "pad_token_id": 0,
   "do_sample": true,
   "temperature": 0.7,
@@ -10,4 +13,4 @@
   "max_new_tokens": 256,
   "repetition_penalty": 1.1,
   "transformers_version": "4.40.0"
-}
+}
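
The generation defaults enable sampling with `temperature` 0.7 and `repetition_penalty` 1.1. The plain-Python sketch below shows how these two settings are commonly applied to next-token logits: for tokens already seen, the penalty divides positive logits and multiplies negative ones, and temperature divides every logit before the softmax. This illustrates the general technique only, not the library's exact code path, and both function names are hypothetical.

```python
import math
from typing import Iterable, List


def apply_repetition_penalty(logits: List[float], seen_ids: Iterable[int],
                             penalty: float = 1.1) -> List[float]:
    """Make already-generated tokens less likely (penalty > 1)."""
    out = list(logits)
    for i in seen_ids:
        # Dividing a positive logit or multiplying a negative one both lower it.
        out[i] = out[i] / penalty if out[i] > 0 else out[i] * penalty
    return out


def softmax_with_temperature(logits: List[float], temperature: float = 0.7) -> List[float]:
    """Temperatures below 1 sharpen the distribution; above 1 flatten it."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, -1.0, 0.5]          # made-up scores for a 3-token vocabulary
penalized = apply_repetition_penalty(logits, seen_ids={0, 1})
probs = softmax_with_temperature(penalized)
print(penalized, probs)
```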

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:236a8a7692f176c516db8a5c7448795000e1677de1c2798cb75c7d37aa6bee1f
+oid sha256:f2fb678f8659adecbe45390748938a5b3ec93c7bc5d5fa133fbab54723f8b168
 size 1380356280
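
Only the `oid` changed here, so the weights were replaced by a file of identical size. A downloaded file could be checked against a Git LFS pointer with a sketch like the one below. `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, and the demo hashes a small in-memory payload rather than the real weights.

```python
import hashlib


def parse_lfs_pointer(pointer_text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in pointer_text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True if the byte length and hash of `data` both match the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Demo with a tiny payload; its pointer is built on the fly for illustration.
payload = b"demo weights"
demo_pointer = (
    "version https://git-lfs.github.com/spec/v1\n"
    f"oid sha256:{hashlib.sha256(payload).hexdigest()}\n"
    f"size {len(payload)}\n"
)
pointer = parse_lfs_pointer(demo_pointer)
print(matches_pointer(payload, pointer))
```

For a file as large as this one (1,380,356,280 bytes), hashing in chunks with `hashlib.sha256().update(...)` would avoid loading everything into memory.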

special_tokens_map.json CHANGED

@@ -3,5 +3,12 @@
   "eos_token": "<eos>",
   "unk_token": "<unk>",
   "pad_token": "<pad>",
-  "additional_special_tokens": ["<|system|>", "<|user|>", "<|assistant|>", "<|end|>"]
-}
+  "additional_special_tokens": [
+    "<|system|>",
+    "<|user|>",
+    "<|assistant|>",
+    "<|end|>",
+    "<bos>",
+    "<eos>"
+  ]
+}

tokenizer_class.py CHANGED

@@ -8,8 +8,14 @@ class TeluguTokenizer(PreTrainedTokenizerFast):
     Tokens ending with @@ are continuation pieces that join to the next token.
     This class overrides decode() to strip @@ markers and join morphemes:
         "రెడ్డి@@ గారు" → "రెడ్డిగారు"
+    Also strips chat special tokens (<|system|>, <|user|>, <|assistant|>, <|end|>)
+    from decoded output for clean text.
     """
+    # Chat special tokens to strip from output
+    _CHAT_SPECIALS = ["<|system|>", "<|user|>", "<|assistant|>", "<|end|>"]
     def decode(self, token_ids, skip_special_tokens=False, **kwargs):
         text = super().decode(token_ids, skip_special_tokens=skip_special_tokens, **kwargs)
         # Strip @@ continuation markers:
@@ -17,4 +23,10 @@ class TeluguTokenizer(PreTrainedTokenizerFast):
         text = text.replace("@@ ", "")
         # Handle remaining @@ (before punctuation, end of string, etc.)
         text = text.replace("@@", "")
+        # Strip chat special tokens
+        for special in self._CHAT_SPECIALS:
+            text = text.replace(special, "")
+        # Clean up extra whitespace from removed tokens
+        import re
+        text = re.sub(r"  +", " ", text).strip()
         return text
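
The post-processing added to `decode()` can be tried in isolation. The sketch below copies the same string operations into a standalone function so the effect is visible without loading the tokenizer; `clean_decoded` is a hypothetical name, and the input string stands in for what `super().decode(...)` might return.

```python
import re

CHAT_SPECIALS = ["<|system|>", "<|user|>", "<|assistant|>", "<|end|>"]


def clean_decoded(text: str) -> str:
    """Apply the same cleanup steps as the patched decode(), in the same order."""
    text = text.replace("@@ ", "")   # join continuation pieces to the next token
    text = text.replace("@@", "")    # leftover markers (before punctuation, at the end)
    for special in CHAT_SPECIALS:    # drop chat markup
        text = text.replace(special, "")
    # Collapse runs of two or more spaces, then trim both ends.
    return re.sub(r"  +", " ", text).strip()


cleaned = clean_decoded("<|assistant|> రెడ్డి@@ గారు <|end|>")
print(cleaned)
```

In the diff, the chat tokens are removed unconditionally, so they are stripped even when `skip_special_tokens=False` is passed.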

tokenizer_config.json CHANGED

@@ -15,11 +15,18 @@
   "add_eos_token": false,
   "clean_up_tokenization_spaces": false,
   "model_max_length": 2048,
-  "additional_special_tokens": ["<|system|>", "<|user|>", "<|assistant|>", "<|end|>"],
-  "chat_template": "{% for message in messages %}{% if loop.first %}<bos>{% endif %}{% if message['role'] == 'system' %}<|system|> {{ message['content'] }} <|end|>{% elif message['role'] == 'user' %}<|user|> {{ message['content'] }} <|end|>{% elif message['role'] == 'assistant' %}<|assistant|> {{ message['content'] }} <|end|>{% endif %}{% endfor %}{% if add_generation_prompt %}<|assistant|>{% endif %}",
   "extra_info": {
     "type": "morfessor_bpe_telugu",
     "separator": "@@",
     "note": "This tokenizer expects Morfessor-segmented text as input. For raw Telugu text, run Morfessor segmentation first using the included morfessor_telugu.bin model. Tokens ending with '@@' are continuation pieces that join to the next token. The decoder handles @@ removal automatically."
-  }
-}
+  },
+  "additional_special_tokens": [
+    "<|system|>",
+    "<|user|>",
+    "<|assistant|>",
+    "<|end|>",
+    "<bos>",
+    "<eos>"
+  ],
+  "chat_template": "{{ bos_token }}{% for message in messages %}{% if message['role'] == 'system' %}<|system|>{{ message['content'] }}<|end|>{% elif message['role'] == 'user' %}<|user|>{{ message['content'] }}<|end|>{% elif message['role'] == 'assistant' %}<|assistant|>{{ message['content'] }}<|end|>{% endif %}{% endfor %}{% if add_generation_prompt %}<|assistant|>{% endif %}"
+}
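
The new `chat_template` drops the spaces around message content and emits `{{ bos_token }}` once, before the loop. The sketch below is a hand-written Python mirror of that template, not the Jinja rendering Transformers performs. `render_chat` is a hypothetical name, and `bos_token` is assumed to resolve to `<bos>`.

```python
from typing import Dict, List

ROLE_TAGS = {
    "system": "<|system|>",
    "user": "<|user|>",
    "assistant": "<|assistant|>",
}


def render_chat(messages: List[Dict[str, str]],
                add_generation_prompt: bool = False,
                bos_token: str = "<bos>") -> str:
    """Build the prompt string the new template would produce."""
    parts = [bos_token]  # emitted once, even if there are no messages
    for message in messages:
        tag = ROLE_TAGS.get(message["role"])
        if tag is None:
            continue  # the template has no branch for other roles
        parts.append(f"{tag}{message['content']}<|end|>")
    if add_generation_prompt:
        parts.append("<|assistant|>")  # cue the model to answer
    return "".join(parts)


prompt = render_chat([{"role": "user", "content": "Who are you?"}],
                     add_generation_prompt=True)
print(prompt)
```

Unlike the old template, which wrote `<bos>` only inside the loop on the first message, this version produces the BOS token even for an empty message list.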