chuuhtetnaing
/

myanmar-text-segmentation-model

Token Classification

text-segmentation

Model card Files Files and versions

chuuhtetnaing commited on Dec 24, 2025

Commit

407a296

·

verified ·

1 Parent(s): dca60b1

Update README.md

Files changed (1) hide show

README.md +12 -12

README.md CHANGED Viewed

@@ -97,25 +97,25 @@ Fine-tuned [FacebookAI/xlm-roberta-base](https://huggingface.co/FacebookAI/xlm-r
 ## Usage
-```python
-from transformers import AutoModelForTokenClassification, AutoTokenizer, pipeline
-model = AutoModelForTokenClassification.from_pretrained("chuuhtetnaing/myanmar_text_segmentation_model")
-tokenizer = AutoTokenizer.from_pretrained("chuuhtetnaing/myanmar_text_segmentation_model")
-# Using pipeline
-nlp = pipeline("token-classification", model=model, tokenizer=tokenizer)
-tokens = nlp("အချစ်ဆိုတာလူတွေရှင်သန်ဖို့သဘာဝကပေးတဲ့လက်နက်လား၊ဒါမှမဟုတ်ယဉ်ကျေးမှုအရတီထွင်ထားတဲ့စိတ်ကူးယဉ်မှုသက်သက်လား။")
 segmented_text = []
-for item in tokens:
-    if item["entity_group"] == "B":
-        segmented_text.append(item["word"])
     else:  # 'I' - append to previous word
-        segmented_text[-1] += item["word"]
 segmented_text = " ".join(segmented_text)
-return segmented_text
 ```
 ## Label Mapping

 ## Usage
+### Using Pipeline
+```python
+from transformers import pipeline
+nlp = pipeline("token-classification", model="chuuhtetnaing/myanmar-text-segmentation-model", grouped_entities=True)
+segments = nlp("အချစ်ဆိုတာလူတွေရှင်သန်ဖို့သဘာဝကပေးတဲ့လက်နက်လား၊ဒါမှမဟုတ်ယဉ်ကျေးမှုအရတီထွင်ထားတဲ့စိတ်ကူးယဉ်မှုသက်သက်လား။")
 segmented_text = []
+for segment in segments:
+    if segment["entity_group"] == "B":
+        segmented_text.append(segment["word"])
     else:  # 'I' - append to previous word
+        segmented_text[-1] += segment["word"]
 segmented_text = " ".join(segmented_text)
+print(segmented_text)
+# အချစ်ဆိုတာ လူတွေရှင်သန်ဖို့ သဘာဝကပေးတဲ့လက်နက်လား၊ ဒါမှမဟုတ် ယဉ်ကျေးမှုအရ တီထွင်ထားတဲ့ စိတ်ကူးယဉ်မှုသက်သက်လား။
 ```
 ## Label Mapping