File size: 3,245 Bytes
48c2673 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf bc67265 d7984cf 48c2673 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 |
---
license: cc-by-4.0
language:
- th
base_model:
- airesearch/wangchanberta-base-att-spm-uncased
pipeline_tag: token-classification
---
library_name: transformers
tags: [ner, thai, food, review, token-classification]
---
# Model Card for wttw/modchelin_thainer-base-model
This model performs Named Entity Recognition (NER) on Thai-language food reviews. It is designed to extract domain-specific aspects such as dish names, ingredients, restaurant service, and sentiment-related phrases from customer-written content.
## Model Details
### Model Description
This is the model card of a 🤗 Transformers model that has been pushed to the Hugging Face Hub.
- **Developed by:** Vitawat Kitipatthavorn
- **Finetuned from model:** `airesearch/wangchanberta-base-att-spm-uncased`
- **Model type:** Token Classification (NER)
- **Language(s) (NLP):** Thai
- **License:** cc-by-sa-4.0
- **Shared by:** wttw
- **Model ID:** `wttw/modchelin_thainer-base-model`
## Uses
### Direct Use
This model is designed for extracting domain-specific entities from Thai-language food reviews. It identifies and classifies named entities related to:
- Food/menu items
- Taste
- Service
- Ambiance
- Price and value
- Other aspects relevant to customer dining experiences
**Example:**
- **Input:** `"ต้มยำกุ้งอร่อยมาก แต่บริการช้า"`
- **Output:**
- `ต้มยำกุ้ง: FOOD`
- `บริการ: SERVICE`
The model is suitable for NLP pipelines aimed at analyzing restaurant reviews, powering sentiment dashboards, or supporting aspect-based sentiment analysis (ABSA).
### Downstream Use
The model can be integrated into:
- Thai ABSA pipelines
- Restaurant feedback summarization systems
- Chatbots or moderation tools for food delivery and review platforms
### Out-of-Scope Use
The model is not designed for:
- Non-food-related documents (e.g., legal, clinical, political)
- General-purpose Thai NER tasks
- Use cases requiring high confidence on ambiguous or out-of-domain text
## Bias, Risks, and Limitations
The model is trained specifically on food review content and may:
- Struggle with informal slang or regional dialects
- Over-predict `FOOD` entities in unrelated contexts
- Misclassify ambiguous phrases without surrounding context
### Recommendations
Users should:
- Avoid applying this model outside food-related domains
- Fine-tune further if working with reviews in specific dialects or contexts
- Evaluate on a sample of target data before production use
- Consider setting confidence thresholds before using predictions downstream
## How to Get Started with the Model
```python
from transformers import AutoTokenizer, AutoModelForTokenClassification
from transformers import pipeline
model_name = "wttw/modchelin_thainer-base-model"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name)
ner_pipeline = pipeline("ner", model=model, tokenizer=tokenizer, aggregation_strategy="simple")
example = "ต้มยำกุ้งอร่อยมาก แต่บริการช้า"
entities = ner_pipeline(example)
print(entities) |