File size: 974 Bytes
a9a0272 ebd3ed5 a9a0272 7924495 e896690 7924495 a9a0272 7924495 a9a0272 7924495 a9a0272 7924495 ebd3ed5 7924495 a9a0272 7924495 a9a0272 7924495 a9a0272 7924495 a9a0272 7924495 ebd3ed5 7924495 a9a0272 7924495 a9a0272 7924495 a9a0272 7924495 a9a0272 7924495 a9a0272 7924495 e896690 f9e51c6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 |
---
license: mit
datasets:
- Lyon28/Caca-Behavior
language:
- id
tags:
- retrieval
- qa
- indonesian
- bm25
- tfidf
---
# Chatbot Caca - Retrieval-Based QA
Chatbot berbasis BM25 + TF-IDF untuk QA Bahasa Indonesia.
## Model Details
- **Type:** Retrieval-based QA System
- **Size:** 2.69 MB
- **Algorithm:** Hybrid BM25 + TF-IDF + Fuzzy Matching
- **Dataset:** Caca-Behavior (4.079 QA pairs)
- **Language:** Indonesian
## Usage
```python
# Install dependencies
!pip install rank-bm25 scikit-learn huggingface-hub
# Download model
from huggingface_hub import hf_hub_download
model_path = hf_hub_download(
repo_id="Lyon28/Caca-Chatbot",
filename="chatbot_caca.pkl"
)
# Load model
import pickle
with open(model_path, 'rb') as f:
data = pickle.load(f)
print(f"Loaded {len(data['qa_pairs'])} QA pairs!")
```
## Performance
- Query speed: < 10ms
- Accuracy: High for paraphrase matching
- Memory: ~3MB
## Credits
Created by Lyon28 |