---
license: mit
datasets:
- Lyon28/Caca-Behavior
language:
- id
tags:
- retrieval
- qa
- indonesian
- bm25
- tfidf
---
# Chatbot Caca - Retrieval-Based QA
   
A BM25 + TF-IDF based chatbot for question answering in Indonesian.
   
## Model Details
   
- **Type:** Retrieval-based QA System
- **Size:** 2.69 MB
- **Algorithm:** Hybrid BM25 + TF-IDF + Fuzzy Matching
- **Dataset:** Caca-Behavior (4,079 QA pairs)
- **Language:** Indonesian
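
The card does not document how the three signals are combined, so the following is only a minimal sketch of one way a hybrid BM25 + TF-IDF + fuzzy-matching retriever could work. It uses the standard library only; the whitespace tokenizer, the weights `(0.4, 0.4, 0.2)`, the `(question, answer)` tuple layout, and the toy QA pairs are all assumptions for illustration, not the actual contents or logic of `chatbot_caca.pkl`.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def tokenize(text):
    # Naive lowercase whitespace tokenizer (an assumption, not the model's).
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    # Okapi BM25 score of the query against each tokenized document.
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in query:
            if t not in tf:
                continue
            idf = math.log((n - df[t] + 0.5) / (df[t] + 0.5) + 1)
            s += idf * tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores


def tfidf_cosine(query, docs):
    # Cosine similarity between TF-IDF vectors of the query and each document.
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))

    def vec(tokens):
        return {t: c * (math.log((1 + n) / (1 + df[t])) + 1)
                for t, c in Counter(tokens).items()}

    def cos(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        na = math.sqrt(sum(w * w for w in a.values()))
        nb = math.sqrt(sum(w * w for w in b.values()))
        return dot / (na * nb) if na and nb else 0.0

    q = vec(query)
    return [cos(q, vec(d)) for d in docs]


def answer(question, qa_pairs, weights=(0.4, 0.4, 0.2)):
    # Combine the three signals with hypothetical weights and return the
    # answer of the best-matching stored question plus its combined score.
    questions = [q for q, _ in qa_pairs]
    docs = [tokenize(q) for q in questions]
    query = tokenize(question)
    bm25 = bm25_scores(query, docs)
    top = max(bm25) or 1.0  # scale BM25 into [0, 1]; avoid dividing by zero
    bm25 = [s / top for s in bm25]
    tfidf = tfidf_cosine(query, docs)
    fuzzy = [SequenceMatcher(None, question.lower(), q.lower()).ratio()
             for q in questions]
    combined = [weights[0] * s1 + weights[1] * s2 + weights[2] * s3
                for s1, s2, s3 in zip(bm25, tfidf, fuzzy)]
    best = max(range(len(qa_pairs)), key=combined.__getitem__)
    return qa_pairs[best][1], combined[best]


# Toy data for illustration only; not taken from the Caca-Behavior dataset.
qa_pairs = [
    ("siapa nama kamu", "Namaku Caca."),
    ("apa kabar hari ini", "Aku baik, terima kasih!"),
    ("kamu bisa apa saja", "Aku bisa menjawab pertanyaan."),
]
reply, score = answer("nama kamu siapa", qa_pairs)
print(reply)
```

The reordered query shares every token with the first stored question, so both lexical scores point to it and the fuzzy ratio only serves as a tie-breaker for typos and near-matches.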
   
## Usage
   
```python
# Install dependencies first (in a notebook, prefix the command with "!"):
#   pip install rank-bm25 scikit-learn huggingface-hub

import pickle

from huggingface_hub import hf_hub_download

# Download the pickled model file from the Hugging Face Hub
model_path = hf_hub_download(
    repo_id="Lyon28/Caca-Chatbot",
    filename="chatbot_caca.pkl",
)

# Load the model data
with open(model_path, "rb") as f:
    data = pickle.load(f)

print(f"Loaded {len(data['qa_pairs'])} QA pairs!")
```
   
## Performance
   
- Query speed: < 10 ms per query
- Accuracy: high for paraphrase matching
- Memory: ~3 MB
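
The card does not say how the latency figure was measured. As a rough sketch, a per-query number like this could be checked with a small timing loop; `median_latency_ms` and `dummy_search` are hypothetical names introduced here, and `dummy_search` is only a stand-in for whatever retrieval function is built on top of the loaded data.

```python
import statistics
import time


def median_latency_ms(search, queries, repeats=5):
    """Return the median wall-clock time of search(query) in milliseconds."""
    timings = []
    for _ in range(repeats):
        for query in queries:
            start = time.perf_counter()
            search(query)
            timings.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(timings)


# Hypothetical stand-in for the chatbot's retrieval call.
def dummy_search(query):
    return sorted(query.split())


latency = median_latency_ms(dummy_search, ["siapa nama kamu", "apa kabar"])
print(f"median latency: {latency:.3f} ms")
```

Reporting the median over several repeats keeps a single slow first call from skewing the result.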
   
## Credits
   
Created by Lyon28