Lyon28 commited on
Commit
a9a0272
·
verified ·
1 Parent(s): 042bef1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -3
README.md CHANGED
@@ -1,3 +1,57 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - Lyon28/datasets-caca-3500
5
+ language:
6
+ - id
7
+ tags:
8
+ - retrieval
9
+ - qa
10
+ - indonesian
11
+ - bm25
12
+ - tfidf
13
+ ---
14
+
15
+ # Chatbot Caca - Retrieval-Based QA
16
+
17
+ Chatbot berbasis BM25 + TF-IDF untuk QA Bahasa Indonesia.
18
+
19
+ ## Model Details
20
+
21
+ - **Type:** Retrieval-based QA System
22
+ - **Size:** 2.69 MB
23
+ - **Algorithm:** Hybrid BM25 + TF-IDF + Fuzzy Matching
24
+ - **Dataset:** datasets-caca-3500 (3,500 QA pairs)
25
+ - **Language:** Indonesian
26
+
27
+ ## Usage
28
+
29
+ ```python
30
+ # Install dependencies
31
+ !pip install rank-bm25 scikit-learn huggingface-hub
32
+
33
+ # Download model
34
+ from huggingface_hub import hf_hub_download
35
+
36
+ model_path = hf_hub_download(
37
+ repo_id="Lyon28/caca-based-chatbot",
38
+ filename="chatbot_caca.pkl"
39
+ )
40
+
41
+ # Load model
42
+ import pickle
43
+ with open(model_path, 'rb') as f:
44
+ data = pickle.load(f)
45
+
46
+ print(f"Loaded {len(data['qa_pairs'])} QA pairs!")
47
+ ```
48
+
49
+ ## Performance
50
+
51
+ - Query speed: < 10ms
52
+ - Accuracy: High for paraphrase matching
53
+ - Memory: ~3MB
54
+
55
+ ## Credits
56
+
57
+ Created by Lyon28