mjpsm commited on
Commit
2aae2e1
ยท
verified ยท
1 Parent(s): 13c777e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +88 -0
README.md ADDED
@@ -0,0 +1,88 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: xgboost
4
+ tags:
5
+ - text-classification
6
+ - xgboost
7
+ - embeddings
8
+ - sentence-transformers
9
+ - slack
10
+ - checkin-detection
11
+ pipeline_tag: text-classification
12
+ model_name: "Check-In or Not Classifier"
13
+ framework: xgboost
14
+ datasets:
15
+ - custom
16
+ language:
17
+ - en
18
+ inference:
19
+ parameters:
20
+ threshold: 0.5
21
+ ---
22
+
23
+ ## ๐Ÿง  Check-In vs. Not Check-In Classifier
24
+
25
+ This model is a binary text classifier that determines whether a Slack message is a CHECKIN or NOT_CHECKIN. It was trained using a balanced dataset of real Slack check-ins and synthetic non-check-in examples. The model uses sentence-transformer embeddings combined with an XGBoost classifier.
26
+
27
+ ## ๐Ÿš€ Use Case
28
+
29
+ This model is designed for automation systems such as Slack bots, n8n workflows, CIC MCP agents, and productivity tracking tools. It detects whether a user message represents real progress or general conversation.
30
+
31
+ Example predictions:
32
+
33
+ "Today I refined the workflow automation" โ†’ CHECKIN
34
+ "Does anyone have the Zoom link?" โ†’ NOT_CHECKIN
35
+
36
+ ## ๐Ÿ“Š Training Details
37
+
38
+ Dataset size: 4,277 total examples
39
+ Labels:
40
+
41
+ - **CHECKIN:** 2,139 real Slack check-ins
42
+ - **NOT_CHECKIN:** 2,139 synthetic non-check-in samples
43
+
44
+ Training achieved **100% accuracy** due to the dataset being clean, consistent, and linearly separable.
45
+
46
+ ## ๐Ÿ“ Repository Structure
47
+
48
+ ```
49
+ checkin_or_not_classifier.json
50
+ README.md
51
+ ```
52
+
53
+ ## ๐Ÿ”ง Inference Example
54
+
55
+ ```python
56
+ from sentence_transformers import SentenceTransformer
57
+ import xgboost as xgb
58
+
59
+ embedder = SentenceTransformer("sentence_transformer_model")
60
+
61
+ booster = xgb.Booster()
62
+ booster.load_model("checkin_or_not_classifier.json")
63
+
64
+ def predict(text):
65
+ emb = embedder.encode([text])
66
+ pred = booster.predict(xgb.DMatrix(emb))[0]
67
+ return ("CHECKIN" if pred >= 0.5 else "NOT_CHECKIN", float(pred))
68
+
69
+ label, score = predict("Today I worked on automation tasks.")
70
+ print(label, score)
71
+ ```
72
+
73
+ ## ๐Ÿ“˜ Intended Use
74
+
75
+ This model is intended for binary classification of short messages. It is not intended for sentiment analysis, toxicity detection, or general-purpose NLP tasks.
76
+
77
+ ## ๐Ÿ“œ License
78
+
79
+ Apache-2.0
80
+
81
+ ## โœจ Author
82
+
83
+ Mazamesso โ€œMazzyโ€ Meba
84
+ University of North Florida
85
+ AI Automation & Engineering
86
+
87
+
88
+