This model is designed for classifying Turkish text into different turn-taking categories in a conversation.
Developed by SiriusAI Tech Brain Team
Mission
To enhance conversational AI by accurately detecting turn-taking dynamics in Turkish dialogues, enabling more natural and engaging interactions.
The turn-detector model is capable of classifying responses in Turkish conversations into two distinct categories: agent_response and backchannel. This functionality is crucial for developing advanced voice assistants and dialogue systems that better understand human interactions. By leveraging the power of the BertForSequenceClassification architecture, the model achieves remarkable accuracy and reliability.
Why This Model Matters
High Accuracy: With an impressive accuracy of over 99%, this model ensures reliable classifications in real-world applications.
Enterprise-Grade Performance: Designed for production use, it meets the stringent requirements of enterprise clients.
NLP Expertise: Developed using state-of-the-art natural language processing techniques, it provides a competitive edge in understanding Turkish conversations.
Scalable Solution: Easily integratable into existing systems, allowing for seamless deployment in various applications.
Robust Training: Trained on a substantial dataset, ensuring its effectiveness across diverse conversational contexts.
Model Overview
Property
Value
Architecture
BertForSequenceClassification
Base Model
dbmdz/bert-base-turkish-uncased
Task
Text Classification
Language
Turkish (tr)
Categories
2 labels
Model Size
~110M parameters
Inference Time
~10-15ms (GPU) / ~40-50ms (CPU)
Performance Metrics
Final Evaluation Results
Metric
Score
Description
Macro F1
0.9924
Harmonic mean of precision and recall
MCC
0.9849
Matthews Correlation Coefficient
Accuracy
99.3242%
Ratio of correctly predicted instances to total instances
Per-Class Performance
Category
Accuracy
Correct
Total
agent_response
99.5%
7,429
7,464
backchannel
98.9%
3,741
3,782
Dataset
Dataset Statistics
Split
Samples
Purpose
Train
44,982
Model training
Test
11,246
Model evaluation
Total
56,228
Complete dataset
Category Distribution
Category
Samples
Percentage
Description
turn_action
56,228
100.0%
turn_action category
Subcategory Breakdown
Category
Subcategories
turn_action
agent_response, backchannel
Label Definitions
Label
ID
Description
Turkish Examples
agent_response
0
Represents a direct response from the agent in a conversation
"Merhaba, size nasıl yardımcı olabilirim?"
backchannel
1
Indicates acknowledgment or encouragement from the listener
"Evet", "Anladım"
Important: Category Boundaries
The distinction between agent_response and backchannel is critical. An agent_response represents a substantive reply to a query, while backchannel responses are brief acknowledgments that do not provide new information.
Disclaimer: This model is designed for text classification applications. Always implement with appropriate safeguards and human oversight. Model predictions should inform decisions, not replace human judgment.