picket-cliff/deepl-project-model

Text Classification


picket-cliff committed on Mar 13

Commit febc162 · verified · 1 Parent(s): 58604c5

Update README.md

Files changed (1)

README.md +6 -0


@@ -54,7 +54,9 @@ Deep learning models cannot process raw text; they require numerical tensors. We
 2.	Special Tokens: The tokenizer automatically prepends the [CLS] (Classification) token to the start of the sequence and appends the [SEP] (Separator) token to the end. The final hidden state corresponding to the [CLS] token is what the model uses for the binary classification decision.
 3.	Truncation and Padding: Transformer models require fixed-size input matrices for batch processing. Based on our EDA length distribution, we set max_length = 128.
   o	Sentences longer than 128 tokens were truncated.
   o	Sentences shorter than 128 tokens were padded with the [PAD] token (ID 0).
 4.	Attention Masks: To keep self-attention from attending to meaningless padding tokens, the tokenizer generates an attention_mask (an array with 1 for each real token and 0 for each padding position).
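The special-token, truncation, padding, and attention-mask steps above can be sketched in plain Python. This is a minimal illustration, not the project's preprocessing code: the function name `encode` is hypothetical, and the `[CLS]` and `[SEP]` IDs (101 and 102) are assumed BERT-style values; only the `[PAD]` ID of 0 comes from the text above.

```python
from typing import Dict, List

# Assumed special-token IDs (BERT-style vocabularies use these values);
# only PAD_ID = 0 is stated in the README text.
CLS_ID, SEP_ID, PAD_ID = 101, 102, 0


def encode(token_ids: List[int], max_length: int = 128) -> Dict[str, List[int]]:
    """Add special tokens, then truncate or pad to exactly max_length."""
    # Reserve two positions for [CLS] and [SEP] before truncating.
    body = token_ids[: max_length - 2]
    input_ids = [CLS_ID] + body + [SEP_ID]

    # 1 marks a real token (special tokens included), 0 marks padding.
    attention_mask = [1] * len(input_ids)

    # Pad both lists on the right up to max_length.
    pad_count = max_length - len(input_ids)
    input_ids += [PAD_ID] * pad_count
    attention_mask += [0] * pad_count
    return {"input_ids": input_ids, "attention_mask": attention_mask}


# A short sequence is padded; a long one is truncated (toy max_length of 8).
short = encode([7592, 2088], max_length=8)
long = encode(list(range(1000, 1020)), max_length=8)
```

In the Hugging Face `transformers` library, calling a tokenizer with `padding="max_length"`, `truncation=True`, and `max_length=128` returns the same two fields; whether the authors used exactly those arguments is not stated here.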
@@ -78,9 +80,13 @@ Accuracy, f1 score (macro and weighted)
 ### Results
 When evaluated on an 80-20 split, we obtained:
 •	Accuracy: 99.10%
 •	Macro Average F1-Score: 0.98
 •	Weighted Average F1-Score: 0.99
 For comparison, the dummy baseline achieved 86.6% accuracy.
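To make the gap between the macro and weighted F1 scores and the role of the dummy baseline concrete, here is a small sketch in plain Python. The labels are invented toy data, not the project's test set, and the baseline is assumed to be a majority-class predictor, which the text does not confirm.

```python
from collections import Counter
from typing import List


def f1_for_class(y_true: List[int], y_pred: List[int], label: int) -> float:
    """F1 for one class: 2*TP / (2*TP + FP + FN)."""
    tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
    fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def macro_f1(y_true: List[int], y_pred: List[int]) -> float:
    """Unweighted mean of per-class F1: every class counts equally."""
    labels = sorted(set(y_true))
    return sum(f1_for_class(y_true, y_pred, l) for l in labels) / len(labels)


def weighted_f1(y_true: List[int], y_pred: List[int]) -> float:
    """Per-class F1 averaged by class support (number of true samples)."""
    support = Counter(y_true)
    total = len(y_true)
    return sum(n * f1_for_class(y_true, y_pred, l) for l, n in support.items()) / total


def accuracy(y_true: List[int], y_pred: List[int]) -> float:
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy, imbalanced labels (invented for illustration): 8 negatives, 2 positives.
y_true = [0] * 8 + [1] * 2
y_pred = [0] * 8 + [1, 0]  # one positive example is missed

# Assumed baseline: always predict the most frequent class.
majority = Counter(y_true).most_common(1)[0][0]
baseline_pred = [majority] * len(y_true)

print("model    acc / macro / weighted:",
      accuracy(y_true, y_pred), macro_f1(y_true, y_pred), weighted_f1(y_true, y_pred))
print("baseline acc / macro:",
      accuracy(y_true, baseline_pred), macro_f1(y_true, baseline_pred))
```

On imbalanced data a majority-class predictor scores high accuracy while its macro F1 collapses. If the 86.6% baseline above is of that kind, this is why the F1 scores are reported alongside accuracy.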
 #### Summary
