Sentiment Analysis Models

This repository contains two logistic regression models that predict sentiment levels from text embeddings, one trained on each expert's annotations.

Model Details

  • Base embedding model: mixedbread-ai/mxbai-embed-large-v1
  • Architecture: LogisticRegression (scikit-learn)
  • Training data: Custom sentiment dataset with dual expert annotations
  • Data split: 70% training, 15% development, 15% test
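
The 70/15/15 split can be illustrated with a short sketch. The README does not say how the split was made, so the shuffling, the `seed` value, and the function name `split_70_15_15` are assumptions for illustration only.

```python
import random


def split_70_15_15(items, seed=0):
    """Shuffle items and split them 70% / 15% / 15%.

    Hypothetical helper: the actual split procedure (seed, shuffling,
    stratification) is not documented in this repository.
    """
    items = list(items)
    random.Random(seed).shuffle(items)  # deterministic shuffle for a given seed

    # Integer arithmetic avoids floating-point rounding surprises.
    n_train = len(items) * 70 // 100
    n_dev = len(items) * 15 // 100

    train = items[:n_train]
    dev = items[n_train:n_train + n_dev]
    test = items[n_train + n_dev:]  # the remainder goes to the test set
    return train, dev, test


if __name__ == "__main__":
    train, dev, test = split_70_15_15(range(100))
    print(len(train), len(dev), len(test))
```

With 100 items this yields partitions of 70, 15, and 15 items; for sizes that do not divide evenly, the remainder lands in the test partition.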

Performance Metrics

Development Set

Against Expert 1:

  • Exact match: 49.27%
  • Within 1 level: 96.05%

Against Expert 2:

  • Exact match: 41.00%
  • Within 1 level: 93.05%

Test Set

Against Expert 1:

  • Exact match: 49.32%
  • Within 1 level: 94.93%

Against Expert 2:

  • Exact match: 41.44%
  • Within 1 level: 91.51%
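
For readers who want to reproduce numbers of this kind, the two metrics can be computed as sketched below. This assumes sentiment labels are integer levels and that "within 1 level" means the absolute difference between prediction and annotation is at most 1; the README does not define the metric formally, and `agreement_metrics` is a hypothetical name.

```python
def agreement_metrics(predicted, reference):
    """Return (exact_match_pct, within_one_pct) for two label sequences.

    Assumes labels are integer sentiment levels; "within 1 level" is
    interpreted as |predicted - reference| <= 1 (an assumption).
    """
    predicted = list(predicted)
    reference = list(reference)
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not predicted:
        raise ValueError("at least one label pair is required")

    diffs = [abs(int(p) - int(r)) for p, r in zip(predicted, reference)]
    n = len(diffs)
    exact = 100.0 * sum(d == 0 for d in diffs) / n
    within_one = 100.0 * sum(d <= 1 for d in diffs) / n
    return exact, within_one


if __name__ == "__main__":
    # Toy labels, not taken from the dataset.
    print(agreement_metrics([1, 2, 3, 5], [1, 3, 3, 2]))
```

In the toy example, two of four predictions match exactly and three of four fall within one level of the reference.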

Usage

See inference.py for an example of how to use these models to predict sentiment for new text.
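
As a rough orientation, inference might look like the sketch below. It is not a copy of inference.py, which remains the authoritative example. It assumes the classifiers were trained on the raw output of `SentenceTransformer.encode` with no extra prompt or normalization, and the helper names `load_pipeline` and `predict_sentiment` are hypothetical.

```python
def load_pipeline(model_path="model1.joblib"):
    """Load the embedding model and one of the saved classifiers.

    Requires the sentence-transformers and joblib packages. Imports are
    local so that predict_sentiment can be used without them.
    """
    import joblib
    from sentence_transformers import SentenceTransformer

    encoder = SentenceTransformer("mixedbread-ai/mxbai-embed-large-v1")
    classifier = joblib.load(model_path)
    return encoder, classifier


def predict_sentiment(texts, encoder, classifier):
    """Embed the texts and return one predicted sentiment level per text.

    Assumption: the classifier expects embeddings exactly as produced by
    encoder.encode; check inference.py for the preprocessing actually used.
    """
    embeddings = encoder.encode(list(texts))
    return list(classifier.predict(embeddings))


if __name__ == "__main__":
    encoder, classifier = load_pipeline("model1.joblib")
    print(predict_sentiment(["The service was excellent."], encoder, classifier))
```

Passing `"model2.joblib"` to `load_pipeline` would give predictions aligned with Expert 2's annotations instead.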

Model Files

  • model1.joblib: Model trained on Expert 1 annotations
  • model2.joblib: Model trained on Expert 2 annotations

Data Files

  • dev_results.csv: Complete predictions on the development set
  • test_results.csv: Complete predictions on the test set