Eval-03 / README.md
Mr-FineTuner's picture
Upload folder using huggingface_hub
15db98f verified
# Sentiment Analysis Models
This repository contains two logistic regression models trained to predict sentiment scores.
## Model Details
- Base embedding model: mixedbread-ai/mxbai-embed-large-v1
- Architecture: LogisticRegression (scikit-learn)
- Training data: Custom sentiment dataset with dual expert annotations
- Data split: 70% training, 15% development, 15% test
## Performance Metrics
### Development Set
#### Against Expert 1:
- Exact match: 49.27%
- Within 1 level: 96.05%
#### Against Expert 2:
- Exact match: 41.00%
- Within 1 level: 93.05%
### Test Set
#### Against Expert 1:
- Exact match: 49.32%
- Within 1 level: 94.93%
#### Against Expert 2:
- Exact match: 41.44%
- Within 1 level: 91.51%
## Usage
See `inference.py` for an example of how to use these models to predict sentiment for new text.
## Model Files
- `model1.joblib`: Model trained on Expert 1 annotations
- `model2.joblib`: Model trained on Expert 2 annotations
## Data Files
- `dev_results.csv`: Complete predictions on development set
- `test_results.csv`: Complete predictions on test set