MultiLangModel-Best / README.md
SOTAagi2030's picture
Upload folder using huggingface_hub
79dcb1d verified
metadata
license: mit
library_name: transformers

MultiLangModel

MultiLangModel

1. Introduction

MultiLangModel excels at translation and multilingual tasks. This checkpoint is selected based on the best translation benchmark score.

2. Evaluation Results

Comprehensive Benchmark Results

Benchmark MLModel-v1 MLModel-v2 MultiLangModel
Core Reasoning Tasks Math Reasoning 0.510 0.535 0.508
Logical Reasoning 0.789 0.801 0.812
Common Sense 0.716 0.702 0.724
Language Understanding Reading Comprehension 0.671 0.685 0.688
Question Answering 0.582 0.599 0.610
Text Classification 0.803 0.811 0.825
Sentiment Analysis 0.777 0.781 0.790
Generation Tasks Code Generation 0.615 0.631 0.630
Creative Writing 0.588 0.579 0.603
Dialogue Generation 0.621 0.635 0.647
Summarization 0.745 0.755 0.767
Specialized Capabilities Translation 0.782 0.799 0.804
Knowledge Retrieval 0.651 0.668 0.683
Instruction Following 0.733 0.749 0.757
Safety Evaluation 0.718 0.701 0.721

Overall Performance Summary

MultiLangModel achieves top performance on translation tasks while maintaining strong results across all other benchmarks.

3. License

MIT License

4. Contact

Open an issue on GitHub.