SOTAagi2030
/

MultiLangModel-Best

Feature Extraction

Model card Files Files and versions

MultiLangModel-Best / README.md

SOTAagi2030's picture

Upload folder using huggingface_hub

79dcb1d verified about 5 hours ago

|

history blame contribute delete

1.95 kB

	---
	license: mit
	library_name: transformers
	---
	# MultiLangModel

	<!-- markdownlint-disable first-line-h1 -->
	<!-- markdownlint-disable html -->
	<!-- markdownlint-disable no-duplicate-header -->

	<div align="center">
	<img src="figures/fig1.png" width="60%" alt="MultiLangModel" />
	</div>
	<hr>

	<div align="center" style="line-height: 1;">
	<a href="LICENSE" style="margin: 2px;">
	<img alt="License" src="figures/fig2.png" style="display: inline-block; vertical-align: middle;"/>
	</a>
	</div>

	## 1. Introduction

	MultiLangModel excels at translation and multilingual tasks. This checkpoint is selected based on the best translation benchmark score.

	<p align="center">
	<img width="80%" src="figures/fig3.png">
	</p>

	## 2. Evaluation Results

	### Comprehensive Benchmark Results

	<div align="center">

	\| \| Benchmark \| MLModel-v1 \| MLModel-v2 \| MultiLangModel \|
	\|---\|---\|---\|---\|---\|
	\| Core Reasoning Tasks \| Math Reasoning \| 0.510 \| 0.535 \| 0.508 \|
	\| \| Logical Reasoning \| 0.789 \| 0.801 \| 0.812 \|
	\| \| Common Sense \| 0.716 \| 0.702 \| 0.724 \|
	\| Language Understanding \| Reading Comprehension \| 0.671 \| 0.685 \| 0.688 \|
	\| \| Question Answering \| 0.582 \| 0.599 \| 0.610 \|
	\| \| Text Classification \| 0.803 \| 0.811 \| 0.825 \|
	\| \| Sentiment Analysis \| 0.777 \| 0.781 \| 0.790 \|
	\| Generation Tasks \| Code Generation \| 0.615 \| 0.631 \| 0.630 \|
	\| \| Creative Writing \| 0.588 \| 0.579 \| 0.603 \|
	\| \| Dialogue Generation \| 0.621 \| 0.635 \| 0.647 \|
	\| \| Summarization \| 0.745 \| 0.755 \| 0.767 \|
	\| Specialized Capabilities\| Translation \| 0.782 \| 0.799 \| 0.804 \|
	\| \| Knowledge Retrieval \| 0.651 \| 0.668 \| 0.683 \|
	\| \| Instruction Following \| 0.733 \| 0.749 \| 0.757 \|
	\| \| Safety Evaluation \| 0.718 \| 0.701 \| 0.721 \|

	</div>

	### Overall Performance Summary
	MultiLangModel achieves top performance on translation tasks while maintaining strong results across all other benchmarks.

	## 3. License
	[MIT License](LICENSE)

	## 4. Contact
	Open an issue on GitHub.