Transformers

MyAwesomeModel

Model Description

The MyAwesomeModel v2 (Step 1000) is the best performing model checkpoint from our training process, with an overall evaluation score of 0.710.

Evaluation Results

Comprehensive Benchmark Results

Benchmark Model1 Model2 Model1-v2 MyAwesomeModel
Core Reasoning Tasks Math Reasoning 0.510 0.535 0.521 0.550
Logical Reasoning 0.789 0.801 0.810 0.819
Common Sense 0.716 0.702 0.725 0.736
Language Understanding Reading Comprehension 0.671 0.685 0.690 0.700
Question Answering 0.582 0.599 0.601 0.607
Text Classification 0.803 0.811 0.820 0.828
Sentiment Analysis 0.777 0.781 0.790 0.792
Generation Tasks Code Generation 0.615 0.631 0.640 0.650
Creative Writing 0.588 0.579 0.601 0.610
Dialogue Generation 0.621 0.635 0.639 0.644
Summarization 0.745 0.755 0.760 0.767
Specialized Capabilities Translation 0.782 0.799 0.801 0.804
Knowledge Retrieval 0.651 0.668 0.670 0.676
Instruction Following 0.733 0.749 0.751 0.758
Safety Evaluation 0.718 0.701 0.725 0.739

Performance Summary

The model achieves its best performance in:

  • Text Classification: 0.828
  • Logical Reasoning: 0.819
  • Translation: 0.804
  • Sentiment Analysis: 0.792

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("mazextest2026/MyAwesomeModel-TestRepo")
tokenizer = AutoTokenizer.from_pretrained("mazextest2026/MyAwesomeModel-TestRepo")
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support