InfosysEnterprise
/

Mify-Coder-2.5B

Model card Files Files and versions

srkchowdary2000 commited on Nov 28, 2025

Commit

ccec14f

·

verified ·

1 Parent(s): bb0a628

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -26,12 +26,15 @@ Mify-Coder-2.5B-v0.1 is a **2.5B-parameter code-focused language model**. It del
 | **Category**   | **Benchmark**       | **# Shots** | **Metric** | **Scores**   |
 |----------------|----------------------|-------------|------------|-------------------|
-| Code Gen       | MBPP                | 0           | pass@1     | 90.70%     |
 | Code Gen       | MBPP+               | 0           | pass@1     | 88.89%      |
 | Code Gen       | HumanEval           | 0           | pass@1     | 53.05%      |
 | Code Gen       | HumanEval+          | 0           | pass@1     | 46.95%     |
 | Code Gen       | NumpyEval           | 0           | pass@1     | 56.44%     |
 | Code Gen       | PandasEval          | 0           | pass@1     | 53.47%      |
 - Outperforms larger models on algorithmic reasoning tasks while maintaining competitive general coding and security-oriented capabilities.

 | **Category**   | **Benchmark**       | **# Shots** | **Metric** | **Scores**   |
 |----------------|----------------------|-------------|------------|-------------------|
+| Code Gen       | MBPP                | 0           | pass@1     | 89.23%     |
 | Code Gen       | MBPP+               | 0           | pass@1     | 88.89%      |
 | Code Gen       | HumanEval           | 0           | pass@1     | 53.05%      |
 | Code Gen       | HumanEval+          | 0           | pass@1     | 46.95%     |
 | Code Gen       | NumpyEval           | 0           | pass@1     | 56.44%     |
 | Code Gen       | PandasEval          | 0           | pass@1     | 53.47%      |
+| Tool Use       | BFCL v1             | 0           | acc        | 79.19%     |
+| Tool Use       | BFCL v2             | 0           | acc        | 55.26%      |
 - Outperforms larger models on algorithmic reasoning tasks while maintaining competitive general coding and security-oriented capabilities.