theprint
/

DevilsAdvocate-8B

Text Generation

text-generation-inference

Model card Files Files and versions

theprint commited on Jan 27

Commit

e14a246

·

verified ·

1 Parent(s): 981d02b

Update README.md

Files changed (1) hide show

README.md +12 -0

README.md CHANGED Viewed

@@ -29,6 +29,18 @@ This model is a fine-tuned version of Qwen/Qwen3-8B using the Unsloth framework
 - **Base model:** Qwen/Qwen3-8B
 - **Fine-tuning method:** LoRA with rank 128
 ## Intended Use
 General conversation, project feedback and brainstorming.

 - **Base model:** Qwen/Qwen3-8B
 - **Fine-tuning method:** LoRA with rank 128
+## Benchmark Eval
+| Benchmark | Task Category | Accuracy | Raw Score |
+| :--- | :--- | :--- | :--- |
+| **MMLU** | High School Computer Science | 70.0% | 14/20 |
+| **MMLU** | High School Mathematics | 55.0%** | 11/20 |
+| **GSM8K** | Grade School Math Reasoning | **70.0%** | 14/20 |
+| **ARC** | ARC-Challenge (Science) | **15.0%** | 3/20 |
+### Reproducibility
+Benchmarked using **LabEval**, a lightweight benchmarking suite for M-series Macs and LM Studio.
+[**LabEval on GitHub**](https://github.com/theprint/LabEval)
 ## Intended Use
 General conversation, project feedback and brainstorming.