| | ---
|
| | library_name: transformers
|
| | tags:
|
| | - mergekit
|
| | - merge
|
| | base_model:
|
| | - deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
|
| | - Qwen/Qwen2.5-7B-Instruct
|
| | - bunnycore/Qwen-2.5-7b-rp-lora
|
| | - Qwen/Qwen2.5-7B-Instruct
|
| | language:
|
| | - zho
|
| | - eng
|
| | - fra
|
| | - spa
|
| | - por
|
| | - deu
|
| | - ita
|
| | - rus
|
| | - jpn
|
| | - kor
|
| | - vie
|
| | - tha
|
| | - ara
|
| | model-index:
|
| | - name: Qwen-2.5-7B-R1-Stock
|
| | results:
|
| | - task:
|
| | type: text-generation
|
| | name: Text Generation
|
| | dataset:
|
| | name: IFEval (0-Shot)
|
| | type: HuggingFaceH4/ifeval
|
| | args:
|
| | num_few_shot: 0
|
| | metrics:
|
| | - type: inst_level_strict_acc and prompt_level_strict_acc
|
| | value: 75.73
|
| | name: strict accuracy
|
| | source:
|
| | url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bunnycore/Qwen-2.5-7B-R1-Stock
|
| | name: Open LLM Leaderboard
|
| | - task:
|
| | type: text-generation
|
| | name: Text Generation
|
| | dataset:
|
| | name: BBH (3-Shot)
|
| | type: BBH
|
| | args:
|
| | num_few_shot: 3
|
| | metrics:
|
| | - type: acc_norm
|
| | value: 34.85
|
| | name: normalized accuracy
|
| | source:
|
| | url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bunnycore/Qwen-2.5-7B-R1-Stock
|
| | name: Open LLM Leaderboard
|
| | - task:
|
| | type: text-generation
|
| | name: Text Generation
|
| | dataset:
|
| | name: MATH Lvl 5 (4-Shot)
|
| | type: hendrycks/competition_math
|
| | args:
|
| | num_few_shot: 4
|
| | metrics:
|
| | - type: exact_match
|
| | value: 0.0
|
| | name: exact match
|
| | source:
|
| | url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bunnycore/Qwen-2.5-7B-R1-Stock
|
| | name: Open LLM Leaderboard
|
| | - task:
|
| | type: text-generation
|
| | name: Text Generation
|
| | dataset:
|
| | name: GPQA (0-shot)
|
| | type: Idavidrein/gpqa
|
| | args:
|
| | num_few_shot: 0
|
| | metrics:
|
| | - type: acc_norm
|
| | value: 6.6
|
| | name: acc_norm
|
| | source:
|
| | url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bunnycore/Qwen-2.5-7B-R1-Stock
|
| | name: Open LLM Leaderboard
|
| | - task:
|
| | type: text-generation
|
| | name: Text Generation
|
| | dataset:
|
| | name: MuSR (0-shot)
|
| | type: TAUR-Lab/MuSR
|
| | args:
|
| | num_few_shot: 0
|
| | metrics:
|
| | - type: acc_norm
|
| | value: 8.05
|
| | name: acc_norm
|
| | source:
|
| | url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bunnycore/Qwen-2.5-7B-R1-Stock
|
| | name: Open LLM Leaderboard
|
| | - task:
|
| | type: text-generation
|
| | name: Text Generation
|
| | dataset:
|
| | name: MMLU-PRO (5-shot)
|
| | type: TIGER-Lab/MMLU-Pro
|
| | config: main
|
| | split: test
|
| | args:
|
| | num_few_shot: 5
|
| | metrics:
|
| | - type: acc
|
| | value: 36.6
|
| | name: accuracy
|
| | source:
|
| | url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=bunnycore/Qwen-2.5-7B-R1-Stock
|
| | name: Open LLM Leaderboard
|
| | ---
|
| | # merge
|
| |
|
| | This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
|
| |
|
| | ## Merge Details
|
| | ### Merge Method
|
| |
|
| | This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) as a base.
|
| |
|
| | ### Models Merged
|
| |
|
| | The following models were included in the merge:
|
| | * [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B)
|
| | * [Qwen/Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) + [bunnycore/Qwen-2.5-7b-rp-lora](https://huggingface.co/bunnycore/Qwen-2.5-7b-rp-lora)
|
| |
|
| | ### Configuration
|
| |
|
| | The following YAML configuration was used to produce this model:
|
| |
|
| | ```yaml
|
| | models:
|
| | - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
|
| | - model: Qwen/Qwen2.5-7B-Instruct
|
| | - model: Qwen/Qwen2.5-7B-Instruct+bunnycore/Qwen-2.5-7b-rp-lora
|
| | base_model: Qwen/Qwen2.5-7B-Instruct
|
| | merge_method: model_stock
|
| | parameters:
|
| | dtype: bfloat16
|
| | tokenizer_source: Qwen/Qwen2.5-7B-Instruct
|
| |
|
| | ```
|
| |
|
| | # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
|
| | Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/bunnycore__Qwen-2.5-7B-R1-Stock-details)
|
| |
|
| | | Metric |Value|
|
| | |-------------------|----:|
|
| | |Avg. |26.97|
|
| | |IFEval (0-Shot) |75.73|
|
| | |BBH (3-Shot) |34.85|
|
| | |MATH Lvl 5 (4-Shot)| 0.00|
|
| | |GPQA (0-shot) | 6.60|
|
| | |MuSR (0-shot) | 8.05|
|
| | |MMLU-PRO (5-shot) |36.60|
|
| |
|
| |
|