out_interp
This is a merge of pre-trained language models created using mergekit.
Merge Details
Merge Method
This model was merged using the Karcher Mean merge method using Qwen/Qwen3-0.6B-Base as a base.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
merge_method: karcher
dtype: bfloat16
base_model: Qwen/Qwen3-0.6B-Base
models:
- model: Qwen/Qwen3-0.6B-Base
parameters:
weight: 1.0
- model: AIPlans/Qwen3-0.6B-ReMax
parameters:
weight: 1.0
parameters:
normalize: true
max_iter: 10
tol: 1.0e-05
- Downloads last month
- 1