Model Details

Merger Configuration

All source adapters share the following configuration:

Rank (r): 16
Alpha: 16
Target Modules:
- q_proj (Query projection)
- k_proj (Key projection)
- v_proj (Value projection)
- o_proj (Output projection)
- up_proj (Upsampling projection)
- down_proj (Downsampling projection)
- gate_proj (Gate projection)
The order of loading adapters may affect the final result
Equal weights were chosen to maintain balanced influence from each adapter
The merged adapter maintains the same architecture and rank as the original adapters
While this adapter merges multiple fine-tunes, each component was developed as part of independent research efforts to explore and language model capabilities as part of R&D process.

Downloads last month: -; Downloads are not tracked for this model. How to track