---
base_model:
- unsloth/Llama-3.3-70B-Instruct
library_name: transformers
tags:
- mergekit
- merge
---
# AbominationSnowPig

One of the intermediate steps used to build [Nohobby/L3.3-Prikol-70B-v0.2](https://huggingface.co/Nohobby/L3.3-Prikol-70B-v0.2).
## Merge Details

### Merge Method

This model was merged using the NuSLERP merge method.
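For orientation, NuSLERP interpolates corresponding tensors of the two models along the hypersphere rather than linearly; with `nuslerp_row_wise: true` (see the config below) the interpolation is applied per row of each weight matrix. The underlying formula is the textbook SLERP, shown here as a reference rather than a description of mergekit's exact implementation, with \\(t \in [0, 1]\\) the interpolation factor derived from the two models' relative `weight` values for a given tensor:

$$
\operatorname{slerp}(t; v_0, v_1) = \frac{\sin\!\big((1 - t)\,\Omega\big)}{\sin \Omega}\, v_0 + \frac{\sin\!\big(t\,\Omega\big)}{\sin \Omega}\, v_1,
\qquad
\Omega = \arccos\!\left(\frac{v_0 \cdot v_1}{\lVert v_0 \rVert\, \lVert v_1 \rVert}\right)
$$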
### Models Merged

The following models were included in the merge:

* [unsloth/Llama-3.3-70B-Instruct](https://huggingface.co/unsloth/Llama-3.3-70B-Instruct)
* Step1
### Configuration

The following YAML configuration was used to produce this model:
```yaml
dtype: bfloat16
tokenizer_source: base
merge_method: nuslerp
parameters:
  nuslerp_row_wise: true
models:
  - model: unsloth/Llama-3.3-70B-Instruct
    parameters:
      weight:
        - filter: v_proj
          value: [0, 0, 1, 1, 1, 1, 1, 1, 1, 0, 0]
        - filter: o_proj
          value: [1, 0, 1, 0, 0, 0, 0, 0, 1, 1, 1]
        - filter: up_proj
          value: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
        - filter: gate_proj
          value: [0, 0, 1, 1, 1, 1, 1, 1, 1, 0, 0]
        - filter: down_proj
          value: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
        - value: 0
  - model: Step1
    parameters:
      weight:
        - filter: v_proj
          value: [1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1]
        - filter: o_proj
          value: [0, 1, 0, 1, 1, 1, 1, 1, 0, 0, 0]
        - filter: up_proj
          value: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0]
        - filter: gate_proj
          value: [1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1]
        - filter: down_proj
          value: [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]
        - value: 1
```
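A note on the `weight` lists: in mergekit, a list-valued parameter is treated as a gradient and interpolated across the model's transformer layers, and the trailing bare `- value:` entry is the default weight for tensors not matched by any filter. The sketch below (a hypothetical helper, not mergekit's actual code) shows roughly how an 11-point gradient could expand to one weight per layer of Llama-3.3-70B's 80 layers:

```python
# Rough illustration of mergekit-style gradient expansion. Anchor placement and
# interpolation details are mergekit internals; this is only a sketch.
import numpy as np

NUM_LAYERS = 80  # hidden layers in Llama-3.3-70B

def expand_gradient(points: list[float], num_layers: int = NUM_LAYERS) -> np.ndarray:
    """Linearly interpolate a short gradient list to one value per layer."""
    anchors = np.linspace(0, num_layers - 1, num=len(points))
    return np.interp(np.arange(num_layers), anchors, points)

# v_proj weight of the base model from the config above: near 0 at the first and
# last layers (those rows come mostly from Step1) and 1 through the middle.
print(expand_gradient([0, 0, 1, 1, 1, 1, 1, 1, 1, 0, 0]).round(2))
```

To reproduce the merge, the config can be saved to a file and run with mergekit's CLI, e.g. `mergekit-yaml config.yml ./AbominationSnowPig --cuda`, assuming `Step1` resolves to the intermediate checkpoint on local disk.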