---
license: apache-2.0
language:
- en
base_model:
- meta-llama/Llama-2-7b-hf
---
# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->
This repo contains a 2:4 sparse version of the LLaMA2-7B model, trained with methods from the AAAI 2025 paper [Pruning Large Language Models with Semi-Structural Adaptive Sparse Training](https://arxiv.org/abs/2407.20584).
### Model Description
Same architecture as LLaMA2-7B, but the weights of the linear layers conform to the 2:4 sparse pattern: in every contiguous group of four values along a row, at most two are non-zero.
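A minimal NumPy sketch of what the 2:4 constraint means (the helper function is illustrative, not part of this repo):

```python
import numpy as np

def is_2_4_sparse(weight: np.ndarray) -> bool:
    """Return True if every contiguous group of 4 values along the last
    axis has at most 2 non-zeros (the 2:4 semi-structured pattern)."""
    groups = weight.reshape(-1, 4)
    return bool(np.all(np.count_nonzero(groups, axis=1) <= 2))

# Toy 2x8 matrix where each group of 4 holds exactly 2 non-zeros.
sparse_w = np.array([
    [1.0, 0.0, 0.0, 2.0,  0.0, 3.0, 4.0, 0.0],
    [0.0, 5.0, 6.0, 0.0,  7.0, 0.0, 0.0, 8.0],
])
dense_w = np.ones((2, 8))

print(is_2_4_sparse(sparse_w))  # True
print(is_2_4_sparse(dense_w))   # False
```

This pattern is what NVIDIA Ampere-class sparse tensor cores accelerate, which is the practical motivation for 2:4 over unstructured sparsity.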