---
language:
- en
base_model:
- meta-llama/Llama-2-7b-hf
---
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
This repo contains a 2:4 sparse version of the LLaMA2-7B model, trained with methods from the AAAI25 paper [Pruning Large Language Models with Semi-Structural Adaptive Sparse Training](https://arxiv.org/abs/2407.20584).
### Model Description
Same architecture as LLaMA2-7B, but the weights of the linear layers conform to the 2:4 sparse pattern: at most two non-zero values in every contiguous group of four weights.
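As an illustration of the pattern (not part of this repo's code), a minimal NumPy sketch that checks whether a weight matrix satisfies 2:4 sparsity along its last axis:

```python
import numpy as np

def satisfies_2_4_sparsity(weights: np.ndarray) -> bool:
    """Return True if every group of 4 consecutive weights (along the
    last axis) contains at most 2 non-zero entries -- the 2:4 pattern."""
    groups = weights.reshape(-1, 4)
    nonzero_per_group = np.count_nonzero(groups, axis=1)
    return bool(np.all(nonzero_per_group <= 2))

# Toy 2x8 "linear layer" weight matrix pruned to the 2:4 pattern:
w = np.array([
    [0.5, 0.0, -0.3, 0.0,  0.0, 1.2, 0.0, -0.7],
    [0.0, 0.9,  0.0, 0.4,  0.2, 0.0, 0.0,  0.1],
])
print(satisfies_2_4_sparsity(w))  # True: each group of 4 has <= 2 non-zeros
```

This layout is what lets the sparse weights be accelerated by hardware that supports semi-structured sparsity.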