Layer-pruning strategy that selects the best candidate layers to delete from the Llama-8B Instruct model. 25% of the layers were discarded, and the model still produces legible text. Next steps include healing the pruned model with PEFT fine-tuning.
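The card does not say how the layers were chosen, so the following is only a minimal sketch of one possible approach: score every contiguous block of layers and drop the lowest-scoring one. The 32-layer stack, the integer stand-ins for layers, the `scores` values, and all function names are assumptions made for illustration; none of them comes from this model's actual pruning code.

```python
from typing import List, Sequence


def num_layers_to_drop(total_layers: int, fraction: float) -> int:
    """Number of layers removed when pruning `fraction` of the stack."""
    return int(round(total_layers * fraction))


def best_block_start(block_scores: Sequence[float]) -> int:
    """Return the start index of the lowest-scoring block.

    block_scores[i] is a hypothetical measure of how much the hidden
    state changes across layers i .. i+n-1. A low score suggests the
    block contributes little and is the cheapest one to remove.
    """
    return min(range(len(block_scores)), key=lambda i: block_scores[i])


def prune_block(layers: List, start: int, n: int) -> List:
    """Return a new layer list with layers[start : start + n] removed."""
    if start < 0 or n < 0 or start + n > len(layers):
        raise ValueError("block falls outside the layer stack")
    return layers[:start] + layers[start + n:]


# Assumption: a 32-layer decoder stack, represented here by plain integers.
layers = list(range(32))
n = num_layers_to_drop(len(layers), 0.25)

# Made-up scores, one per possible block start (32 - n + 1 of them).
scores = [abs(i - 20) / 10 for i in range(len(layers) - n + 1)]

start = best_block_start(scores)
pruned = prune_block(layers, start, n)
```

In a real run the integers would be the model's decoder layers and the scores would come from measurements on calibration data. The pruned stack would then be the starting point for the PEFT healing step.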