llm

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Paper • 2306.00978 • Published Jun 1, 2023 • 11

shenzhi-wang/Llama3-8B-Chinese-Chat
Text Generation • 8B • Updated Jul 4, 2024 • 7.51k • 685