llm AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 13 shenzhi-wang/Llama3-8B-Chinese-Chat Text Generation • 8B • Updated Jul 4, 2024 • 6.45k • • 689
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 13
llm AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 13 shenzhi-wang/Llama3-8B-Chinese-Chat Text Generation • 8B • Updated Jul 4, 2024 • 6.45k • • 689
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration Paper • 2306.00978 • Published Jun 1, 2023 • 13