Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
mratsim
/
MiniMax-M2.1-FP8-INT4-AWQ
like
39
Text Generation
Safetensors
48 datasets
llm-compressor
minimax_m2
fp8
awq
conversational
vllm
code
devops
software engineering
engineer
developer
architect
stem
agent
custom_code
compressed-tensors
arxiv:
5 papers
License:
modified-mit
Model card
Files
Files and versions
xet
Community
10
main
MiniMax-M2.1-FP8-INT4-AWQ
Commit History
mention VRAM cost of BF16
5b18eb5
verified
mratsim
commited on
Jan 14
remove -FP8 in model name
7d33890
verified
mratsim
commited on
Jan 14
Add mention to mratsim/MiniMax-M2.1-BF16-INT4-AWQ
f60e498
verified
mratsim
commited on
Jan 14
Fix Engrish
7bde446
verified
mratsim
commited on
Jan 6
typo in script command
70f7944
verified
mratsim
commited on
Jan 6
Add repetition_penalty and frequency_penalty suggestion
a3aac1b
verified
mratsim
commited on
Jan 5
Update README.md
2d8be0e
verified
mratsim
commited on
Jan 3
Update README.md
bd78c3a
verified
mratsim
commited on
Jan 3
Update README.md
b177683
verified
mratsim
commited on
Jan 3
Add performance figures
6a70db0
verified
mratsim
commited on
Jan 3
Update README.md
2e6cbf9
verified
mratsim
commited on
Jan 3
Create model card
87abc8a
verified
mratsim
commited on
Jan 3
Create calibrate_software_engineer.yaml
3c9a12b
verified
mratsim
commited on
Jan 2
Upload folder using huggingface_hub
34f3f23
verified
mratsim
commited on
Jan 2
initial commit
8d6de6f
verified
mratsim
commited on
Jan 2