arxiv:2312.07104
Lianmin
lmzheng
AI & ML interests
Training and serving large models
Recent Activity
liked a model 1 day ago
modal-labs/Qwen3.5-397B-A17B-DFlash liked a model about 2 months ago
sgl-project/DeepSeek-V4-Flash-FP8 liked a model 6 months ago
XiaomiMiMo/MiMo-V2-Flash