fix: align RotaryEmbedding with Qwen2Moe pattern for transformers compat
#4 opened 13 days ago
by
kashif
Runnable via dInfer?
👀 1
#3 opened about 1 month ago
by
Muzel
Could you provide the official NVFP4 version? Dear friend.
#2 opened about 1 month ago
by
win10
Support for mlx lm and llama.cpp
➕ 2
#1 opened about 2 months ago
by
Narutoouz