How many GPUs for 8 or higher concurrency using RTX 3090s Rig ?
1
#9 opened about 1 month ago
by
BiggestFox
Any plans for Xiaomi's MiMo V2 Flash?
2
#8 opened about 1 month ago
by
droussis
Are there any plans to make BF16/FP8 AWQ INT4 version of Qwen/Qwen3.5-397B-A17B?
❤️ 1
2
#7 opened about 2 months ago
by
zuuky
Links in README
1
#6 opened about 2 months ago
by
Jon-Nielsen
accuracy
26
#4 opened about 2 months ago
by
ktsaou
accuracy benchmark
1
#3 opened about 2 months ago
by
mwalol
FP8 + INT4 version
14
#2 opened about 2 months ago
by
bigstorm
Cant get it to work on 8x RTX3090
14
#1 opened about 2 months ago
by
maglat