Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
12
Cem
cemt
Follow
0 followers
·
7 following
AI & ML interests
None yet
Recent Activity
reacted
to
wolfram
's
post
with 👍
5 days ago
Finally finished my extensive **Qwen 3 evaluations** across a range of formats and quantisations, focusing on **MMLU-Pro** (Computer Science). A few take-aways stood out - especially for those interested in local deployment and performance trade-offs: 1️⃣ **Qwen3-235B-A22B** (via Fireworks API) tops the table at **83.66%** with ~55 tok/s. 2️⃣ But the **30B-A3B Unsloth** quant delivered **82.20%** while running locally at ~45 tok/s and with zero API spend. 3️⃣ The same Unsloth build is ~5x faster than Qwen's **Qwen3-32B**, which scores **82.20%** as well yet crawls at <10 tok/s. 4️⃣ On Apple silicon, the **30B MLX** port hits **79.51%** while sustaining ~64 tok/s - arguably today's best speed/quality trade-off for Mac setups. 5️⃣ The **0.6B** micro-model races above 180 tok/s but tops out at **37.56%** - that's why it's not even on the graph (50 % performance cut-off). All local runs were done with LM Studio on an M4 MacBook Pro, using Qwen's official recommended settings. **Conclusion:** Quantised 30B models now get you ~98 % of frontier-class accuracy - at a fraction of the latency, cost, and energy. For most local RAG or agent workloads, they're not just good enough - they're the new default. Well done, Qwen - you really whipped the llama's ass! And to OpenAI: for your upcoming open model, please make it MoE, with toggleable reasoning, and release it in many sizes. *This* is the future!
liked
a dataset
7 months ago
Turkish-NLI/legal_nli_TR_V1
liked
a model
7 months ago
openai/gpt-oss-20b
View all activity
Organizations
cemt
's models
8
Sort: Recently updated
cemt/Alpaca-llama-3-4bit
Text Generation
•
8B
•
Updated
Apr 30, 2024
•
3
cemt/Alpaca-llama-3-8b-bnb-16bit
Text Generation
•
Updated
Apr 29, 2024
•
1
cemt/Alpaca-llama-3-8b-bnb-4bit
Text Generation
•
Updated
Apr 29, 2024
cemt/Alpaca-llama-3-8b-bnb-4bit-gguf
8B
•
Updated
Apr 29, 2024
•
3
cemt/Alpaca-llama-3-8b-bnb-4bit-model
Updated
Apr 29, 2024
cemt/Wordpress-Mistral-7B-Fine-Tune
Text Generation
•
7B
•
Updated
Apr 26, 2024
•
44
•
2
cemt/WikiSQL-Phi-2-Super
Text Generation
•
Updated
Apr 23, 2024
•
1
cemt/OrpoLlama-3-8B
Text Generation
•
8B
•
Updated
Apr 23, 2024