fahrizalfarid's picture

In a Training Loop 🔄

fahrizalfarid

akahana

·

AI & ML interests

NLP

Recent Activity

reacted to SeaWolf-AI's post with 🔥 about 1 month ago

🏟️ Smol AI WorldCup: A 4B Model Just Beat 8B — Here's the Data We evaluated 18 small language models from 12 makers on 125 questions across 7 languages. The results challenge the assumption that bigger is always better. Community Article: https://huggingface.co/blog/FINAL-Bench/smol-worldcup Live Leaderboard: https://huggingface.co/spaces/ginigen-ai/smol-worldcup Dataset: https://huggingface.co/datasets/ginigen-ai/smol-worldcup What we found: → Gemma-3n-E4B (4B, 2GB RAM) outscores Qwen3-8B (8B, 5.5GB). Doubling parameters gained only 0.4 points. RAM cost: 2.75x more. → GPT-OSS-20B fits in 1.5GB yet matches Champions-league dense models requiring 8.5GB. MoE architecture is the edge AI game-changer. → Thinking models hurt structured output. DeepSeek-R1-7B scores 8.7 points below same-size Qwen3-8B and runs 2.7x slower. → A 1.3B model fabricates confident fake content 80% of the time when prompted with nonexistent entities. Qwen3 family hits 100% trap detection across all sizes. → Qwen3-1.7B (1.2GB) outscores Mistral-7B, Llama-3.1-8B, and DeepSeek-R1-14B. Latest architecture at 1.7B beats older architecture at 14B. What makes this benchmark different? Most benchmarks ask "how smart?" — we measure five axes simultaneously: Size, Honesty, Intelligence, Fast, Thrift (SHIFT). Our ranking metric WCS = sqrt(SHIFT x PIR_norm) rewards models that are both high-quality AND efficient. Smart but massive? Low rank. Tiny but poor? Also low. Top 5 by WCS: 1. GPT-OSS-20B — WCS 82.6 — 1.5GB — Raspberry Pi tier 2. Gemma-3n-E4B — WCS 81.8 — 2.0GB — Smartphone tier 3. Llama-4-Scout — WCS 79.3 — 240 tok/s — Fastest model 4. Qwen3-4B — WCS 76.6 — 2.8GB — Smartphone tier 5. Qwen3-1.7B — WCS 76.1 — 1.2GB — IoT tier Built in collaboration with the FINAL Bench research team. Interoperable with ALL Bench Leaderboard for full small-to-large model comparison. Dataset is open under Apache 2.0 (125 questions, 7 languages). We welcome new model submissions.

updated a dataset about 2 months ago

akahana/wikipedia-id-conv

published a dataset about 2 months ago

akahana/wikipedia-id-conv

View all activity

Organizations

None yet

akahana 's models 58

akahana/wikipedia-gpt2

0.1B • Updated Dec 30, 2024

akahana/tebak-gambar-mobilevit

2.07M • Updated Aug 8, 2024

akahana/mnist-mobilevit

958k • Updated Aug 8, 2024

akahana/tinybert-javanese

Fill-Mask • 4.42M • Updated Aug 3, 2024 • 3

akahana/minibert-indonesia

11.2M • Updated Aug 2, 2024 • 1

akahana/smallbert-javanese

Fill-Mask • 28.8M • Updated Jul 31, 2024

akahana/distilgpt2-javanese

Text Generation • 81.9M • Updated Jul 24, 2024 • 2

akahana/tinygpt2-javanese

Text Generation • 6.96M • Updated Jul 23, 2024 • 11

akahana/gpt2-javanese

Text Generation • 0.1B • Updated Jul 22, 2024 • 3

akahana/mini-roberta-javanese

Fill-Mask • 22.5M • Updated Jul 19, 2024 • 21

akahana/roberta-javanese

Fill-Mask • 0.1B • Updated Jul 18, 2024 • 8

akahana/albert-javanese

Fill-Mask • Updated Jul 12, 2024 • 7

akahana/mt5-small-google

0.2B • Updated Jul 10, 2024 • 1

akahana/mt5-id-to-en

Updated Jul 10, 2024

akahana/roberta-base-indonesia-dev

Fill-Mask • Updated Jul 2, 2024

akahana/harry-potter-gpt2

Text Generation • 1.75M • Updated Nov 8, 2023 • 5

akahana/tiny-roberta-indonesia

Feature Extraction • 828k • Updated Sep 19, 2023 • 20 • 2

akahana/indonesia-emotion-roberta

Text Classification • 0.1B • Updated Sep 19, 2023 • 21

akahana/vit-base-cats-vs-dogs

Image Classification • 85.8M • Updated Sep 19, 2023 • 71 • 7

akahana/gpt2-indonesia

Text Generation • 0.2B • Updated Sep 19, 2023 • 16 • 4

akahana/roberta-base-indonesia

Feature Extraction • 0.1B • Updated Sep 19, 2023 • 23

akahana/indonesia-emotion-distilbert

Text Classification • 68.1M • Updated Sep 15, 2023 • 3

akahana/asl-vit

Image Classification • 85.8M • Updated Sep 15, 2023 • 35 • 3

akahana/wav2vec2-base-indonesia-v2

Updated Dec 14, 2021

akahana/indonesia-distilbert

Updated Dec 11, 2021

akahana/indonesia-emotion-roberta-small

Text Classification • Updated Dec 8, 2021 • 10

akahana/indonesia-roberta-small

Fill-Mask • Updated Dec 8, 2021 • 12

akahana/indonesia-sentiment-roberta

Text Classification • Updated Dec 7, 2021 • 12