Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop š
38.7
TFLOPS
13
14
93
fahrizalfarid
akahana
Follow
sundarshanmu's profile picture
kargaranamir's profile picture
evalstate's profile picture
11 followers
Ā·
50 following
fahrizalfarid
fahrizalfarid
AI & ML interests
NLP
Recent Activity
reacted
to
SeaWolf-AI
's
post
with š„
about 1 month ago
šļø Smol AI WorldCup: A 4B Model Just Beat 8B ā Here's the Data We evaluated 18 small language models from 12 makers on 125 questions across 7 languages. The results challenge the assumption that bigger is always better. Community Article: https://huggingface.co/blog/FINAL-Bench/smol-worldcup Live Leaderboard: https://huggingface.co/spaces/ginigen-ai/smol-worldcup Dataset: https://huggingface.co/datasets/ginigen-ai/smol-worldcup What we found: ā Gemma-3n-E4B (4B, 2GB RAM) outscores Qwen3-8B (8B, 5.5GB). Doubling parameters gained only 0.4 points. RAM cost: 2.75x more. ā GPT-OSS-20B fits in 1.5GB yet matches Champions-league dense models requiring 8.5GB. MoE architecture is the edge AI game-changer. ā Thinking models hurt structured output. DeepSeek-R1-7B scores 8.7 points below same-size Qwen3-8B and runs 2.7x slower. ā A 1.3B model fabricates confident fake content 80% of the time when prompted with nonexistent entities. Qwen3 family hits 100% trap detection across all sizes. ā Qwen3-1.7B (1.2GB) outscores Mistral-7B, Llama-3.1-8B, and DeepSeek-R1-14B. Latest architecture at 1.7B beats older architecture at 14B. What makes this benchmark different? Most benchmarks ask "how smart?" ā we measure five axes simultaneously: Size, Honesty, Intelligence, Fast, Thrift (SHIFT). Our ranking metric WCS = sqrt(SHIFT x PIR_norm) rewards models that are both high-quality AND efficient. Smart but massive? Low rank. Tiny but poor? Also low. Top 5 by WCS: 1. GPT-OSS-20B ā WCS 82.6 ā 1.5GB ā Raspberry Pi tier 2. Gemma-3n-E4B ā WCS 81.8 ā 2.0GB ā Smartphone tier 3. Llama-4-Scout ā WCS 79.3 ā 240 tok/s ā Fastest model 4. Qwen3-4B ā WCS 76.6 ā 2.8GB ā Smartphone tier 5. Qwen3-1.7B ā WCS 76.1 ā 1.2GB ā IoT tier Built in collaboration with the FINAL Bench research team. Interoperable with ALL Bench Leaderboard for full small-to-large model comparison. Dataset is open under Apache 2.0 (125 questions, 7 languages). We welcome new model submissions.
updated
a dataset
about 2 months ago
akahana/wikipedia-id-conv
published
a dataset
about 2 months ago
akahana/wikipedia-id-conv
View all activity
Organizations
None yet
akahana
's models
58
Sort:Ā Recently updated
akahana/wikipedia-gpt2
0.1B
ā¢
Updated
Dec 30, 2024
akahana/tebak-gambar-mobilevit
2.07M
ā¢
Updated
Aug 8, 2024
akahana/mnist-mobilevit
958k
ā¢
Updated
Aug 8, 2024
akahana/tinybert-javanese
Fill-Mask
ā¢
4.42M
ā¢
Updated
Aug 3, 2024
ā¢
3
akahana/minibert-indonesia
11.2M
ā¢
Updated
Aug 2, 2024
ā¢
1
akahana/smallbert-javanese
Fill-Mask
ā¢
28.8M
ā¢
Updated
Jul 31, 2024
akahana/distilgpt2-javanese
Text Generation
ā¢
81.9M
ā¢
Updated
Jul 24, 2024
ā¢
2
akahana/tinygpt2-javanese
Text Generation
ā¢
6.96M
ā¢
Updated
Jul 23, 2024
ā¢
11
akahana/gpt2-javanese
Text Generation
ā¢
0.1B
ā¢
Updated
Jul 22, 2024
ā¢
3
akahana/mini-roberta-javanese
Fill-Mask
ā¢
22.5M
ā¢
Updated
Jul 19, 2024
ā¢
21
akahana/roberta-javanese
Fill-Mask
ā¢
0.1B
ā¢
Updated
Jul 18, 2024
ā¢
8
akahana/albert-javanese
Fill-Mask
ā¢
Updated
Jul 12, 2024
ā¢
7
akahana/mt5-small-google
0.2B
ā¢
Updated
Jul 10, 2024
ā¢
1
akahana/mt5-id-to-en
Updated
Jul 10, 2024
akahana/roberta-base-indonesia-dev
Fill-Mask
ā¢
Updated
Jul 2, 2024
akahana/harry-potter-gpt2
Text Generation
ā¢
1.75M
ā¢
Updated
Nov 8, 2023
ā¢
5
akahana/tiny-roberta-indonesia
Feature Extraction
ā¢
828k
ā¢
Updated
Sep 19, 2023
ā¢
20
ā¢
2
akahana/indonesia-emotion-roberta
Text Classification
ā¢
0.1B
ā¢
Updated
Sep 19, 2023
ā¢
21
akahana/vit-base-cats-vs-dogs
Image Classification
ā¢
85.8M
ā¢
Updated
Sep 19, 2023
ā¢
71
ā¢
7
akahana/gpt2-indonesia
Text Generation
ā¢
0.2B
ā¢
Updated
Sep 19, 2023
ā¢
16
ā¢
4
akahana/roberta-base-indonesia
Feature Extraction
ā¢
0.1B
ā¢
Updated
Sep 19, 2023
ā¢
23
akahana/indonesia-emotion-distilbert
Text Classification
ā¢
68.1M
ā¢
Updated
Sep 15, 2023
ā¢
3
akahana/asl-vit
Image Classification
ā¢
85.8M
ā¢
Updated
Sep 15, 2023
ā¢
35
ā¢
3
akahana/wav2vec2-base-indonesia-v2
Updated
Dec 14, 2021
akahana/indonesia-distilbert
Updated
Dec 11, 2021
akahana/indonesia-emotion-roberta-small
Text Classification
ā¢
Updated
Dec 8, 2021
ā¢
10
akahana/indonesia-roberta-small
Fill-Mask
ā¢
Updated
Dec 8, 2021
ā¢
12
akahana/indonesia-sentiment-roberta
Text Classification
ā¢
Updated
Dec 7, 2021
ā¢
12
Previous
1
2
Next