🏆 Smol AI WorldCup: A 4B Model Just Beat 8B. Here's the Data
We evaluated 18 small language models from 12 makers on 125 questions across 7 languages. The results challenge the assumption that bigger is always better.
✅ A 1.3B model fabricates confident fake content 80% of the time when prompted with nonexistent entities. The Qwen3 family hits 100% trap detection across all sizes.
✅ Qwen3-1.7B (1.2GB) outscores Mistral-7B, Llama-3.1-8B, and DeepSeek-R1-14B. Latest architecture at 1.7B beats older architecture at 14B.
What makes this benchmark different?
Most benchmarks ask "how smart?" We measure five axes simultaneously: Size, Honesty, Intelligence, Fast, Thrift (SHIFT). Our ranking metric WCS = sqrt(SHIFT × PIR_norm) rewards models that are both high-quality AND efficient. Smart but massive? Low rank. Tiny but poor? Also low.
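As a quick illustration of the ranking math, here is a minimal Python sketch; the assumption that both SHIFT and PIR_norm are pre-scaled to [0, 1] is ours, not stated in the post:

```python
import math

def wcs(shift: float, pir_norm: float) -> float:
    """World Cup Score: geometric mean of the SHIFT composite and
    normalized PIR. Assumes both inputs are pre-scaled to [0, 1]."""
    return math.sqrt(shift * pir_norm)

# The geometric mean punishes imbalance: excelling on one axis
# cannot compensate for collapsing on the other.
print(wcs(0.9, 0.3))  # tiny-but-poor profile     -> ~0.52
print(wcs(0.3, 0.9))  # smart-but-massive profile -> ~0.52
print(wcs(0.8, 0.8))  # balanced profile          -> 0.80
```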
The enable_expert_parallel flag hiding the complexity of GroupedGemmParallel + RouterParallel behind a single config is a great DX win: distributing experts across devices used to require a lot of custom plumbing.
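To make the "custom plumbing" concrete, here is a toy Python sketch of the dispatch step expert parallelism performs under the hood; everything below is an illustrative stand-in, not the actual GroupedGemmParallel/RouterParallel API:

```python
# Toy sketch of expert parallelism: each device owns a slice of the
# experts, and routed tokens must be dispatched to the owning device.
NUM_EXPERTS = 8
NUM_DEVICES = 4

def owner(expert_id: int) -> int:
    """Round-robin mapping of experts to devices."""
    return expert_id % NUM_DEVICES

# The dispatch step a single enable_expert_parallel flag now hides:
# group tokens by the device that holds their routed expert's weights.
routed = [(0, 3), (1, 6), (2, 1), (3, 6)]  # (token_id, expert_id)
per_device: dict[int, list[int]] = {d: [] for d in range(NUM_DEVICES)}
for token_id, expert_id in routed:
    per_device[owner(expert_id)].append(token_id)

print(per_device)  # {0: [], 1: [2], 2: [1, 3], 3: [0]}
```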
🥇 Claw for All: The ultimate all-rounder. Simplifies deployment for both devs & pros with a seamless web/mobile experience.
🥈 OpenClaw Launch: Speed is king. Deploy your apps in under 30 seconds with a single click.
🥉 ClawTeam: Skip the setup. Get pre-configured AI agent blueprints built specifically for OpenClaw.
4️⃣ vibeclaw: Local-first. Run OpenClaw in your browser sandbox in literally 1 second.
5️⃣ Tinkerclaw: The startup favorite. Zero-code platform to deploy, manage, and scale AI assistants.
6️⃣ ClawWrapper: The "last mile" tool. Simplifies the entire packaging and launch process.
Which one are you adding to your stack? 🛠️ (Source: OpenClaw Directory)
🔥 UPGRADE in Kai: 30B Scaling! 🔥

NoesisLab/Kai-30B-Instruct

We are incredibly excited to announce that the Kai-30B-Instruct model and its official Space are now LIVE! 🚀

If you've been following the journey from Kai-0.35B to Kai-3B, you know we're rethinking how models reason. Tired of verbose, slow Chain-of-Thought (CoT) outputs that flood your screen with self-talk? So are we.

Kai-30B-Instruct scales up our Adaptive Dual-Search Distillation (ADS) framework. By bridging classical A* heuristic search with continuous gradient descent, we use an information-theoretic log-barrier to physically prune high-entropy reasoning paths during training. The result? Pure implicit reasoning. The model executes structured logic, arithmetic carries, and branch selections as a reflex in a single forward pass, no external scaffolding required.

At 3B, we observed a phase transition where the model achieved "logical crystallization". Now, at 30B, we are giving the ADS regularizer the massive representational capacity it needs to tackle higher-order symbolic abstractions and complex reasoning tasks.

🧪 Test Kai yourself in our new Space: NoesisLab/Kai-30B-Instruct
📦 Model Weights: NoesisLab/Kai-30B-Instruct

Bring your hardest math, logic, and coding benchmarks. We invite the community to stress-test the limits of the penalty wall! 🧱💥
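Since the post does not show the regularizer itself, here is a toy PyTorch sketch of what a log-barrier on reasoning-path entropy could look like; this is our guess at the general technique, not NoesisLab's actual ADS code, and all names are hypothetical:

```python
import math
import torch

def entropy_log_barrier(branch_logits: torch.Tensor,
                        weight: float = 0.1) -> torch.Tensor:
    """Toy log-barrier penalty on per-step branch entropy.

    As a step's entropy H approaches the maximum log(K), the term
    -log(log(K) - H) grows without bound, forming a 'wall' that
    pushes training away from diffuse, high-entropy reasoning paths.
    Illustrative guess at the technique, not the ADS implementation.
    """
    k = branch_logits.shape[-1]
    probs = torch.softmax(branch_logits, dim=-1)
    entropy = -(probs * torch.log(probs + 1e-9)).sum(dim=-1)
    gap = (math.log(k) - entropy).clamp(min=1e-6)  # keep barrier finite
    return weight * (-torch.log(gap)).mean()

# Usage: add the barrier to the task loss so gradient descent and the
# barrier jointly shape which reasoning branches survive training.
logits = torch.randn(4, 10, requires_grad=True)  # (steps, branches)
penalty = entropy_log_barrier(logits)
penalty.backward()
print(penalty.item())
```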