Ahmed Morsi

eramax

https://emolike.net/

AI & ML interests

None yet

Recent Activity

liked a model about 12 hours ago

deepreinforce-ai/Ornith-1.0-397B

liked a model 1 day ago

speakleash/Bielik-11B-v3.0-Instruct

liked a model 2 days ago

ValiantLabs/Qwen3.6-27B-Esper4

Organizations

None yet

upvoted a paper 3 days ago

Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models

Paper • 2606.11167 • Published 17 days ago • 5

upvoted a paper 9 days ago

VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models

Paper • 2606.16140 • Published 11 days ago • 117

upvoted a collection 28 days ago

Bonsai Image

Collection

6 items • Updated 22 days ago • 87

upvoted 2 collections 2 months ago

DFlash

Collection

Block Diffusion for Flash Speculative Decoding • 22 items • Updated 11 days ago • 136

Qwen3.6

Collection

4 items • Updated Apr 22 • 418

upvoted an article 2 months ago

Article

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

lightonai • Feb 19 • 22

upvoted a collection 2 months ago

MiniMax-M2.7 REAP

Collection

6 items • Updated Apr 20 • 1

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 909

upvoted a collection 4 months ago

Quantized Qwen3.5

Collection

Verified models. Compatible with Transformers v5.3 and vLLM v0.16.1rc1 (nightly). Under evaluation. • 9 items • Updated Mar 12 • 9

upvoted a collection 12 months ago

NextCoder

Collection

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9, 2025 • 79

upvoted an article 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 780

upvoted a paper 12 months ago

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks

Paper • 2505.16901 • Published May 22, 2025 • 48

upvoted a collection about 1 year ago

MiniMax-M1

Collection

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated Apr 15 • 119

upvoted an article about 1 year ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb • Jun 12, 2025 • 164

upvoted a collection about 1 year ago

Deepseek Papers

Collection

Deepseek papers collection • 32 items • Updated 4 days ago • 352

upvoted 3 articles about 1 year ago

Article

Uncensor any LLM with abliteration

mlabonne • Jun 13, 2024 • 868

Article

You could have designed state of the art positional encoding

FL33TW00D-HF • Nov 25, 2024 • 487

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 614

upvoted 2 collections about 1 year ago

OpenVision

Collection

27 items • Updated Aug 15, 2025 • 33

Absolute Zero Reasoner

Collection

6 items • Updated May 9, 2025 • 56
