Mohamed Bassam

M-bassam

8 39

AI & ML interests

None yet

Recent Activity

liked a model 3 days ago

Qwen/Qwen3.6-27B

upvoted a collection 7 days ago

DeepSeek-V4

liked a model 10 days ago

nvidia/GLM-5.2-NVFP4

View all activity

Organizations

None yet

upvoted a collection 7 days ago

DeepSeek-V4

Collection

6 items • Updated 10 days ago • 719

upvoted 2 articles 20 days ago

Article

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

ariG23498, ror, sergiopaniego, pcuenq, sayakpaul

•

26 days ago

• 50

Article

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

ariG23498, sayakpaul, sergiopaniego, ror, pcuenq

•

May 29

• 132

upvoted 2 articles 24 days ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 361

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

qgallouedec

•

Dec 4, 2025

• 73

upvoted 2 papers 24 days ago

MoBA: Mixture of Block Attention for Long-Context LLMs

Paper • 2502.13189 • Published Feb 18, 2025 • 19

MiniMax Sparse Attention

Paper • 2606.13392 • Published 26 days ago • 149

upvoted a collection about 1 year ago

MedGemma Release

Collection

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 9 items • Updated Mar 12 • 511

Mohamed Bassam

AI & ML interests

Recent Activity

Organizations

M-bassam's activity

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

KV Caching Explained: Optimizing Transformer Inference Efficiency

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand