mmBERT: a modern multilingual encoder (collection, 16 items, updated Sep 9, 2025). Trained on 3T tokens from over 1,800 languages, achieving SoTA benchmark scores and exceptional low-resource performance.

DINOv3 (collection, 15 items). Foundation models producing excellent dense features, outperforming SotA without fine-tuning. https://arxiv.org/abs/2508.10104

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch (article, May 7, 2024).