LeonceNsh (Leonce Nshuti)

upvoted an article 6 months ago

Article

CUGA on Hugging Face: Democratizing Configurable AI Agents

ibm-research • Dec 15, 2025 • 67

upvoted an article 9 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+9

clefourrier, gregmialz, mlcu, mortimerp9, XciD, tfrere, evijit, RomainFroger, dheeraj7596, CarolinePascal, upiter • Sep 22, 2025 • 136

upvoted 2 papers 9 months ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22, 2025 • 154

WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16, 2025 • 93

upvoted a collection 11 months ago

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 676

upvoted 2 articles over 1 year ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain • Jan 30, 2025 • 356

Article

Welcome to Inference Providers on the Hub 🔥

+5

burkaygur, zeke, aton2006, hassanelmghari, sbrandeis, kramp, julien-c • Jan 28, 2025 • 494

upvoted a collection over 1 year ago

Marqo-Ecommerce-Embeddings

Collection

State-of-the-art embedding models fine-tuned for the ecommerce domain. +67% increase in evaluation metrics vs ViT-B-16-SigLIP. • 10 items • Updated Nov 14, 2024 • 18

upvoted an article almost 2 years ago

Article

Llama can now see and run on your device - welcome Llama 3.2

+5

merve, philschmid, osanseviero, reach-vb, lewtun, ariG23498, pcuenq • Sep 25, 2024 • 191

upvoted a collection almost 2 years ago

DataGemma Release

Collection

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Mar 12 • 89

upvoted 2 articles almost 2 years ago

Article

Document Similarity Search with ColPali

fsommers • Sep 21, 2024 • 52

Article

WWDC 24: Running Mistral 7B with Core ML

+2

pcuenq, FL33TW00D-HF, reach-vb, osanseviero • Jul 22, 2024 • 65

upvoted a collection almost 2 years ago

FP8 LLMs for vLLM

Collection

Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 42 items • Updated Mar 2 • 81

upvoted 2 articles about 2 years ago

Article

BM25 for Python: Achieving high performance while simplifying dependencies with BM25S⚡

xhluca • Jul 9, 2024 • 86

Article

Training and Finetuning Embedding Models with Sentence Transformers

tomaarsen • May 28, 2024 • 275

upvoted a collection about 2 years ago

Hermes 2

Collection

Nous' Flagship LLM Series • 21 items • Updated Mar 2 • 109

Leonce Nshuti PRO

AI & ML interests

Organizations

