Open to Work

14 10

JP2

MJPT2

AI & ML interests

NLP Generative Multimodal Models

Recent Activity

upvoted an article 15 days ago

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

liked a dataset 26 days ago

OX-PIXL/STVQA-7K

liked a model 26 days ago

nyu-visionx/cambrian-8b

View all activity

Organizations

None yet

upvoted an article 15 days ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

28 days ago

• 70

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 152

upvoted an article 2 months ago

Article

Mixture of Experts (MoEs) in Transformers

ariG23498, pcuenq, merve, IlyasMoutawwakil, ArthurZ, sergiopaniego, Molbap

•

Feb 26

• 159

upvoted a collection 3 months ago

L1

Collection

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning • 7 items • Updated Jul 13, 2025 • 9

upvoted 2 articles 3 months ago

Article

What is test-time compute and how to scale it?

Kseniase

•

Feb 6, 2025

• 120

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 328

upvoted a collection 4 months ago

Scaling Test-Time Compute with Open Models

Collection

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6, 2025 • 30

upvoted a paper 4 months ago

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Paper • 2502.02339 • Published Feb 4, 2025 • 23

upvoted 2 articles 4 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr

•

Aug 9, 2025

• 118

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

upvoted an article 7 months ago

Article

PipelineRL

ServiceNow

•

Apr 25, 2025

• 44

upvoted a paper 7 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 130

upvoted an article 8 months ago

Article

There is no such thing as a tokenizer-free lunch

catherinearnett

•

Sep 25, 2025

• 98

upvoted an article about 1 year ago

Article

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

mikelabs

•

Nov 19, 2024

• 12

JP2

AI & ML interests

Recent Activity

Organizations

MJPT2's activity

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Mixture of Experts (MoEs) in Transformers

What is test-time compute and how to scale it?

KV Caching Explained: Optimizing Transformer Inference Efficiency

From GRPO to DAPO and GSPO: What, Why, and How

Preference Optimization for Vision Language Models

PipelineRL

There is no such thing as a tokenizer-free lunch

LLaVA-o1: Let Vision Language Models Reason Step-by-Step