Sean Ma

seanmamasde

54 147

Seanmamasde

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

google/fleurs

liked a model 8 days ago

Qwen/Qwen3.6-27B

liked a model 13 days ago

Qwen/Qwen3-Embedding-8B

Organizations

None yet

upvoted a collection about 1 month ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.82k

upvoted a paper about 1 month ago

Group-in-Group Policy Optimization for LLM Agent Training

Paper • 2505.10978 • Published May 16, 2025 • 23

upvoted a collection about 2 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.7k

upvoted a collection 2 months ago

DeepSeek-V4

Collection

6 items • Updated 3 days ago • 703

upvoted a paper 3 months ago

Self-Distilled RLVR

Paper • 2604.03128 • Published Apr 3 • 179

upvoted a collection 3 months ago

Gemma 4

Collection

15 items • Updated 19 days ago • 999

upvoted a paper 3 months ago

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 157

upvoted a paper 4 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 526

upvoted a collection 6 months ago

SpecBundle

Collection

A collection of production-grade draft models for speculative decoding • 18 items • Updated Apr 15 • 19

upvoted an article 7 months ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Article • natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 418

upvoted a collection 7 months ago

DeepSeek-V3.2

Collection

4 items • Updated Dec 1, 2025 • 544

upvoted 2 articles 7 months ago

We Got Claude to Fine-Tune an Open Source LLM

Article • burtenshaw, evalstate • Dec 4, 2025 • 630

Transformers v5: Simple model definitions powering the AI ecosystem

Article • lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 312

upvoted 2 papers 7 months ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published Nov 17, 2025 • 122

upvoted an article 7 months ago

Common AI Model Formats

Article • ngxson • Feb 27, 2025 • 73

upvoted a paper 7 months ago

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published Nov 19, 2025 • 234

upvoted a paper 8 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 104

upvoted an article 8 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 780

upvoted a paper 9 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276
