yuchuqing

rain2sun

AI & ML interests

None yet

Recent Activity

liked a model 23 days ago

FutureLivingLab/iFlow-ROME

liked a model 30 days ago

zai-org/GLM-5.2

liked a model about 1 month ago

moonshotai/Kimi-K2.7-Code

Organizations

None yet

upvoted a collection 8 months ago

Olmo 3

Artifacts for the Olmo 3 release. • 7 items • Updated Mar 2 • 174

upvoted an article about 1 year ago

Article

Open R1: Update #2

open-r1

•

Feb 10, 2025

• 219

upvoted 3 collections about 1 year ago

AM-Thinking-v1

3 items • Updated May 19, 2025 • 3

Math-Code-Reason

Rule-verifiable datasets; reference answers required • 22 items • Updated Jan 5 • 1

Qwen3

84 items • Updated Dec 31, 2025 • 1.83k

upvoted 5 collections over 1 year ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated Mar 2 • 101

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 16 days ago • 92

Pythia Scaling Suite

Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26, 2025 • 33

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated Nov 15, 2024 • 47

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 731

upvoted 4 articles almost 2 years ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

+3

kashif, edbeeching, lewtun, lvwerra, osanseviero

•

Jan 18, 2024

• 84

Article

A failed experiment: Infini-Attention, and why we should keep trying?

+1

neuralink, lvwerra, thomwolf

•

Aug 14, 2024

• 76

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

+1

loubnabnl, anton-l, davanstrien

•

Mar 20, 2024

• 115

upvoted a collection almost 2 years ago

"Physics of Language Models" series

7 items • Updated Dec 22, 2025 • 55