Richard Lian PRO

richardlian

4 39 94

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

liked a Space about 1 month ago

OpenEvals/every-leaderboards

published a model about 2 months ago

lopentu/meta-llama-Llama-3.2-3B-DottedWSD

View all activity

Organizations

upvoted a paper about 1 month ago

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Paper • 2605.30260 • Published May 28 • 44

upvoted an article 7 months ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

lysandre, ArthurZ, cyrilvallez, reach-vb

•

Dec 1, 2025

• 312

upvoted an article 8 months ago

Article

Sentence Transformers is joining Hugging Face!

tomaarsen

•

Oct 22, 2025

• 88

upvoted an article 9 months ago

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll

•

Oct 1, 2025

• 146

upvoted a collection 9 months ago

The Big Benchmarks Collection

Collection

Gathering benchmark spaces on the hub (beyond the Open LLM Leaderboard) • 13 items • Updated Nov 18, 2024 • 266

upvoted a paper 11 months ago

Inverse Scaling in Test-Time Compute

Paper • 2507.14417 • Published Jul 19, 2025 • 28

upvoted an article about 1 year ago

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq

•

Jun 4, 2025

• 120

upvoted 2 papers about 1 year ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83

upvoted 2 articles about 1 year ago

Article

The Transformers Library: standardizing model definitions

lysandre, ArthurZ, pcuenq, julien-c

•

May 15, 2025

• 123

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 613

upvoted a collection about 1 year ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 107 items • Updated 5 days ago • 746

upvoted an article about 1 year ago

Article

Introducing HELMET: Holistically Evaluating Long-context Language Models

hyen, gaotianyu1350, houminmin, kding1, danf, moshew, cdq10131

•

Apr 16, 2025

• 42

upvoted 5 articles over 1 year ago

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Kseniase

•

Mar 17, 2025

• 360

Article

Rearchitecting Hugging Face Uploads and Downloads

port8080, jsulz, erinys

•

Nov 26, 2024

• 50

Article

From Files to Chunks: Improving HF Storage Efficiency

jsulz, erinys

•

Nov 20, 2024

• 74

Article

Xet is on the Hub

assafvayner, brianronan, seanses, jgodlewski, sirahd, jsulz

•

Mar 18, 2025

• 80

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 295

upvoted 2 papers over 1 year ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17, 2025 • 115

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published Jan 23, 2025 • 41

Richard Lian PRO

AI & ML interests

Recent Activity

Organizations

richardlian's activity

Transformers v5: Simple model definitions powering the AI ecosystem

Sentence Transformers is joining Hugging Face!

Introducing RTEB: A New Standard for Retrieval Evaluation

KV Cache from scratch in nanoVLM

The Transformers Library: standardizing model definitions

Vision Language Models (Better, faster, stronger)

Introducing HELMET: Holistically Evaluating Long-context Language Models

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Rearchitecting Hugging Face Uploads and Downloads

From Files to Chunks: Improving HF Storage Efficiency

Xet is on the Hub

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge