Junlin Zhou

jlzhou

46 63 137

edwardzjl

AI & ML interests

None yet

Recent Activity

liked a dataset 7 days ago

allenai/tmax-15k-open-instruct

upvoted a paper 9 days ago

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

liked a dataset 20 days ago

BAAI/TACO

View all activity

Organizations

upvoted a paper 9 days ago

Multi-LCB: Extending LiveCodeBench to Multiple Programming Languages

Paper • 2606.20517 • Published 13 days ago • 60

upvoted a collection 3 months ago

Nemotron-Post-Training-v3

Collection

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated 19 days ago • 168

upvoted a paper 5 months ago

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published Feb 9 • 73

upvoted 2 papers 6 months ago

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Paper • 2505.19640 • Published May 26, 2025 • 15

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

Paper • 2512.15745 • Published Dec 10, 2025 • 90

upvoted a paper 7 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 141

upvoted an article 8 months ago

Article

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

cxdu

•

Oct 24, 2024

• 14

upvoted an article 10 months ago

Article

Diffusion Language Models: The New Paradigm

ProCreations

•

Jun 10, 2025

• 51

upvoted a paper 10 months ago

Rope to Nope and Back Again: A New Hybrid Attention Strategy

Paper • 2501.18795 • Published Jan 30, 2025 • 13

upvoted an article 11 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

patrickvonplaten

•

Mar 1, 2020

• 301

upvoted a paper 11 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14, 2025 • 90

upvoted 2 articles 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

thomwolf, matthieu-lapeyre

•

Jul 9, 2025

• 804

upvoted 5 papers about 1 year ago

Don't Pay Attention

Paper • 2506.11305 • Published Jun 12, 2025 • 8

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Paper • 2506.06205 • Published Jun 6, 2025 • 30

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods

Paper • 2505.17870 • Published May 23, 2025 • 5

Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Paper • 2312.03209 • Published Dec 6, 2023 • 21

upvoted an article about 1 year ago

Article

Uncensor any LLM with abliteration

mlabonne

•

Jun 13, 2024

• 870

upvoted a paper about 1 year ago

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published Apr 14, 2025 • 10

Junlin Zhou

AI & ML interests

Recent Activity

Organizations

jlzhou's activity

Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick

Diffusion Language Models: The New Paradigm

How to generate text: using different decoding methods for language generation with Transformers

SmolLM3: smol, multilingual, long-context reasoner

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Uncensor any LLM with abliteration