🤝 Open to Collab

Zhu Lin

czl · https://czl.my/

AI & ML interests

Computer Vision, LLM

Recent Activity

updated a dataset about 13 hours ago

czl/xinyi_public_gym

updated a dataset about 13 hours ago

czl/nangang_sports_center

updated a dataset about 13 hours ago

czl/zhongshan_public_gym

upvoted a paper 3 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 167

upvoted an article 3 months ago

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Article • nvidia • Mar 17 • 67

upvoted a paper 3 months ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published Mar 13 • 45

upvoted an article 4 months ago

Forge: Scalable Agent RL Framework and Algorithm

Article • MiniMax-AI • Feb 13 • 155

upvoted 2 articles 6 months ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Article • MiniMax-AI • Oct 30, 2025 • 43

What makes good reasoning data

Article • MiniMax-AI • Oct 30, 2025 • 45

upvoted 3 articles 7 months ago

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Article • spisakjo, darktex, zkwentz, mortimerp9, Sanyam, Hamid-Nazeri, Pankit01, emre0, lewtun, reach-vb • Oct 23, 2025 • 164

There is no such thing as a tokenizer-free lunch

Article • catherinearnett • Sep 25, 2025 • 100

Evaluate Your Own RAG: Why Best Practices Failed Us

Article • charles-azam • Nov 5, 2025 • 14

upvoted a paper 7 months ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 36

upvoted an article 8 months ago

Why Did MiniMax M2 End Up as a Full Attention Model?

Article • MiniMax-AI • Oct 30, 2025 • 80

upvoted a collection 9 months ago

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 14 days ago • 171

upvoted a paper 9 months ago

SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization

Paper • 2510.04961 • Published Oct 6, 2025 • 5

upvoted a collection 10 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 14 days ago • 106

upvoted an article 10 months ago

You could have designed state of the art positional encoding

Article • FL33TW00D-HF • Nov 25, 2024 • 487

upvoted a collection 10 months ago

Instruct datasets

5 items • Updated May 5, 2025 • 6

upvoted 2 collections 12 months ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9, 2025 • 104

Gemma 3n

4 items • Updated Mar 12 • 272

upvoted a collection over 1 year ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 11 days ago • 269

upvoted a paper over 1 year ago

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Paper • 2410.11795 • Published Oct 15, 2024 • 18