Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Alex Zhang's picture
1 2 2

Alex Zhang

a1zhang
tolgacangoz's profile picture sergiopaniego's profile picture simonguozirui's profile picture
·
https://alexzhang13.github.io
  • a1zhang
  • alexzhang13

AI & ML interests

None yet

Recent Activity

published a model about 18 hours ago
a1zhang/rlm-qwen3-8b
updated a model 12 days ago
a1zhang/rlm-qwen3-8b
replied to sergiopaniego's post 12 days ago
Recursive Language Models (RLM) is a new interface for LLMs with cool ideas by Alex Zhang! ⚠️ LLMs struggle with long prompts → attention overload & lost info 🔄 RLMs inspect, split & call themselves on chunks, then aggregate results ✅ Handles millions of tokens, reduces noise, improves reasoning 💡 System prompt guides recursion 🎯 RLM trajectories can be used for RL training or distillation (OpenEnv+TRL!!) We're adding it to OpenEnv (with Kashif Rasul): https://github.com/meta-pytorch/OpenEnv/pull/282 More resources: > Paper: https://huggingface.co/papers/2512.24601 > Paper blog: https://alexzhang13.github.io/blog/2025/rlm/ > RLM repo: https://github.com/alexzhang13/rlm
View all activity

Organizations

Sakana AI's profile picture Prime Intellect's profile picture Scaling Intelligence's profile picture Oolong: Evaluating Long Context Reasoning and Aggregation Capabilities's profile picture Scratch to Scale's profile picture

Papers 3

arxiv:2505.18134
arxiv:2502.10517
arxiv:2410.03859

models 1

a1zhang/rlm-qwen3-8b

8B • Updated 12 days ago • 34

datasets 0

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs