Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Bowen's picture
7 2

Bowen

PeterJinGo
Lriver's profile picture benpaodexiniu's profile picture John6666's profile picture
·

AI & ML interests

None yet

Organizations

ptllama's profile picture rubricrm's profile picture Cell-O1's profile picture archive's profile picture longRAG's profile picture

upvoted a paper about 2 months ago

OpenDecoder: Open Large Language Model Decoding to Incorporate Document Quality in RAG

Paper • 2601.09028 • Published Jan 13 • 34
upvoted a paper 8 months ago

MIRIX: Multi-Agent Memory System for LLM-Based Agents

Paper • 2507.07957 • Published Jul 10, 2025 • 80
upvoted a collection 10 months ago

Search-R1-v0.3

Collection
RL with outcome reward + format reward. https://arxiv.org/abs/2505.15117 • 12 items • Updated Aug 12, 2025 • 4
upvoted a paper 10 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5, 2025 • 81
upvoted 2 collections 12 months ago

Search-R1-v0.2

Collection
Exploration with a more stable RL pipeline with outcome-only reward and scaled-up LLMs. https://arxiv.org/abs/2503.09516 • 26 items • Updated Aug 12, 2025 • 5

Search-R1

Collection
Preliminary checkpoints with outcome-only RL. • 15 items • Updated Aug 12, 2025 • 17
upvoted a paper about 1 year ago

Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 38
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs