马逸川

YichuanMa
  • Entarochuan

AI & ML interests

(M)LLM

Recent Activity

updated a dataset 2 days ago
YichuanMa/LoGos-Rollout-1K
published a dataset 2 days ago
YichuanMa/LoGos-Rollout-1K
updated a dataset 2 days ago
YichuanMa/Go-GRPO-1K

Organizations

None yet

upvoted a paper 3 days ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published 6 days ago • 50
upvoted an article 3 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28, 2025 • 886
upvoted a paper 5 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 262
upvoted a paper 7 months ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7, 2025 • 39
upvoted a paper 10 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306
upvoted a paper 11 months ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24, 2025 • 73