马逸川

YichuanMa

Entarochuan

AI & ML interests

(M)LLM

Recent Activity

authored a paper about 2 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

authored a paper about 2 months ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

authored a paper about 2 months ago

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Organizations

None yet

authored 4 papers about 2 months ago

upvoted a paper about 2 months ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published Mar 26 • 133

liked a dataset 2 months ago

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 17 • 3

upvoted a paper 2 months ago

Timely Machine: Awareness of Time Makes Test-Time Scaling Agentic

Paper • 2601.16486 • Published Jan 23 • 1

upvoted a paper 3 months ago

Mixing Expert Knowledge: Bring Human Thoughts Back To the Game of Go

Paper • 2601.16447 • Published Jan 23 • 1

updated 2 datasets 3 months ago

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 34

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 31

updated a model 3 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 46 • 4

updated a dataset 3 months ago

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 17 • 3

New activity in YichuanMa/Expert-Go-SFT-100K 3 months ago

Clarification on the two distinct data formats

#2 opened 4 months ago by peiyao-sentient

published 3 datasets 4 months ago

YichuanMa/LoGos-Rollout-1K

Viewer • Updated Mar 2 • 1k • 34

YichuanMa/Go-GRPO-1K

Viewer • Updated Mar 2 • 1k • 31

YichuanMa/Expert-Go-SFT-100K

Viewer • Updated Mar 2 • 100k • 17 • 3

upvoted a paper 4 months ago

TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization

Paper • 2601.16480 • Published Jan 23 • 50

liked a model 4 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 46 • 4

upvoted an article 7 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • eliebak, lvwerra, lewtun • Jan 28, 2025 • 889

published a model 7 months ago

YichuanMa/LoGos-7B

Text Generation • 8B • Updated Mar 2 • 46 • 4
