alex shengzhi li

alexshengzhili

https://scholar.google.com/citations?user=UBxhmfIAAAAJ&hl=en&oi=ao

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

SWE-Together: Evaluating Coding Agents in Interactive User Sessions

Paper • 2606.29957 • Published 6 days ago • 13

updated a dataset 23 days ago

alexshengzhili/dataclaw-harbor-candidates

Viewer • Updated 23 days ago • 1.47k • 51 • 1

upvoted a paper 3 months ago

IntelliAsk: Learning to Ask High-Quality Research Questions via RLVR

Paper • 2602.15849 • Published Jan 23 • 3

commented on a paper 3 months ago

Agent READMEs: An Empirical Study of Context Files for Agentic Coding

Paper • 2511.12884 • Published Nov 17, 2025 • 29

published a dataset 3 months ago

alexshengzhili/dataclaw-harbor-candidates

Viewer • Updated 23 days ago • 1.47k • 51 • 1

upvoted 2 articles 4 months ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 167

Reproducing NanoBanana 2's "Window Seat" with AI-Scientist-V3

Article • alexshengzhili • Mar 1 • 1

published an article 4 months ago

Reproducing NanoBanana 2's "Window Seat" with AI-Scientist-V3

Article • alexshengzhili • Mar 1 • 1

upvoted an article 4 months ago

AI Scientist v3: Agent Native refactor. Scale from 1-hour to 24 hours with Reviewer agent

Article • alexshengzhili • Mar 1 • 4

updated a dataset 4 months ago

alexshengzhili/ai-scientist-blog-assets

Viewer • Updated Mar 1 • 369 • 273

published a dataset 4 months ago

alexshengzhili/ai-scientist-blog-assets

Viewer • Updated Mar 1 • 369 • 273

published an article 4 months ago

AI Scientist v3: Agent Native refactor. Scale from 1-hour to 24 hours with Reviewer agent

Article • alexshengzhili • Mar 1 • 4

commented on a paper about 1 year ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2, 2025 • 61

updated a model about 1 year ago

alexshengzhili/testing_lora

Updated May 3, 2025

published a model about 1 year ago

alexshengzhili/testing_lora

Updated May 3, 2025

updated a model about 1 year ago

alexshengzhili/qwen2_5_vl_7b_repcount

8B • Updated Apr 30, 2025 • 2

published a model about 1 year ago

alexshengzhili/qwen2_5_vl_7b_repcount

8B • Updated Apr 30, 2025 • 2

updated a dataset about 1 year ago

alexshengzhili/generalreasoning-stage2-combined-filtered-kept

Viewer • Updated Apr 29, 2025 • 25.4k • 17

published a dataset about 1 year ago

alexshengzhili/generalreasoning-stage2-combined-filtered-kept

Viewer • Updated Apr 29, 2025 • 25.4k • 17

updated a dataset about 1 year ago

alexshengzhili/generalreasoning-stage2-combined-filtered

Viewer • Updated Apr 29, 2025 • 130k • 23
