Lang Feng

langfeng01

https://langfengq.github.io/

langfengQ

AI & ML interests

PhD student @ NTU Singapore

Recent Activity

upvoted a paper about 4 hours ago

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

upvoted a paper 4 months ago

CaveAgent: Transforming LLMs into Stateful Runtime Operators

authored a paper 4 months ago

Online Causal Kalman Filtering for Stable and Effective Policy Optimization

Organizations

None yet

Collections 2

Papers 5

arxiv:2602.10609

arxiv:2602.08847

arxiv:2601.04786

arxiv:2506.13705

Models 3

langfeng01/GiGPO-Qwen2.5-7B-Instruct-WebShop

8B • Updated Sep 28, 2025 • 361

langfeng01/GiGPO-Qwen2.5-7B-Instruct-ALFWorld

8B • Updated Sep 28, 2025 • 852 • 1

langfeng01/TimeMaster-SFT-Qwen2.5-VL-3B-CTU

4B • Updated Jun 21, 2025 • 17 • 4

Datasets 0

None public yet