Mark
Makrrr
ยท
AI & ML interests
NLP, RLHF, IR
Recent Activity
upvoted a paper 24 days ago
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses upvoted a paper about 2 months ago
SkillOS: Learning Skill Curation for Self-Evolving Agents updated a model 2 months ago
CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500