Chi PRO

ChilleD

AI & ML interests

Natural Language Processing.

Recent Activity

new activity about 24 hours ago

ChilleD/WebHarbor:Add Merriam-Webster mirror assets

liked a model 9 days ago

zai-org/GLM-5.2-FP8

liked a model 9 days ago

zai-org/GLM-5.2

Organizations

upvoted a paper 13 days ago

EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments

Paper • 2606.13681 • Published 14 days ago • 140

upvoted a paper 15 days ago

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 21 days ago • 67

upvoted 2 papers about 1 month ago

You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories

Paper • 2605.21468 • Published May 20 • 51

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published May 19 • 190

upvoted a paper about 2 months ago

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

Paper • 2605.06638 • Published May 7 • 16

upvoted a collection 2 months ago

DeepSeek-V4

Collection

4 items • Updated Apr 24 • 691

upvoted a paper 3 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

upvoted 2 papers 4 months ago

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published Feb 25 • 17

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

Paper • 2602.16745 • Published Feb 18 • 8

upvoted a collection 4 months ago

Agent World Model

Collection

4 items • Updated Feb 11 • 9

upvoted 3 papers 4 months ago

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Paper • 2602.10090 • Published Feb 10 • 53

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 76

Towards Agentic Intelligence for Materials Science

Paper • 2602.00169 • Published Jan 29 • 48

upvoted 2 papers 7 months ago

GroupRank: A Groupwise Reranking Paradigm Driven by Reinforcement Learning

Paper • 2511.11653 • Published Nov 10, 2025 • 59

Adapting Web Agents with Synthetic Supervision

Paper • 2511.06101 • Published Nov 8, 2025 • 7

upvoted 2 papers 8 months ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5, 2025 • 83

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

upvoted a paper 9 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276

upvoted a paper about 1 year ago

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Paper • 2504.10514 • Published Apr 10, 2025 • 48

upvoted a paper over 1 year ago

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published Oct 14, 2024 • 51
