Phyo Phyo

ugh-45

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

AgentCompass: A Unified Evaluation Infrastructure for Agent Capabilities

upvoted a paper 5 days ago

SIEVE: Structure-Aware Data Selection for Imitation Learning with VLA Models

liked a model 9 days ago

Pq234/tfjs-mobilenet-253

View all activity

Organizations

None yet

upvoted a paper about 6 hours ago

AgentCompass: A Unified Evaluation Infrastructure for Agent Capabilities

Paper • 2607.13705 • Published 1 day ago • 6

upvoted a paper 5 days ago

SIEVE: Structure-Aware Data Selection for Imitation Learning with VLA Models

Paper • 2607.06442 • Published 10 days ago • 5

upvoted a paper 10 days ago

Xiaomi-GUI-0 Technical Report

Paper • 2606.31410 • Published 17 days ago • 20

upvoted a paper 28 days ago

Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models

Paper • 2606.11324 • Published Jun 9 • 171

upvoted 2 papers about 1 month ago

Imaginative Perception Tokens Enhance Spatial Reasoning in Multimodal Language Models

Paper • 2606.03988 • Published Jun 3 • 126

Not only where, But when: Temporal Scheduling for RLVR

Paper • 2605.25381 • Published May 25 • 6

upvoted 5 papers about 2 months ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published May 22 • 257

Learning to Act under Noise: Enhancing Agent Robustness via Noisy Environments

Paper • 2605.27209 • Published May 26 • 16

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

Paper • 2605.18703 • Published May 18 • 50

OCTOPUS: Optimized KV Cache for Transformers via Octahedral Parametrization Under optimal Squared error quantization

Paper • 2605.21226 • Published May 20 • 9

Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining

Paper • 2605.14747 • Published May 14 • 147

upvoted a paper 2 months ago

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 116

upvoted 3 papers 3 months ago

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Paper • 2604.13902 • Published Apr 15 • 62

WildDet3D: Scaling Promptable 3D Detection in the Wild

Paper • 2604.08626 • Published Apr 9 • 248

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 509

upvoted 5 papers 4 months ago

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

Paper • 2603.24329 • Published Mar 25 • 28

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published Mar 20 • 353

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published Mar 26 • 118

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 151

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 373