七海阿部's picture

七海阿部

myuming

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

upvoted a paper 1 day ago

S-Bus: Automatic Read-Set Reconstruction for Multi-Agent LLM State Coordination

liked a dataset 5 days ago

m-a-p/FineFineWeb

View all activity

Organizations

None yet

upvoted a paper about 15 hours ago

KVServe: Service-Aware KV Cache Compression for Communication-Efficient Disaggregated LLM Serving

Paper • 2605.13734 • Published 10 days ago • 10

upvoted a paper 1 day ago

S-Bus: Automatic Read-Set Reconstruction for Multi-Agent LLM State Coordination

Paper • 2605.17076 • Published 7 days ago • 1

upvoted a paper 7 days ago

CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence

Paper • 2605.12882 • Published 10 days ago • 261

upvoted a paper 9 days ago

Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning

Paper • 2605.06241 • Published 16 days ago • 5

upvoted a paper 22 days ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published 23 days ago • 217

upvoted 4 papers about 1 month ago

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 263

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 364

PLUME: Latent Reasoning Based Universal Multimodal Embedding

Paper • 2604.02073 • Published Apr 2 • 15

upvoted 2 papers about 2 months ago

Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design

Paper • 2603.28376 • Published Mar 30 • 24

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

upvoted 3 papers 2 months ago

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 371

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Paper • 2603.16859 • Published Mar 17 • 248