1 80 7

Shijie Geng PRO

makitanikaze

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

authored a paper about 2 months ago

T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

upvoted a paper about 2 months ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

View all activity

Organizations

None yet

authored 2 papers about 2 months ago

CLIP-Adapter: Better Vision-Language Models with Feature Adapters

Paper • 2110.04544 • Published Oct 9, 2021

T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

upvoted a paper about 2 months ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

upvoted 2 papers 3 months ago

MiroThinker-1.7 & H1: Towards Heavy-Duty Research Agents via Verification

Paper • 2603.15726 • Published Mar 16 • 187

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 189

upvoted a paper 6 months ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 233

upvoted a collection 6 months ago

Self-Correcting Delta Transformer - Adaptive LLMs

Collection

Self-Correcting Delta Transformer - DDL provides the Hardware mechanism (The Erazor), NL solves the software problem. • 3 items • Updated Jan 16 • 2

upvoted 7 papers 6 months ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 155

Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published Dec 31, 2025 • 119

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110

updated a collection 6 months ago

gui agent

Collection

5 items • Updated Dec 19, 2025

upvoted 5 papers 6 months ago

Step-GUI Technical Report

Paper • 2512.15431 • Published Dec 17, 2025 • 134

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 39

DoPE: Denoising Rotary Position Embedding

Paper • 2511.09146 • Published Nov 12, 2025 • 98

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 116

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 137

Shijie Geng PRO

AI & ML interests

Recent Activity

Organizations

makitanikaze's activity