Makise Kurisu

kurisu0306

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

upvoted a paper about 2 months ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

upvoted a paper about 2 months ago

Flow-OPD: On-Policy Distillation for Flow Matching Models

Organizations

None yet

upvoted a paper about 1 month ago

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Paper • 2605.26494 • Published May 26 • 41

upvoted 2 papers about 2 months ago

T^2PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning

Paper • 2605.02178 • Published May 4 • 10

Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published May 8 • 102

upvoted a collection about 2 months ago

Qwen3.5

Collection

21 items • Updated Mar 9 • 1.7k

upvoted a collection 2 months ago

DeepSeek-V4

Collection

6 items • Updated 2 days ago • 701

upvoted a paper 2 months ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published Apr 20 • 88

upvoted 3 papers 3 months ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published Apr 6 • 204

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 365

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

Paper • 2602.13367 • Published Feb 13 • 36

liked a model 3 months ago

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated Mar 25 • 5.11k • 1.14k

upvoted 2 papers 4 months ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 60

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

upvoted 8 papers 5 months ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published Feb 4 • 100

daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently

Paper • 2602.02619 • Published Feb 2 • 53

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published Feb 3 • 32
