💼 Hiring

Kai Hua

kkish

https://kifish.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

AIR: Post-training Data Selection for Reasoning via Attention Head Influence

upvoted a paper 9 days ago

LLMs are Also Effective Embedding Models: An In-depth Overview

upvoted a paper 11 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Organizations

upvoted 2 papers 9 days ago

AIR: Post-training Data Selection for Reasoning via Attention Head Influence

Paper • 2512.13279 • Published Dec 15, 2025 • 2

LLMs are Also Effective Embedding Models: An In-depth Overview

Paper • 2412.12591 • Published Dec 17, 2024 • 2

upvoted 2 papers 11 days ago

MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling

Paper • 2606.13473 • Published 14 days ago • 90

MiniMax Sparse Attention

Paper • 2606.13392 • Published 14 days ago • 146

upvoted a paper about 1 month ago

GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment

Paper • 2605.19577 • Published May 19 • 59

upvoted a collection about 1 month ago

Qwen3-Reranker

Collection

3 items • Updated Dec 31, 2025 • 71

upvoted 3 papers about 1 month ago

OProver: A Unified Framework for Agentic Formal Theorem Proving

Paper • 2605.17283 • Published May 17 • 31

STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?

Paper • 2605.06527 • Published May 7 • 46

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published May 13 • 88

upvoted 2 papers 3 months ago

In-Place Test-Time Training

Paper • 2604.06169 • Published Apr 7 • 30

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

Paper • 2603.11103 • Published Mar 11 • 9

upvoted a paper 5 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11, 2025 • 157

upvoted a collection 5 months ago

OpenResearcher

Collection

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis • 8 items • Updated Mar 24 • 18

upvoted 2 papers 5 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 269

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 60

upvoted 2 papers 6 months ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 66

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 52

upvoted 2 papers 7 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 306

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 39

upvoted a paper 8 months ago

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

Paper • 2511.04285 • Published Nov 6, 2025 • 8
