Building on HF

jiakai PRO

real-jiakai

https://blog.gujiakai.top

AI & ML interests

LLM && Smart QA

Recent Activity

liked a dataset about 7 hours ago

openbmb/Ultra-FineWeb-L3

upvoted a paper 10 days ago

Code as Agent Harness

upvoted a paper 17 days ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

View all activity

Organizations

upvoted a paper 10 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 13 days ago • 210

upvoted a paper 17 days ago

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published 22 days ago • 80

upvoted a paper 22 days ago

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published 28 days ago • 116

upvoted a paper 25 days ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published 28 days ago • 166

upvoted a paper about 1 month ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 273

upvoted an article about 1 month ago

Article

DeepSeek-V4: a million-token context that agents can actually use

burtenshaw

•

Apr 24

• 47

upvoted a collection about 1 month ago

DeepSeek-V4

Collection

4 items • Updated Apr 24 • 661

upvoted an article about 1 month ago

Article

Meet HoloTab by HCompany. Your AI browser companion.

Hcompany

•

Apr 15

• 24

upvoted 4 papers about 1 month ago

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

Paper • 2604.12627 • Published Apr 14 • 101

upvoted 7 papers about 2 months ago

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

Paper • 2604.06628 • Published Apr 8 • 326

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 291

Think in Strokes, Not Pixels: Process-Driven Image Generation via Interleaved Reasoning

Paper • 2604.04746 • Published Apr 8 • 72

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Paper • 2604.06132 • Published Apr 7 • 121

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 123

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Paper • 2604.02029 • Published Apr 2 • 151

Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published Mar 31 • 96

upvoted an article about 2 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 902

jiakai PRO

AI & ML interests

Recent Activity

Organizations

real-jiakai's activity

DeepSeek-V4: a million-token context that agents can actually use

Meet HoloTab by HCompany. Your AI browser companion.

Welcome Gemma 4: Frontier multimodal intelligence on device