5 37 4

minghao

Liam-Liu

liam-liu-1b262631a

AI & ML interests

LLM, AD

Recent Activity

authored a paper 2 days ago

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

upvoted a paper 3 days ago

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

new activity 13 days ago

2077AIDataFoundation/KINA:Upload KINA.json

View all activity

Organizations

authored a paper 2 days ago

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

Paper • 2606.22883 • Published 4 days ago • 31

upvoted a paper 3 days ago

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents

Paper • 2606.22883 • Published 4 days ago • 31

New activity in 2077AIDataFoundation/KINA 13 days ago

Upload KINA.json

#5 opened 14 days ago by

cnxjs

authored 3 papers 15 days ago

TVIR: Building Deep Research Agents Towards Text--Visual Interleaved Report Generation

Paper • 2606.02320 • Published 25 days ago • 14

MMAE: A Massive Multitask Audio Editing Benchmark

Paper • 2606.07229 • Published 21 days ago • 45

Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning

Paper • 2606.07602 • Published 28 days ago • 6

authored a paper 28 days ago

WebCompass: Towards Multimodal Web Coding Evaluation for Code Language Models

Paper • 2604.18224 • Published Apr 20 • 22

upvoted a paper about 1 month ago

OProver: A Unified Framework for Agentic Formal Theorem Proving

Paper • 2605.17283 • Published May 17 • 31

published a dataset 2 months ago

2077AIDataFoundation/KINA

Viewer • Updated 13 days ago • 899 • 354 • 2

updated a dataset 2 months ago

2077AIDataFoundation/KINA

Viewer • Updated 13 days ago • 899 • 354 • 2

authored 2 papers 3 months ago

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 29

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Paper • 2604.04418 • Published Apr 6 • 1

upvoted 2 papers 3 months ago

Justified or Just Convincing? Error Verifiability as a Dimension of LLM Quality

Paper • 2604.04418 • Published Apr 6 • 1

ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding

Paper • 2603.27064 • Published Mar 28 • 29

updated a dataset 3 months ago

2077AIDataFoundation/ChartNet_RealWorldChart

Viewer • Updated Apr 3 • 30k • 205 • 2

published a dataset 3 months ago

2077AIDataFoundation/ChartNet_RealWorldChart

Viewer • Updated Apr 3 • 30k • 205 • 2

upvoted 2 papers 3 months ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published Jan 9 • 60

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 52

upvoted 2 papers 4 months ago

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

Paper • 2602.09514 • Published Feb 10 • 11

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

Paper • 2602.22675 • Published Feb 26 • 23

minghao

AI & ML interests

Recent Activity

Organizations

Liam-Liu's activity

Upload KINA.json