GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks Paper • 2510.04374 • Published Oct 5, 2025 • 1
PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination Paper • 2605.03571 • Published 24 days ago • 7
InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation? Paper • 2604.27419 • Published 29 days ago • 13
Beyond Quantity: Trajectory Diversity Scaling for Code Agents Paper • 2602.03219 • Published Feb 3 • 2
Benchmarks and Datasets Collection IP Intelligence Team Works about Benchmarks and Datasets. • 4 items • Updated 10 days ago • 1
AI for Patents Collection IP Intelligence Team Works about AI for Patents. • 1 item • Updated Feb 2 • 1
FlowPIE Collection Resources of Our Proposed Scientific Idea Generation Algorithm, FlowPIE. • 1 item • Updated Apr 1 • 2
FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration Paper • 2603.29557 • Published Mar 31 • 17
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper • 2510.14975 • Published Oct 16, 2025 • 86
Lingma SWE-GPT: An Open Development-Process-Centric Language Model for Automated Software Improvement Paper • 2411.00622 • Published Nov 1, 2024 • 3
Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR Paper • 2509.02522 • Published Sep 2, 2025 • 25
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner Paper • 2506.09003 • Published Jun 10, 2025 • 17
Model Merging in Pre-training of Large Language Models Paper • 2505.12082 • Published May 17, 2025 • 39
IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property Paper • 2504.15524 • Published Apr 22, 2025 • 3
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs Paper • 2504.15415 • Published Apr 21, 2025 • 23
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20, 2025 • 110
LLMs for Patent Collection Researches (Topic: LLMs4Patent) collection of Qiyao Wang. • 8 items • Updated 23 days ago • 1