IPBench

non-profit

https://ipbench.wangqiyao.me/

Activity Feed Request to join this org

AI & ML interests

LLMs for Intellectuall Property

Recent Activity

QiYao-Wang authored a paper about 2 months ago

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination

QiYao-Wang submitted a paper about 2 months ago

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination

QiYao-Wang authored a paper about 2 months ago

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

View all activity

authored a paper about 2 months ago

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination

Paper • 2605.03571 • Published May 5 • 7

submitted a paper to Daily Papers about 2 months ago

PatRe: A Full-Stage Office Action and Rebuttal Generation Benchmark for Patent Examination

Paper • 2605.03571 • Published May 5 • 7

authored a paper about 2 months ago

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

Paper • 2604.27419 • Published Apr 30 • 13

submitted a paper to Daily Papers about 2 months ago

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

Paper • 2604.27419 • Published Apr 30 • 13

authored 2 papers 3 months ago

Beyond Quantity: Trajectory Diversity Scaling for Code Agents

Paper • 2602.03219 • Published Feb 3 • 2

FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration

Paper • 2603.29557 • Published Mar 31 • 17

submitted a paper to Daily Papers 3 months ago

FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration

Paper • 2603.29557 • Published Mar 31 • 17

authored a paper 3 months ago

Learning Ordinal Probabilistic Reward from Preferences

Paper • 2602.12660 • Published Feb 13 • 3

authored a paper 10 months ago

A Survey on Large Language Model Benchmarks

Paper • 2508.15361 • Published Aug 21, 2025 • 19

authored a paper 10 months ago

A Survey on Large Language Model Benchmarks

Paper • 2508.15361 • Published Aug 21, 2025 • 19

updated a dataset 12 months ago

IPBench/IPBench

Viewer • Updated Jul 5, 2025 • 10.4k • 105 • 3

published a dataset about 1 year ago

IPBench/IPBench

Viewer • Updated Jul 5, 2025 • 10.4k • 105 • 3

authored a paper about 1 year ago

IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property

Paper • 2504.15524 • Published Apr 22, 2025 • 3

authored 3 papers over 1 year ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 110

AutoPatent: A Multi-Agent Framework for Automatic Patent Generation

Paper • 2412.09796 • Published Dec 13, 2024 • 3

IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language Models

Paper • 2406.12386 • Published Jun 18, 2024 • 1