Spaces:

ItsMaxNorm
/

pc-bench

Sleeping

pc-bench / README.md

MAXNORM8650

Add PC-Bench leaderboard with benchmark results

291fb52 18 days ago

951 Bytes

	---
	title: PC-Bench
	emoji: 📚
	colorFrom: purple
	colorTo: blue
	sdk: gradio
	sdk_version: 6.3.0
	app_file: app.py
	pinned: false
	license: mit
	short_description: Paper Discovery Benchmark
	tags:
	- leaderboard
	- research
	- multi-agent
	- paper-retrieval
	---

	# PC-Bench: Paper Discovery Benchmark

	Leaderboard for evaluating AI agents on academic paper retrieval and analysis.

	## Benchmarks

	\| Benchmark \| Queries \| Description \|
	\|-----------\|---------\|-------------\|
	\| SemanticBench \| 50 \| Template-based semantic queries \|
	\| RAbench \| 500 \| LLM-perturbed natural queries \|

	## Metrics

	- MRR - Mean Reciprocal Rank
	- R@K - Recall at K (K=1,5,10,20,50)
	- Hit Rate - Successful retrieval percentage

	## Top Results

	\| Model \| Hit Rate \| MRR \| Time \|
	\|-------\|----------\|-----\|------\|
	\| Qwen3-Coder-30B \| 80% \| 0.627 \| 22s \|
	\| BM25 Baseline \| 78% \| 0.541 \| - \|

	## Links

	- [GitHub Repository](https://github.com/MAXNORM8650/papercircle)