CoverageBench: Evaluating Information Coverage across Tasks and Domains Paper • 2603.20034 • Published Mar 20 • 2
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings Paper • 2509.04011 • Published Sep 4, 2025 • 29
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings Paper • 2509.04011 • Published Sep 4, 2025 • 29
NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings Paper • 2509.04011 • Published Sep 4, 2025 • 29 • 2
WideSearch: Benchmarking Agentic Broad Info-Seeking Paper • 2508.07999 • Published Aug 11, 2025 • 113
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper • 2504.17502 • Published Apr 24, 2025 • 55
Charting and Navigating Hugging Face's Model Atlas Paper • 2503.10633 • Published Mar 13, 2025 • 94
Running 601 Scaling test-time compute 📈 601 Boost LLM answers with flexible test‑time search strategies