kixx
/

LastingBench

kixx commited on Jun 25, 2025

Commit

88aaaa7

verified ·

1 Parent(s): 474831f

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,3 +1,11 @@
 # LastingBench: Defend Benchmarks Against Knowledge Leakage.
 Welcome to the repository for the research paper: "LastingBench: Defend Benchmarks Against Knowledge Leakage." This project addresses the growing concern about large language models (LLMs) "cheating" on standard Question Answering (QA) benchmarks by memorizing task-specific data, which undermines the validity of benchmark evaluations as they no longer reflect genuine model capabilities but instead the effects of data leakage.

+---
+title: "LastingBench: Defend Benchmarks Against Knowledge Leakage"
+tags:
+  - paper
+  - benchmark
+license: cc-by-4.0
+---
 # LastingBench: Defend Benchmarks Against Knowledge Leakage.
 Welcome to the repository for the research paper: "LastingBench: Defend Benchmarks Against Knowledge Leakage." This project addresses the growing concern about large language models (LLMs) "cheating" on standard Question Answering (QA) benchmarks by memorizing task-specific data, which undermines the validity of benchmark evaluations as they no longer reflect genuine model capabilities but instead the effects of data leakage.