kixx
/

LastingBench

kixx commited on Jun 26, 2025

Commit

007c5e6

verified ·

1 Parent(s): 1e35a15

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -6,11 +6,20 @@ tags:
 license: cc-by-4.0
 ---
-# LastingBench: Defend Benchmarks Against Knowledge Leakage.
-<iframe src="https://huggingface.co/kixx/LastingBench/resolve/main/paper.pdf"
-        width="100%"
-        height="700"></iframe>
 Welcome to the repository for the research paper: "LastingBench: Defend Benchmarks Against Knowledge Leakage." This project addresses the growing concern about large language models (LLMs) "cheating" on standard Question Answering (QA) benchmarks by memorizing task-specific data, which undermines the validity of benchmark evaluations as they no longer reflect genuine model capabilities but instead the effects of data leakage.

 license: cc-by-4.0
 ---
+# 📄 Paper
+<iframe
+  src="https://huggingface.co/kixx/LastingBench/resolve/main/paper.pdf#toolbar=0"
+  width="100%"
+  height="900"
+  style="border:none;">
+</iframe>
+<!-- 兼容备用： -->
+<p><a href="https://huggingface.co/kixx/LastingBench/resolve/main/paper.pdf">📥 Download the PDF</a></p>
+# LastingBench: Defend Benchmarks Against Knowledge Leakage.
 Welcome to the repository for the research paper: "LastingBench: Defend Benchmarks Against Knowledge Leakage." This project addresses the growing concern about large language models (LLMs) "cheating" on standard Question Answering (QA) benchmarks by memorizing task-specific data, which undermines the validity of benchmark evaluations as they no longer reflect genuine model capabilities but instead the effects of data leakage.