Spaces:

YLab-Open
/

BRIDGE-Medical-Leaderboard

Running

App Files Files Community

jiagengwu commited on Apr 30, 2025

Commit

ea9676f

verified ·

1 Parent(s): dbd134d

Update docs.md

Browse files

Files changed (1) hide show

docs.md +32 -20

docs.md CHANGED Viewed

@@ -24,12 +24,13 @@
 <h2>📜 Background</h2>
 <p>Recent advances in <strong>Large Language Models (LLMs)</strong> have demonstrated transformative potential in <strong>healthcare</strong>,  yet concerns remain around their reliability and clinical validity across diverse clinical tasks, specialties, and languages. To support timely and trustworthy evaluation, building upon our <a href="https://ai.nejm.org/doi/full/10.1056/AIra2400012">systematic review</a> of global clinical text resources, we introduce <a href="https://arxiv.org/abs/2504.19467">BRIDGE</a>, <strong>a multilingual benchmark that comprises 87 real-world clinical text tasks spanning nine languages and more than one million samples</strong>. Furthermore, we construct this leaderboard of LLM in clinical text understanding by systematically evaluating <strong>52 state-of-the-art LLMs</strong> (by 2025/04/28).</p>
-<div style="display: flex; align-items: center; justify-content: center; width: 100%; height: 500px;">
   <img
-    src="https://cdn-uploads.huggingface.co/production/uploads/67a040fb6934f9aa1c866f99/j5tJ9xh3t6U1JlqGbKbrj.png"
-    alt="HMS"
     style="max-width: 80%; max-height: 100%; object-fit: contain;"
   />
 </div>
@@ -44,7 +45,7 @@
 </ul>
 <p>In addition, BRIDGE offers multiple <strong>model filters</strong> and <strong>task filters</strong> to enable users to explore LLM performance across <strong>different clinical contexts</strong>, empowering researchers and clinicians to make informed decisions and track model advancements over time.</p>
-<div style="display: flex; align-items: center; justify-content: center; width: 100%; height: 450px;">
   <img
     src="https://cdn-uploads.huggingface.co/production/uploads/67a040fb6934f9aa1c866f99/xpyabfXWqacZD-ThQ5guU.jpeg"
     alt="HMS"
@@ -87,6 +88,11 @@ Importantly, all 87 datasets have been verified to be either fully open-access o
 </ul>
 We will review and evaluate your submission and update the leaderboard accordingly.
 <h2>🤝 Contributing</h2>
 <p>We welcome and greatly value contributions and collaborations from the community!
@@ -94,23 +100,27 @@ If you have clinical text datasets that you would like to share for broader expl
 <p>We are committed to expanding BRIDGE while strictly adhering to appropriate data use agreements and ethical guidelines. Let's work together to advance the responsible application of LLMs in medicine!</p>
-<h2>📢 Updates</h2>
-<ul>
-    <li>🗓️ 2025/04/28: BRIDGE Leaderboard V1.0.0 is now live!</li>
-    <li>🗓️ 2025/04/28: Our paper <a href="https://arxiv.org/abs/2504.19467">BRIDGE</a> is now available on arXiv!</li>
-</ul>
-<h2>📬 Contact Information</h2>
-<p>If you have any questions about BRIDGE or the leaderboard, feel free to reach out!</p>
-<ul>
-    <li><strong>Leaderboard Managers</strong>: Jiageng Wu (jiwu7@bwh.harvard.edu), Kevin Xie (kevinxie@mit.edu)</li>
-    <li><strong>Benchmark Managers</strong>: Jiageng Wu (jiwu7@bwh.harvard.edu), Bowen Gu (bogu@bwh.harvard.edu)</li>
-    <li><strong>Program Lead</strong>: Jie Yang (jyang66@bwh.harvard.edu)</li>
-</ul>
-<h2>📚 Citation</h2>
-<p>If you find this leaderboard useful for your research and applications, please cite the following papers:</p>
-<pre><code>@article{BRIDGE-benchmark,
     title={BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text},
     author={Wu, Jiageng and Gu, Bowen and Zhou, Ren and Xie, Kevin and Snyder, Doug and Jiang, Yixing and Carducci, Valentina and Wyss, Richard and Desai, Rishi J and Alsentzer, Emily and Celi, Leo Anthony and Rodman, Adam and Schneeweiss, Sebastian and Chen, Jonathan H. and Romero-Brufau, Santiago and Lin, Kueiyu Joshua and Yang, Jie},
     year={2025},
@@ -129,4 +139,6 @@ If you have clinical text datasets that you would like to share for broader expl
     year={2024},
     publisher={Massachusetts Medical Society}
 }
-</code></pre>

 <h2>📜 Background</h2>
 <p>Recent advances in <strong>Large Language Models (LLMs)</strong> have demonstrated transformative potential in <strong>healthcare</strong>,  yet concerns remain around their reliability and clinical validity across diverse clinical tasks, specialties, and languages. To support timely and trustworthy evaluation, building upon our <a href="https://ai.nejm.org/doi/full/10.1056/AIra2400012">systematic review</a> of global clinical text resources, we introduce <a href="https://arxiv.org/abs/2504.19467">BRIDGE</a>, <strong>a multilingual benchmark that comprises 87 real-world clinical text tasks spanning nine languages and more than one million samples</strong>. Furthermore, we construct this leaderboard of LLM in clinical text understanding by systematically evaluating <strong>52 state-of-the-art LLMs</strong> (by 2025/04/28).</p>
+This project is led and maintained by the team of <a href="https://ylab.top/">Prof. Jie Yang</a> and <a href="https://www.drugepi.org/team/joshua-kueiyu-lin">Prof. Kueiyu Joshua Lin</a> at Harvard Medical School and Brigham and Women's Hospital.
+<div style="display: flex; align-items: center; justify-content: center; width: 100%; height: auto;">
   <img
+    src="https://cdn-uploads.huggingface.co/production/uploads/633c70c4ccce04161f841c30/OLN3J8_Yq8dx_LrgjYSsC.png"
+    alt="dataset"
     style="max-width: 80%; max-height: 100%; object-fit: contain;"
   />
 </div>
 </ul>
 <p>In addition, BRIDGE offers multiple <strong>model filters</strong> and <strong>task filters</strong> to enable users to explore LLM performance across <strong>different clinical contexts</strong>, empowering researchers and clinicians to make informed decisions and track model advancements over time.</p>
+<div style="display: flex; align-items: center; justify-content: center; width: 100%; height: auto;">
   <img
     src="https://cdn-uploads.huggingface.co/production/uploads/67a040fb6934f9aa1c866f99/xpyabfXWqacZD-ThQ5guU.jpeg"
     alt="HMS"
 </ul>
 We will review and evaluate your submission and update the leaderboard accordingly.
+<h2>📢 Updates</h2>
+<ul>
+    <li>🗓️ 2025/04/28: BRIDGE Leaderboard V1.0.0 is now live!</li>
+    <li>🗓️ 2025/04/28: Our paper <a href="https://arxiv.org/abs/2504.19467">BRIDGE</a> is now available on arXiv!</li>
+</ul>
 <h2>🤝 Contributing</h2>
 <p>We welcome and greatly value contributions and collaborations from the community!
 <p>We are committed to expanding BRIDGE while strictly adhering to appropriate data use agreements and ethical guidelines. Let's work together to advance the responsible application of LLMs in medicine!</p>
+## 🚀 Donation
+BRIDGE is a non-profit, researcher-led benchmark that requires substantial resources (e.g., high-performance GPUs, a dedicated team) to sustain. To support open and impactful academic research that advances clinical care, we welcome your contributions. Please contact Prof. Jie Yang at <jyang66@bwh.harvard.edu> to discuss donation opportunities.</p>
+## 📬 Contact Information
+If you have any questions about BRIDGE or the leaderboard, feel free to reach out!
+- **Leaderboard Managers**: Jiageng Wu (<jiwu7@bwh.harvard.edu>), Kevin Xie (<kevinxie@mit.edu>), Bowen Gu (<bogu@bwh.harvard.edu>)
+- **Benchmark Managers**: Jiageng Wu (<jiwu7@bwh.harvard.edu>), Bowen Gu (<bogu@bwh.harvard.edu>)
+- **Project Lead**: Jie Yang (<jyang66@bwh.harvard.edu>)
+## 📚 Citation
+If you find this leaderboard useful for your research and applications, please cite the following papers:
+<pre style="white-space: pre-wrap; overflow-wrap: anywhere;">
+<code>@article{BRIDGE-benchmark,
     title={BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text},
     author={Wu, Jiageng and Gu, Bowen and Zhou, Ren and Xie, Kevin and Snyder, Doug and Jiang, Yixing and Carducci, Valentina and Wyss, Richard and Desai, Rishi J and Alsentzer, Emily and Celi, Leo Anthony and Rodman, Adam and Schneeweiss, Sebastian and Chen, Jonathan H. and Romero-Brufau, Santiago and Lin, Kueiyu Joshua and Yang, Jie},
     year={2025},
     year={2024},
     publisher={Massachusetts Medical Society}
 }
+</code></pre>
+If you use the datasets in BRIDGE, please also cite the original paper of datasets, which can be found in the our BRIDGE paper.