Improve model card with metadata and links

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +33 -3
README.md CHANGED
@@ -1,3 +1,33 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ ---
6
+
7
+ # MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
8
+
9
+ The model was presented in the paper [MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at
10
+ Scale](https://huggingface.co/papers/2506.04405).
11
+
12
+ # Paper abstract
13
+
14
+ We introduce MedAgentGYM, the first publicly available training environment
15
+ designed to enhance coding-based medical reasoning capabilities in large
16
+ language model (LLM) agents. MedAgentGYM comprises 72,413 task instances across
17
+ 129 categories derived from authentic real-world biomedical scenarios. Tasks
18
+ are encapsulated within executable coding environments, each featuring detailed
19
+ task descriptions, interactive feedback mechanisms, verifiable ground-truth
20
+ annotations, and scalable training trajectory generation. Extensive
21
+ benchmarking of over 30 LLMs reveals a notable performance disparity between
22
+ commercial API-based models and open-source counterparts. Leveraging
23
+ MedAgentGYM, Med-Copilot-7B achieves substantial performance gains through
24
+ supervised fine-tuning (+36.44%) and continued reinforcement learning
25
+ (+42.47%), emerging as an affordable and privacy-preserving alternative
26
+ competitive with gpt-4o. By offering both a comprehensive benchmark and
27
+ accessible, expandable training resources within unified execution
28
+ environments, MedAgentGYM delivers an integrated platform to develop LLM-based
29
+ coding assistants for advanced biomedical research and practice.
30
+
31
+ Project page: [https://huggingface.co/MedAgentGym](https://huggingface.co/MedAgentGym)
32
+
33
+ Code: [https://github.com/MedAgentGym/MedAgentGym](https://github.com/MedAgentGym/MedAgentGym) (Please replace with the actual link if different)