biomni
/

Biomni-R0-32B-Preview

Model card Files Files and versions

RyanLi0802 commited on Oct 7, 2025

Commit

6aeab8d

·

verified ·

1 Parent(s): 058b008

Update README.md

Files changed (1) hide show

README.md +12 -1

README.md CHANGED Viewed

@@ -28,4 +28,15 @@ Read more about how the model is trained and evaluted in our [technical report](
 ```bash
 python -m sglang.launch_server --model-path RyanLi0802/Biomni-R0-Preview --port 30000 --host 0.0.0.0 --mem-fraction-static 0.8 --tp 2 --trust-remote-code --json-model-override-args '{"rope_scaling":{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}, "max_position_embeddings": 131072}
 ```
-Note, `rope_scaling` might degrade performance on tasks with shorter trajectories. Please tune the rope scaling factor according to your usage.

 ```bash
 python -m sglang.launch_server --model-path RyanLi0802/Biomni-R0-Preview --port 30000 --host 0.0.0.0 --mem-fraction-static 0.8 --tp 2 --trust-remote-code --json-model-override-args '{"rope_scaling":{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}, "max_position_embeddings": 131072}
 ```
+Note, `rope_scaling` might degrade performance on tasks with shorter trajectories. Please tune the rope scaling factor according to your usage.
+# Citation
+```
+@misc{biomnir0,
+  title     = {Biomni-R0: Using RL to Hill-Climb Biomedical Reasoning Agents to Expert-Level},
+  author    = {Ryan Li and Kexin Huang and Shiyi Cao and Yuanhao Qu and Jure Leskovec},
+  year      = {2025},
+  month     = {September},
+  note      = {Technical Report}
+}
+```