RyanLi0802 commited on
Commit
6aeab8d
·
verified ·
1 Parent(s): 058b008

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -1
README.md CHANGED
@@ -28,4 +28,15 @@ Read more about how the model is trained and evaluted in our [technical report](
28
  ```bash
29
  python -m sglang.launch_server --model-path RyanLi0802/Biomni-R0-Preview --port 30000 --host 0.0.0.0 --mem-fraction-static 0.8 --tp 2 --trust-remote-code --json-model-override-args '{"rope_scaling":{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}, "max_position_embeddings": 131072}
30
  ```
31
- Note, `rope_scaling` might degrade performance on tasks with shorter trajectories. Please tune the rope scaling factor according to your usage.
 
 
 
 
 
 
 
 
 
 
 
 
28
  ```bash
29
  python -m sglang.launch_server --model-path RyanLi0802/Biomni-R0-Preview --port 30000 --host 0.0.0.0 --mem-fraction-static 0.8 --tp 2 --trust-remote-code --json-model-override-args '{"rope_scaling":{"rope_type":"yarn","factor":4.0,"original_max_position_embeddings":32768}, "max_position_embeddings": 131072}
30
  ```
31
+ Note, `rope_scaling` might degrade performance on tasks with shorter trajectories. Please tune the rope scaling factor according to your usage.
32
+
33
+ # Citation
34
+ ```
35
+ @misc{biomnir0,
36
+ title = {Biomni-R0: Using RL to Hill-Climb Biomedical Reasoning Agents to Expert-Level},
37
+ author = {Ryan Li and Kexin Huang and Shiyi Cao and Yuanhao Qu and Jure Leskovec},
38
+ year = {2025},
39
+ month = {September},
40
+ note = {Technical Report}
41
+ }
42
+ ```