manglu3935 commited on
Commit
41d6c4b
·
verified ·
1 Parent(s): e238c0f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +64 -3
README.md CHANGED
@@ -1,3 +1,64 @@
1
- ---
2
- license: cc-by-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-4.0
3
+ ---
4
+
5
+ # 🧬 Thoth-mini
6
+
7
+ **Thoth-mini** is a lightweight version of Thoth, designed for **efficient and scalable biological protocol generation** while retaining strong scientific reasoning ability.
8
+
9
+ - 📄 **Paper**: *Unleashing Scientific Reasoning for Bio-experimental Protocol Generation via Structured Component-based Reward Mechanism* (ICLR 2026)
10
+ - 🔗 **GitHub**: https://github.com/manglu097/Thoth
11
+ - 🤗 **Dataset**: https://huggingface.co/datasets/manglu3935/SciRecipe
12
+
13
+ ---
14
+
15
+ ## 🔍 Model Overview
16
+
17
+ - **Base model**: Qwen3-4B
18
+ - **Parameters**: 4B
19
+ - **GPU memory**: ~8GB
20
+ - **Primary task**: Biological experimental protocol generation
21
+
22
+ Thoth-mini is trained with the same **Sketch-and-Fill paradigm** and **SCORE reward mechanism** as Thoth, offering a strong performance–efficiency trade-off.
23
+
24
+ ---
25
+
26
+ ## 🧠 Output Format
27
+
28
+ ```
29
+ <think> reasoning and planning </think>
30
+ <key> structured machine-readable steps </key>
31
+ <orc> natural language protocol </orc>
32
+ <note> optional safety notes </note>
33
+ ```
34
+
35
+ ---
36
+
37
+ ## 🚀 Usage
38
+
39
+ ```python
40
+ from transformers import AutoModelForCausalLM, AutoTokenizer
41
+
42
+ tokenizer = AutoTokenizer.from_pretrained("manglu3935/Thoth-mini")
43
+ model = AutoModelForCausalLM.from_pretrained("manglu3935/Thoth-mini")
44
+ ```
45
+
46
+ ---
47
+
48
+ ## ⚠️ Intended Use
49
+
50
+ For fast scientific reasoning experiments and scalable research deployment.
51
+ Generated protocols must be reviewed by qualified experts prior to laboratory execution.
52
+
53
+ ---
54
+
55
+ ## 📖 Citation
56
+
57
+ ```bibtex
58
+ @article{sun2025unleashing,
59
+ title={Unleashing Scientific Reasoning for Bio-experimental Protocol Generation via Structured Component-based Reward Mechanism},
60
+ author={Sun, Haoran and Jiang, Yankai and Tang, Zhenyu and others},
61
+ journal={arXiv preprint arXiv:2510.15600},
62
+ year={2025}
63
+ }
64
+ ```