Huggggooo commited on
Commit
94447ba
·
verified ·
1 Parent(s): f1006cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -25
README.md CHANGED
@@ -33,15 +33,6 @@ Reward) stage**, initialised from the SFT checkpoint
33
  - Reward manager: `protein` (see
34
  [ProtoCycle/verl/workers/reward_manager/protein.py](https://github.com/huggggoooooo/ProtoCycle/blob/main/verl/workers/reward_manager/protein.py))
35
 
36
- ## Reward Signal
37
-
38
- The GRPO-TCR reward combines:
39
- 1. **Protocol compliance** — `<think>` / `<plan>` / `<tool_call>` / `<answer>`
40
- format.
41
- 2. **Tool-call quality** — per-step ProTrek score of the best sequence after
42
- each Stage-1 / Stage-3 tool call.
43
- 3. **Outcome signal** — final (global-best) ProTrek score vs the requirement.
44
- 4. **Efficiency** — penalty for excessively long rollouts.
45
 
46
  See
47
  [`recipe/protein/reward.py`](https://github.com/huggggoooooo/ProtoCycle/blob/main/recipe/protein/reward.py)
@@ -57,25 +48,11 @@ for the exact formulation.
57
  <answer>MAEGEITPLKTF...</answer>
58
  ```
59
 
60
- Registered tools (10):
61
- - **Stage 1:** `function2seq`, `pathway2seq`, `domain2seq`, `go2seq`,
62
- `dna_binding2seq`
63
- - **Stage 2:** `cofactor2constraints`, `motif2constraints`,
64
- `signal2constraints`
65
- - **Stage 3:** `esm_inpaint`, `get_score`
66
-
67
  ## How to Use
68
 
69
- Inference requires the biology tools shipped in the
70
- [ProtoCycle](https://github.com/huggggoooooo/ProtoCycle) repo:
71
 
72
- ```bash
73
- export MODEL_DIR=/path/to/ProtoCycle-7B # this checkpoint
74
- export MODEL_NAME=ProtoCycle-7B
75
- export CONDA_ROOT=/path/to/miniconda3
76
- # Also export PROTREK_*_DIR, ESM_MODEL_PATH, etc. (see ProtoCycle README)
77
- bash infer_tools.sh
78
- ```
79
 
80
  ## License
81
 
 
33
  - Reward manager: `protein` (see
34
  [ProtoCycle/verl/workers/reward_manager/protein.py](https://github.com/huggggoooooo/ProtoCycle/blob/main/verl/workers/reward_manager/protein.py))
35
 
 
 
 
 
 
 
 
 
 
36
 
37
  See
38
  [`recipe/protein/reward.py`](https://github.com/huggggoooooo/ProtoCycle/blob/main/recipe/protein/reward.py)
 
48
  <answer>MAEGEITPLKTF...</answer>
49
  ```
50
 
 
 
 
 
 
 
 
51
  ## How to Use
52
 
53
+ See the ProtoCycle repository:
54
+ [ProtoCycle](https://github.com/huggggoooooo/ProtoCycle) repo.
55
 
 
 
 
 
 
 
 
56
 
57
  ## License
58