ProtoCycle-7B / README.md
Huggggooo's picture
Update README.md
9ff14bd verified
metadata
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
base_model: Huggggooo/ProtoCycle-7B-SFT
tags:
  - protein-design
  - agentic
  - tool-use
  - qwen2.5
  - reinforcement-learning
  - grpo
language:
  - en

ProtoCycle-7B

RL checkpoint for ProtoCycle — an agentic protein design model that performs multi-step, tool-augmented sequence design.

This is the GRPO-TCR (Group Relative Policy Optimization with Tool-Call Reward) stage, initialised from the SFT checkpoint Huggggooo/ProtoCycle-7B-SFT.

See recipe/protein/reward.py for the exact formulation.

Training Data

10,000 RL prompts for GRPO-TCR training, available at Huggggooo/ProtoCycle-Data (rl/ subset).}

Agent Protocol

<think>  ... reasoning ...  </think>
<plan>   ... stage plan ...  </plan>
<tool_call>{"name": "...", "arguments": {...}}</tool_call>
...
<answer>MAEGEITPLKTF...</answer>

How to Use

See the ProtoCycle repository: ProtoCycle repo.

License

Apache-2.0.

Citation

If you find this work useful, please cite ProtoCycle (forthcoming) and the upstream frameworks: VeRL, Open-AgentRL, ProTrek, ESM.