--- pipeline_tag: feature-extraction library_name: transformers license: mit --- # VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning This repository provides the tokenizer used in the VenusFactory platform, described in [VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning](https://huggingface.co/papers/2503.15438). VenusFactory is a unified platform for protein engineering that integrates data retrieval, standardized task benchmarking, and modular fine-tuning of protein language models (PLMs). Code and further details are available at: https://github.com/tyang816/VenusFactory