---
language: en
license: mit
datasets:
- ronig/pdb_sequences
---
# PDB Protein BPE Tokenizer

A protein sequence tokenizer trained on [PDB Sequences](https://huggingface.co/datasets/ronig/pdb_sequences) with `vocabulary size = 1024`.
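
To illustrate what byte-pair encoding (BPE) does on protein sequences, here is a minimal pure-Python sketch. It is not the training code used for this tokenizer: the function names (`train_bpe`, `apply_merge`, `encode`), the toy sequences, and the tiny target vocabulary are all assumptions chosen for illustration. The idea is to start from single residues and repeatedly merge the most frequent adjacent pair until the vocabulary reaches the requested size (1024 for this tokenizer, 8 in the toy run below).

```python
from collections import Counter


def apply_merge(tokens, pair):
    """Replace every adjacent occurrence of `pair` in `tokens` with one merged token."""
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            merged.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged


def train_bpe(sequences, vocab_size):
    """Learn BPE merges until the vocabulary has `vocab_size` entries or no pairs remain."""
    # Start from single residues (one character per amino acid).
    corpus = [list(seq) for seq in sequences]
    vocab = sorted({tok for seq in corpus for tok in seq})
    merges = []
    while len(vocab) < vocab_size:
        pairs = Counter()
        for seq in corpus:
            pairs.update(zip(seq, seq[1:]))
        if not pairs:
            break  # every sequence has collapsed to a single token
        # Most frequent pair; ties are broken lexicographically for determinism.
        best = min(pairs, key=lambda p: (-pairs[p], p))
        merges.append(best)
        new_token = best[0] + best[1]
        if new_token not in vocab:
            vocab.append(new_token)
        corpus = [apply_merge(seq, best) for seq in corpus]
    return vocab, merges


def encode(sequence, merges):
    """Tokenize a sequence by replaying the learned merges in order."""
    tokens = list(sequence)
    for pair in merges:
        tokens = apply_merge(tokens, pair)
    return tokens


# Toy data (hypothetical sequences, not taken from the PDB dataset).
toy_sequences = ["MKVLA", "MKVLG", "MKTLA"]
vocab, merges = train_bpe(toy_sequences, vocab_size=8)
tokens = encode("MKVLA", merges)
print(tokens)  # ['MK', 'V', 'L', 'A']
```

In this toy run the seven distinct residues leave room for exactly one merge, and `MK` is chosen because it is the most frequent adjacent pair. A production tokenizer would typically also handle special tokens and use a far more efficient pair-counting scheme.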