Update README.md
Browse files
README.md
CHANGED
|
@@ -22,8 +22,8 @@ model-index:
|
|
| 22 |
|
| 23 |
**Model ID:** `Chrode/H3BERTa`
|
| 24 |
**Architecture:** RoBERTa-base (encoder-only, Masked Language Model)
|
| 25 |
-
**Sequence type:** Heavy chain CDR-H3 regions
|
| 26 |
-
**Training:** Pretrained on >17M curated CDR-H3 sequences from healthy donor repertoires (OAS, IgG/IgA sources)
|
| 27 |
**Max sequence length:** 100 amino acids
|
| 28 |
**Vocabulary:** 25 tokens (20 standard amino acids + special tokens)
|
| 29 |
**Mask token:** `[MASK]`
|
|
@@ -129,7 +129,7 @@ If you use this model, please cite:
|
|
| 129 |
|
| 130 |
Rodella C. et al.
|
| 131 |
H3BERTa: A CDR-H3-specific language model for antibody repertoire analysis.
|
| 132 |
-
|
| 133 |
|
| 134 |
---
|
| 135 |
|
|
|
|
| 22 |
|
| 23 |
**Model ID:** `Chrode/H3BERTa`
|
| 24 |
**Architecture:** RoBERTa-base (encoder-only, Masked Language Model)
|
| 25 |
+
**Sequence type:** Heavy chain CDR-H3 regions
|
| 26 |
+
**Training:** Pretrained on >17M curated CDR-H3 sequences from healthy donor repertoires (OAS, IgG/IgA sources)
|
| 27 |
**Max sequence length:** 100 amino acids
|
| 28 |
**Vocabulary:** 25 tokens (20 standard amino acids + special tokens)
|
| 29 |
**Mask token:** `[MASK]`
|
|
|
|
| 129 |
|
| 130 |
Rodella C. et al.
|
| 131 |
H3BERTa: A CDR-H3-specific language model for antibody repertoire analysis.
|
| 132 |
+
- under review.
|
| 133 |
|
| 134 |
---
|
| 135 |
|