gbyuvd commited on
Commit
d378b8a
·
verified ·
1 Parent(s): f5b18a8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -49,10 +49,10 @@ Trained on ~2.7M valid SMILES built and curated from ChemBL34 (Zdrazil _et al._
49
 
50
  ## 🛠️ Implementation
51
 
52
- - **Algorithm**: Trie-based longest-prefix-match (no regex, no BPE)
53
  - **Caching**: `@lru_cache` for repeated string encoding
54
  - **HF Compatible**: Implements `__call__`, `encode_plus`, `batch_encode_plus`, `save_pretrained`, `from_pretrained`
55
- - **Memory Efficient**: No token set — pure trie traversal
56
 
57
  ```python
58
  from FastChemTokenizer import FastChemTokenizer
@@ -175,4 +175,4 @@ Apache 2.0
175
  pages = {D654-D659},
176
  doi = {10.1093/nar/gkac1008}
177
  }
178
- ```
 
49
 
50
  ## 🛠️ Implementation
51
 
52
+ - **Algorithm**: Trie-based longest-prefix-match
53
  - **Caching**: `@lru_cache` for repeated string encoding
54
  - **HF Compatible**: Implements `__call__`, `encode_plus`, `batch_encode_plus`, `save_pretrained`, `from_pretrained`
55
+ - **Memory Efficient**: Trie traversal and cache
56
 
57
  ```python
58
  from FastChemTokenizer import FastChemTokenizer
 
175
  pages = {D654-D659},
176
  doi = {10.1093/nar/gkac1008}
177
  }
178
+ ```