BathSalt-1
/

daedalus_mobile

Text Generation

Model card Files Files and versions

BathSalt-1 commited on Jul 30, 2024

Commit

fe04db1

·

verified ·

1 Parent(s): 8b41d42

Create tokenizer.py

Files changed (1) hide show

tokenizer.py +13 -0

tokenizer.py ADDED Viewed

	@@ -0,0 +1,13 @@

+import torch
+from transformers import AutoTokenizer
+class DaedalusTokenizer(AutoTokenizer):
+    def __init__(self, config):
+        super(DaedalusTokenizer, self).__init__(config)
+        self.config = config
+    def encode(self, text):
+        return self.encode_plus(text, max_length=self.config.max_seq_length, padding='max_length', truncation=True)
+    def decode(self, ids):
+        return self.decode(ids, skip_special_tokens=True)