ApexOracle / configs /data /openwebtext-split.yaml
Kiria-Nozan's picture
solve same embedding bug
80ad4cd
raw
history blame contribute delete
160 Bytes
train: openwebtext-train
valid: openwebtext-valid
tokenizer_name_or_path: gpt2
cache_dir: /share/kuleshov/ssahoo/textdiffusion/data
wrap: True
streaming: False