HuggingFaceTB/cosmopedia-100k
Viewer β’ Updated β’ 100k β’ 636 β’ 49
How to use qikp/pika-2 with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("qikp/pika-2", dtype="auto")π You are looking at pika 2, which incorporates the following changes:
pika is a simple and public domain-like tokenizer.
[EOS][PAD]pika was trained on the first 6K rows of a Cosmopedia sample.
Due to its small corpus, pika may split words into smaller pieces. Also, some uncommon special tokens aren't present, you'll have to add them manually if needed.