This is a fine-tuned version of the ByGPT5 byte tokenized LLM. It was fine-tuned using conversational style sentences mined from the Colossal Clean Crawled Corpus and movie subtitle corpora.

See our EMNLP 2025 paper for details.

Downloads last month
1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for figmtu/bygpt5-aac

Finetuned
(1)
this model