This model is a fine-tuned version of ByGPT5, a byte-tokenized LLM. It was fine-tuned on conversational-style sentences mined from the Colossal Clean Crawled Corpus (C4) and movie subtitle corpora.

See our EMNLP 2025 paper for details.
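
As a rough illustration of what "byte-tokenized" means, the sketch below maps text to token ids by way of its UTF-8 bytes. It is not the model's actual tokenizer: the helper names `encode_bytes` and `decode_bytes` are hypothetical, and the offset of 3 reserved ids is an assumption borrowed from ByT5-style tokenizers that has not been confirmed for this model.

```python
# Minimal sketch of byte-level tokenization (illustrative only).
# ASSUMPTION: ids 0..2 are reserved for special tokens, as in ByT5-style
# tokenizers; the real tokenizer for this model may differ.
SPECIAL_OFFSET = 3


def encode_bytes(text: str, offset: int = SPECIAL_OFFSET) -> list[int]:
    """Map each UTF-8 byte of `text` to an id shifted past the reserved ids."""
    return [b + offset for b in text.encode("utf-8")]


def decode_bytes(ids: list[int], offset: int = SPECIAL_OFFSET) -> str:
    """Invert encode_bytes, skipping any reserved (special) ids."""
    raw = bytes(i - offset for i in ids if i >= offset)
    return raw.decode("utf-8", errors="ignore")


if __name__ == "__main__":
    text = "héllo"
    ids = encode_bytes(text)
    # Non-ASCII characters expand to several bytes, so a sequence can
    # contain more tokens than the text has characters.
    print(len(text), len(ids))
    print(decode_bytes(ids))
```

One consequence of this design is that no vocabulary file is needed and no input is ever out of vocabulary, at the cost of longer sequences than subword tokenizers produce.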
