Reza8848/MUFFIN_68k
Viewer • Updated • 1.46k • 49 • 10
How to use Reza8848/MUFFIN-T5-11B with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("Reza8848/MUFFIN-T5-11B")
model = AutoModelForSeq2SeqLM.from_pretrained("Reza8848/MUFFIN-T5-11B")This is the model weight of MUFFIN-T5-11B (Multi-Faceted Instructions).
We fine-tune the T5-11B model on our MUFFIN dataset.
We released both 3B and 11B models:
| Model | Number of parameters |
|---|---|
| MUFFIN-T5-3B | 3 billion |
| MUFFIN-T5-11B | 11 billion |
Please refer to MUFFIN-T5-3B for detailed documentation.
Please kindly cite our paper if you use any resources in this repository:
@inproceedings{Lou2023MUFFIN,
title={{MUFFIN}: Curating Multi-Faceted Instructions for Improving Instruction Following},
author={Renze Lou and Kai Zhang and Jian Xie and Yuxuan Sun and Janice Ahn and Hanzi Xu and Yu su and Wenpeng Yin},
booktitle={The Twelfth International Conference on Learning Representations},
year={2024},
url={https://openreview.net/forum?id=1vrS1zwekw}
}