EdinburghNLP/xsum
Viewer • Updated • 227k • 32k • 146
This is a Mixture-of-Experts (MoE) model for text summarization, trained on a small subset of the XSum dataset as a trial run. The model demonstrates the MoE architecture with 4 experts and top-2 routing.
import torch
from transformers import AutoTokenizer
# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained('facebook/bart-large-xsum')
# Load model (you'll need the MoE implementation)
# model = MoESummarizationModel.from_pretrained('vivekdhayaal/moe-xsum')
# Example usage
text = "Your input text here..."
# Generate summary with the model
This is a trial run model trained on only 10 samples for demonstration purposes. For production use, train on the full XSum dataset.
@misc{moe-xsum,
title={MoE Text Summarization Trial Model},
author={vivekdhayaal},
year={2024},
url={https://huggingface.co/vivekdhayaal/moe-xsum}
}