distilbart-cnn-12-6 / README.md
vccarvalho11's picture
Upload distilbart-cnn-12-6 ONNX model
61ec627 verified
metadata
library_name: onnx
tags:
  - text2text-generation
  - bart
  - summarization
  - encoder-decoder
  - onnx
  - inference4j
license: apache-2.0
pipeline_tag: summarization

DistilBART CNN 12-6 — ONNX

ONNX export of DistilBART CNN 12-6 (306M parameters) with encoder-decoder architecture and KV cache support.

A distilled version of BART Large CNN, fine-tuned for text summarization on the CNN/DailyMail dataset.

Converted for use with inference4j, an inference-only AI library for Java.

Original Source

Usage with inference4j

try (var summarizer = BartSummarizer.distilBartCnn().build()) {
    System.out.println(summarizer.summarize("Long article text..."));
}

Model Details

Property Value
Architecture BART encoder-decoder (306M parameters, 12 encoder + 6 decoder layers)
Task Text summarization
Training data CNN/DailyMail
Tokenizer BPE (50,265 tokens)
Original framework PyTorch (transformers)
Export method Hugging Face Optimum (encoder-decoder with KV cache)

License

This model is licensed under the Apache License 2.0. Original model by Sam Shleifer.