---
library_name: onnx
tags:
  - text2text-generation
  - bart
  - summarization
  - encoder-decoder
  - onnx
  - inference4j
license: apache-2.0
pipeline_tag: summarization
---

# BART Large CNN — ONNX

ONNX export of [BART Large CNN](https://huggingface.co/facebook/bart-large-cnn) (406M parameters) with encoder-decoder architecture and KV cache support.

The model is fine-tuned for text summarization on the CNN/DailyMail dataset.

Converted for use with [inference4j](https://github.com/inference4j/inference4j), an inference-only AI library for Java.

## Original Source

- **Repository:** [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn)
- **License:** Apache 2.0

## Usage with inference4j

```java
try (var summarizer = BartSummarizer.bartLargeCnn().build()) {
    System.out.println(summarizer.summarize("Long article text..."));
}
```
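BART's learned position embeddings cover 1,024 tokens, so very long articles may not fit in a single pass. Whether inference4j truncates or rejects over-long input is not stated here, so the sketch below simply splits the text before summarizing. `ChunkedSummary`, `chunkByWords`, and `summarizeLong` are hypothetical helper names, not part of inference4j, and the word budget is only a rough proxy for the token count.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

// Hypothetical helper, not part of inference4j.
class ChunkedSummary {

    /** Splits text on whitespace into chunks of at most maxWords words. */
    public static List<String> chunkByWords(String text, int maxWords) {
        if (maxWords <= 0) {
            throw new IllegalArgumentException("maxWords must be positive");
        }
        List<String> chunks = new ArrayList<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return chunks; // nothing to summarize
        }
        String[] words = trimmed.split("\\s+");
        for (int start = 0; start < words.length; start += maxWords) {
            int end = Math.min(start + maxWords, words.length);
            chunks.add(String.join(" ", Arrays.copyOfRange(words, start, end)));
        }
        return chunks;
    }

    /** Summarizes each chunk with the supplied function and joins the results with spaces. */
    public static String summarizeLong(String text, int maxWords,
                                       Function<String, String> summarize) {
        List<String> parts = new ArrayList<>();
        for (String chunk : chunkByWords(text, maxWords)) {
            parts.add(summarize.apply(chunk));
        }
        return String.join(" ", parts);
    }
}
```

Assuming `summarize` accepts and returns a `String`, as the usage example suggests, a call could look like `ChunkedSummary.summarizeLong(article, 600, summarizer::summarize)`. Splitting on word count can cut a chunk mid-sentence, so a sentence-aware splitter may give better summaries.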

## Model Details

| Property | Value |
|----------|-------|
| Architecture | BART encoder-decoder (406M parameters, 12 encoder + 12 decoder layers) |
| Task | Text summarization |
| Training data | CNN/DailyMail |
| Tokenizer | BPE (50,265 tokens) |
| Original framework | PyTorch (transformers) |
| Export method | Hugging Face Optimum (encoder-decoder with KV cache) |

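The KV cache in the export matters during generation: the encoder runs once, and each decoder step receives only the newest token plus the keys and values cached from earlier steps, instead of re-processing the whole prefix. The toy sketch below illustrates that loop shape only. `KvCacheSketch` and `DecoderStep` are hypothetical names, integers stand in for tensors, and nothing here reflects the actual input or output names of the exported ONNX graphs.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only: a greedy decoding loop with a growing cache.
class KvCacheSketch {

    /** Hypothetical stand-in for one decoder forward pass. */
    interface DecoderStep {
        /** Reads and updates the cache, then returns the next token id. */
        int next(int lastToken, List<Integer> cache);
    }

    /** Generates tokens until eosToken appears or maxNewTokens is reached. */
    public static List<Integer> greedyDecode(DecoderStep step, int startToken,
                                             int eosToken, int maxNewTokens) {
        List<Integer> cache = new ArrayList<>();  // stands in for cached keys/values
        List<Integer> output = new ArrayList<>();
        int last = startToken;
        for (int i = 0; i < maxNewTokens; i++) {
            // Only the newest token is fed; earlier context lives in the cache.
            int next = step.next(last, cache);
            if (next == eosToken) {
                break;
            }
            output.add(next);
            last = next;
        }
        return output;
    }
}
```

With a toy step such as `(last, cache) -> { cache.add(last); return last + 1; }`, every call processes a single token while the cache grows by one entry, which is the property that keeps per-step cost from growing with the full prefix length.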
## License

This model is licensed under the [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0). Original model by [Facebook AI](https://huggingface.co/facebook).