Teradata
/

opus-mt_tiny_tur-eng

@@ -33,7 +33,6 @@ packaged for use with the Teradata `mldb.ONNXSeq2Seq` BYOM function.
 **This repository does not redistribute the original model weights.** It contains only:
 - `onnx/model-fp32.onnx` — full-precision ONNX graph
-- `onnx/model-int8.onnx` — dynamically quantized ONNX graph
 - `tokenizer.json` — repacked Marian tokenizer suitable for BYOM
 - `config.json` — model architecture metadata, copied unchanged from the upstream repo
 - `generation_config.json` — generation defaults, copied unchanged from the upstream repo
@@ -50,7 +49,7 @@ For the original PyTorch weights and training details, see the upstream model:
 | Architecture | MarianMT (encoder-decoder) |
 | Max input tokens | 256 |
 | Max output tokens | 512 |
-| ONNX file sizes | fp32 (177 MB), int8 (94 MB) |
 | ONNX opset | 14 |
 | ONNX IR version | 8 (BYOM 7.0+ compatible) |
 | License | Apache-2.0 (from upstream) |
@@ -121,15 +120,12 @@ FROM mldb.ONNXSeq2Seq(
 print(tdml.DataFrame.from_query(query))
 ```
-An int8-quantized variant is also published as `onnx/model-int8.onnx`. The int8 variant does not accept `num_beams` (configured internally).
 ## How this model was converted
 This model was produced with the open-source
 [`teradata-opus-translate`](https://pypi.org/project/teradata-opus-translate/)
 package, which exports the encoder/decoder, stitches in the BeamSearch op,
-applies dynamic int8 quantization, and verifies parity against PyTorch on a
-small sample set.
 > **Note:** the same package can convert *any* Helsinki-NLP MarianMT model
 > (including ones not in this collection) to a BYOM-ready ONNX bundle. If

 **This repository does not redistribute the original model weights.** It contains only:
 - `onnx/model-fp32.onnx` — full-precision ONNX graph
 - `tokenizer.json` — repacked Marian tokenizer suitable for BYOM
 - `config.json` — model architecture metadata, copied unchanged from the upstream repo
 - `generation_config.json` — generation defaults, copied unchanged from the upstream repo
 | Architecture | MarianMT (encoder-decoder) |
 | Max input tokens | 256 |
 | Max output tokens | 512 |
+| ONNX file size | 177 MB |
 | ONNX opset | 14 |
 | ONNX IR version | 8 (BYOM 7.0+ compatible) |
 | License | Apache-2.0 (from upstream) |
 print(tdml.DataFrame.from_query(query))
 ```
 ## How this model was converted
 This model was produced with the open-source
 [`teradata-opus-translate`](https://pypi.org/project/teradata-opus-translate/)
 package, which exports the encoder/decoder, stitches in the BeamSearch op,
+and verifies parity against PyTorch on a small sample set.
 > **Note:** the same package can convert *any* Helsinki-NLP MarianMT model
 > (including ones not in this collection) to a BYOM-ready ONNX bundle. If