Upload coedit-large ONNX model

Browse files

Files changed (6) hide show

README.md +49 -0
config.json +32 -0
decoder_model.onnx +3 -0
decoder_with_past_model.onnx +3 -0
encoder_model.onnx +3 -0
tokenizer.json +0 -0

README.md ADDED Viewed

	@@ -0,0 +1,49 @@

+---
+library_name: onnx
+tags:
+  - text2text-generation
+  - t5
+  - coedit
+  - grammar-correction
+  - encoder-decoder
+  - onnx
+  - inference4j
+license: apache-2.0
+pipeline_tag: text2text-generation
+---
+# CoEdIT Large — ONNX
+ONNX export of [CoEdIT Large](https://huggingface.co/grammarly/coedit-large) (780M parameters) with encoder-decoder architecture and KV cache support.
+CoEdIT is a T5-based model fine-tuned for text editing tasks including grammar correction, simplification, coherence, and paraphrasing.
+Converted for use with [inference4j](https://github.com/inference4j/inference4j), an inference-only AI library for Java.
+## Original Source
+- **Repository:** [grammarly/coedit-large](https://huggingface.co/grammarly/coedit-large)
+- **License:** Apache 2.0
+## Usage with inference4j
+```java
+try (var corrector = CoeditGrammarCorrector.coeditLarge().build()) {
+    System.out.println(corrector.correct("She don't likes swimming."));
+    // She doesn't like swimming.
+}
+```
+## Model Details
+| Property | Value |
+|----------|-------|
+| Architecture | T5 encoder-decoder (780M parameters) |
+| Task | Grammar correction, text editing |
+| Tokenizer | SentencePiece (32,128 tokens) |
+| Original framework | PyTorch (transformers) |
+| Export method | Hugging Face Optimum (encoder-decoder with KV cache) |
+## License
+This model is licensed under the [Apache License 2.0](https://www.apache.org/licenses/LICENSE-2.0). Original model by [Grammarly](https://huggingface.co/grammarly).

config.json ADDED Viewed

	@@ -0,0 +1,32 @@

+{
+  "architectures": [
+    "T5ForConditionalGeneration"
+  ],
+  "classifier_dropout": 0.0,
+  "d_ff": 2816,
+  "d_kv": 64,
+  "d_model": 1024,
+  "decoder_start_token_id": 0,
+  "dense_act_fn": "gelu_new",
+  "dropout_rate": 0.1,
+  "dtype": "float32",
+  "eos_token_id": 1,
+  "feed_forward_proj": "gated-gelu",
+  "initializer_factor": 1.0,
+  "is_encoder_decoder": true,
+  "is_gated_act": true,
+  "layer_norm_epsilon": 1e-06,
+  "model_type": "t5",
+  "n_positions": 512,
+  "num_decoder_layers": 24,
+  "num_heads": 16,
+  "num_layers": 24,
+  "output_past": true,
+  "pad_token_id": 0,
+  "relative_attention_max_distance": 128,
+  "relative_attention_num_buckets": 32,
+  "tie_word_embeddings": false,
+  "transformers_version": "4.57.6",
+  "use_cache": true,
+  "vocab_size": 32100
+}

decoder_model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:88eca8ec7b6851307bc1f529aef4d91d16628594a905a80d90f676868441dda0
+size 1899772475

decoder_with_past_model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:de52f72025365f7b20fff956a6aba0739c758596f46d4096e3ad22c7fdaf031f
+size 1698374979

encoder_model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9f85005973bab0025145b4f74500614b23725c0dc986a2c359da10f6012f707a
+size 1365181498

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff