Adding ONNX file of this model

Beep boop I am the [ONNX export bot 🤖🏎️](https://huggingface.co/spaces/onnx/export). On behalf of [tehkehyong](https://huggingface.co/tehkehyong), I would like to add to this repository the model converted to ONNX.

What is ONNX? It stands for "Open Neural Network Exchange", and is the most commonly used open standard for machine learning interoperability. You can find out more at [onnx.ai](https://onnx.ai/)!

The exported ONNX model can be then be consumed by various backends as TensorRT or TVM, or simply be used in a few lines with 🤗 Optimum through ONNX Runtime, check out how [here](https://huggingface.co/docs/optimum/main/en/onnxruntime/usage_guides/models)!

Files changed (11) hide show

README.md +2 -0
onnx/config.json +34 -0
onnx/decoder_model.onnx +3 -0
onnx/decoder_model_merged.onnx +3 -0
onnx/decoder_with_past_model.onnx +3 -0
onnx/encoder_model.onnx +3 -0
onnx/generation_config.json +9 -0
onnx/preprocessor_config.json +9 -0
onnx/special_tokens_map.json +1 -0
onnx/tokenizer.json +0 -0
onnx/tokenizer_config.json +0 -0

README.md CHANGED Viewed

@@ -5,6 +5,8 @@ language:
 library_name: transformers
 pipeline_tag: automatic-speech-recognition
 arxiv: https://arxiv.org/abs/2410.15608
 ---
 # Moonshine

 library_name: transformers
 pipeline_tag: automatic-speech-recognition
 arxiv: https://arxiv.org/abs/2410.15608
+tags:
+- onnx
 ---
 # Moonshine

onnx/config.json ADDED Viewed

	@@ -0,0 +1,34 @@

+{
+  "_attn_implementation_autoset": true,
+  "architectures": [
+    "MoonshineForConditionalGeneration"
+  ],
+  "attention_bias": false,
+  "attention_dropout": 0.0,
+  "bos_token_id": 1,
+  "decoder_hidden_act": "silu",
+  "decoder_num_attention_heads": 8,
+  "decoder_num_hidden_layers": 8,
+  "decoder_num_key_value_heads": 8,
+  "decoder_start_token_id": 1,
+  "encoder_hidden_act": "gelu",
+  "encoder_num_attention_heads": 8,
+  "encoder_num_hidden_layers": 8,
+  "encoder_num_key_value_heads": 8,
+  "eos_token_id": 2,
+  "hidden_size": 416,
+  "initializer_range": 0.02,
+  "intermediate_size": 1664,
+  "is_encoder_decoder": true,
+  "max_position_embeddings": 194,
+  "model_type": "moonshine",
+  "pad_head_dim_to_multiple_of": 8,
+  "pad_token_id": 2,
+  "partial_rotary_factor": 0.62,
+  "rope_scaling": null,
+  "rope_theta": 10000.0,
+  "torch_dtype": "float32",
+  "transformers_version": "4.51.3",
+  "use_cache": true,
+  "vocab_size": 32768
+}

onnx/decoder_model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:787a50bff53d5e27c7ca33156d4c50275a5189203dd136b0bdcd76e90252b452
+size 220550817

onnx/decoder_model_merged.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:2843a351768822b22011efdde830033c6144620f7ce991b17376db6306cd69fc
+size 221115751

onnx/decoder_with_past_model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:feeadf4f0ec021cdebf953cab5b251fd8856889ed9180c30d083f6f150b1bebb
+size 209453359

onnx/encoder_model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:85ff09b3d810666d37d98a3dccc1025ee4722ea1a556b5b1d1bdd85b5f583a5f
+size 80900411

onnx/generation_config.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 1,
+  "decoder_start_token_id": 1,
+  "eos_token_id": 2,
+  "max_length": 194,
+  "pad_token_id": 2,
+  "transformers_version": "4.51.3"
+}

onnx/preprocessor_config.json ADDED Viewed

	@@ -0,0 +1,9 @@

+{
+  "do_normalize": false,
+  "feature_extractor_type": "Wav2Vec2FeatureExtractor",
+  "feature_size": 1,
+  "padding_side": "right",
+  "padding_value": 0.0,
+  "return_attention_mask": true,
+  "sampling_rate": 16000
+}

onnx/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {}

onnx/tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

onnx/tokenizer_config.json ADDED Viewed

The diff for this file is too large to render. See raw diff