Upload wav2vec2-base-960h ONNX model

Browse files

Files changed (3) hide show

README.md +49 -0
model.onnx +3 -0
vocab.json +34 -0

README.md ADDED Viewed

	@@ -0,0 +1,49 @@

+---
+library_name: onnx
+tags:
+  - wav2vec2
+  - speech-to-text
+  - automatic-speech-recognition
+  - ctc
+  - audio
+  - onnx
+  - inference4j
+license: mit
+pipeline_tag: automatic-speech-recognition
+---
+# Wav2Vec2 Base 960h — ONNX
+ONNX export of [wav2vec2-base-960h](https://huggingface.co/Xenova/wav2vec2-base-960h), a Wav2Vec2 model fine-tuned on 960 hours of LibriSpeech for automatic speech recognition using CTC decoding.
+Mirrored for use with [inference4j](https://github.com/inference4j/inference4j), an inference-only AI library for Java.
+## Original Source
+- **Repository:** [Xenova (originally facebook/wav2vec2-base-960h)](https://huggingface.co/Xenova/wav2vec2-base-960h)
+- **License:** mit
+## Usage with inference4j
+```java
+try (Wav2Vec2 model = Wav2Vec2.fromPretrained("models/wav2vec2-base-960h")) {
+    Transcription result = model.transcribe(Path.of("audio.wav"));
+    System.out.println(result.text());
+}
+```
+## Model Details
+| Property | Value |
+|----------|-------|
+| Architecture | Wav2Vec2 Base (12 transformer layers) |
+| Task | Automatic speech recognition (CTC decoding) |
+| Training data | LibriSpeech 960h |
+| Input | 16kHz mono audio (float32 waveform) |
+| Output | CTC logits → greedy-decoded text |
+| Original framework | PyTorch (HuggingFace Transformers) |
+| ONNX export | By Xenova (Transformers.js) |
+## License
+This model is licensed under the [MIT License](https://opensource.org/licenses/MIT). Original model by [Facebook AI](https://huggingface.co/facebook/wav2vec2-base-960h), ONNX export by [Xenova](https://huggingface.co/Xenova).

model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e46614273f03ff4b87923a965e417fa72004825522cb007c9c25633b8475490d
+size 377887594

vocab.json ADDED Viewed

	@@ -0,0 +1,34 @@

+{
+  "'": 27,
+  "</s>": 2,
+  "<pad>": 0,
+  "<s>": 1,
+  "<unk>": 3,
+  "A": 7,
+  "B": 24,
+  "C": 19,
+  "D": 14,
+  "E": 5,
+  "F": 20,
+  "G": 21,
+  "H": 11,
+  "I": 10,
+  "J": 29,
+  "K": 26,
+  "L": 15,
+  "M": 17,
+  "N": 9,
+  "O": 8,
+  "P": 23,
+  "Q": 30,
+  "R": 13,
+  "S": 12,
+  "T": 6,
+  "U": 16,
+  "V": 25,
+  "W": 18,
+  "X": 28,
+  "Y": 22,
+  "Z": 31,
+  "|": 4
+}