Upload folder using huggingface_hub

Browse files

Files changed (4) hide show

.gitattributes +2 -0
README.md +60 -0
poem.wav +3 -0
sortformer.pte +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+poem.wav filter=lfs diff=lfs merge=lfs -text
+sortformer.pte filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,60 @@

+---
+license: apache-2.0
+tags:
+  - executorch
+  - xnnpack
+  - speaker-diarization
+  - on-device
+  - streaming
+pipeline_tag: audio-classification
+base_model: nvidia/diar_streaming_sortformer_4spk-v2
+---
+# Sortformer-ExecuTorch-XNNPACK
+Pre-exported [ExecuTorch](https://github.com/pytorch/executorch) `.pte` file
+for [Streaming Sortformer](https://huggingface.co/nvidia/diar_streaming_sortformer_4spk-v2)
+with **XNNPACK** backend (CPU). A streaming speaker diarization model that
+identifies up to 4 speakers in audio.
+## Installation
+```bash
+git clone https://github.com/pytorch/executorch/ ~/executorch
+cd ~/executorch && ./install_executorch.sh
+make sortformer-cpu
+```
+## Download
+```bash
+pip install huggingface_hub
+huggingface-cli download younghan-meta/Sortformer-ExecuTorch-XNNPACK --local-dir ~/sortformer
+```
+## Run
+```bash
+cmake-out/examples/models/sortformer/sortformer_runner \
+    --model_path ~/sortformer/sortformer.pte \
+    --audio_path ~/sortformer/poem.wav
+```
+Output shows detected speaker segments with start/end times.
+Optional flags:
+- `--threshold 0.5` -- speaker activity threshold (0.0-1.0)
+- `--chunk_len 124` -- encode chunk size in 80ms frames
+- `--fifo_len 124` -- FIFO buffer size in 80ms frames
+## Export Command
+```bash
+pip install "nemo_toolkit[asr]"
+python examples/models/sortformer/export_sortformer.py --backend xnnpack --output-dir ./sortformer_exports
+```
+## More Info
+- [Official ExecuTorch Sortformer guide](https://github.com/pytorch/executorch/tree/main/examples/models/sortformer)
+- [Original model](https://huggingface.co/nvidia/diar_streaming_sortformer_4spk-v2)

poem.wav ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0dd03dfb6fe83b7d10df166cb77d28bf139f9be2c739e9927c757d88255aa88b
+size 768042

sortformer.pte ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e763fae031bc8675252f2d8de0e84ff71992db4eb04257e4a50b43c9b31a77c1
+size 492384528