nvidia/diar_sortformer_4spk-v1

Update Readme

#11

by jbalam-nv - opened 16 days ago

base: refs/heads/main ← from: refs/pr/11

README.md +13 -1

@@ -111,7 +111,7 @@ img {
 <!-- | [![Language](https://img.shields.io/badge/Language-multilingual-lightgrey#model-badge)](#datasets) -->
-[Sortformer](https://arxiv.org/abs/2409.06656)[1] is a novel end-to-end neural model for speaker diarization, trained with unconventional objectives compared to existing end-to-end diarization models.
 <div align="center">
     <img src="sortformer_intro.png" width="750" />
@@ -119,6 +119,18 @@ img {
 Sortformer resolves permutation problem in diarization following the arrival-time order of the speech segments from each speaker.
 ## Model Architecture
 Sortformer consists of an L-size (18 layers) [NeMo Encoder for

 <!-- | [![Language](https://img.shields.io/badge/Language-multilingual-lightgrey#model-badge)](#datasets) -->
+NVIDIA [Sortformer](https://arxiv.org/abs/2409.06656)[1] is a novel end-to-end neural model for speaker diarization, trained with unconventional objectives compared to existing end-to-end diarization models.
 <div align="center">
     <img src="sortformer_intro.png" width="750" />
 Sortformer resolves permutation problem in diarization following the arrival-time order of the speech segments from each speaker.
+## Discover more from NVIDIA:
+For documentation, deployment guides, enterprise-ready APIs, and the latest open models—including Nemotron and other cutting-edge speech, translation, and generative AI—visit the NVIDIA Developer Portal at [developer.nvidia.com](https://developer.nvidia.com).
+Join the community to access tools, support, and resources to accelerate your development with NVIDIA’s NeMo, Riva, NIM, and foundation models.<br>
+### Explore more from NVIDIA:  <br>
+What is [Nemotron](https://www.nvidia.com/en-us/ai-data-science/foundation-models/nemotron/)?<br>
+NVIDIA Developer [Nemotron](https://developer.nvidia.com/nemotron)<br>
+[NVIDIA Riva Speech](https://developer.nvidia.com/riva?sortBy=developer_learning_library%2Fsort%2Ffeatured_in.riva%3Adesc%2Ctitle%3Aasc#demos)<br>
+[NeMo Documentation](https://docs.nvidia.com/nemo-framework/user-guide/latest/nemotoolkit/asr/models.html)<br>
 ## Model Architecture
 Sortformer consists of an L-size (18 layers) [NeMo Encoder for