Update README.md
Browse files
README.md
CHANGED
|
@@ -15,7 +15,7 @@ base_model: kyutai/moshiko-pytorch-bf16
|
|
| 15 |
pipeline_tag: audio-to-audio
|
| 16 |
---
|
| 17 |
|
| 18 |
-
#
|
| 19 |
**🎙️ [Try the live demo →](https://ai.joshtalks.com/research/josh1)**
|
| 20 |
|
| 21 |
Hindi-Moshi is the first full-duplex spoken dialogue model for Hindi, built by adapting [Kyutai's Moshi](https://github.com/kyutai-labs/moshi) architecture. It enables real-time, natural Hindi conversation with support for interruptions, overlaps, backchannels, and natural turn-taking — trained on 26,000 hours of real spontaneous Hindi conversations from 14,695 speakers.
|
|
@@ -120,9 +120,9 @@ Temperature τ=0.9 produces turn-taking dynamics closest to ground-truth.
|
|
| 120 |
| Model | τ | IPU/min | Pause | Gap | Overlap |
|
| 121 |
|---|---|---|---|---|---|
|
| 122 |
| Ground-truth | — | 35.30 | 10.49 | 8.51 | 3.03 |
|
| 123 |
-
|
|
| 124 |
-
|
|
| 125 |
-
|
|
| 126 |
|
| 127 |
## Conversation Style
|
| 128 |
|
|
|
|
| 15 |
pipeline_tag: audio-to-audio
|
| 16 |
---
|
| 17 |
|
| 18 |
+
# Human-1: A Full-Duplex Conversational Model for Hindi
|
| 19 |
**🎙️ [Try the live demo →](https://ai.joshtalks.com/research/josh1)**
|
| 20 |
|
| 21 |
Hindi-Moshi is the first full-duplex spoken dialogue model for Hindi, built by adapting [Kyutai's Moshi](https://github.com/kyutai-labs/moshi) architecture. It enables real-time, natural Hindi conversation with support for interruptions, overlaps, backchannels, and natural turn-taking — trained on 26,000 hours of real spontaneous Hindi conversations from 14,695 speakers.
|
|
|
|
| 120 |
| Model | τ | IPU/min | Pause | Gap | Overlap |
|
| 121 |
|---|---|---|---|---|---|
|
| 122 |
| Ground-truth | — | 35.30 | 10.49 | 8.51 | 3.03 |
|
| 123 |
+
| Human-1 | 0.8 | 23.12 | 9.16 | 6.77 | 1.67 |
|
| 124 |
+
| Human-1 | 0.9 | 29.14 | 9.24 | 8.54 | 4.30 |
|
| 125 |
+
| Human-1 | 1.0 | 38.90 | 11.67 | 8.10 | 9.68 |
|
| 126 |
|
| 127 |
## Conversation Style
|
| 128 |
|