sharath25 commited on
Commit
4a62bb3
·
verified ·
1 Parent(s): e903772

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -9,6 +9,22 @@ tags:
9
  arxiv: 2602.23068
10
  ---
11
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  # Text-Acoustic Dual-Alignment Large Language Model
13
 
14
  TADA is a unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. By leveraging a novel tokenizer and architectural design, TADA achieves high-fidelity synthesis and generation with a fraction of the computational overhead required by traditional models.
 
9
  arxiv: 2602.23068
10
  ---
11
 
12
+ <h1 align="center">TADA: A Generative Framework for Speech Modeling via Text-Acoustic Dual Alignment</h1>
13
+
14
+ <p align="center">
15
+ <a href="https://arxiv.org/abs/2602.23068"><img src="https://img.shields.io/badge/arXiv-Paper-b31b1b.svg" alt="Paper"></a>
16
+ <a href="https://huggingface.co/spaces/HumeAI/tada"><img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Demo-blue" alt="Demo"></a>
17
+ <a href="https://huggingface.co/collections/HumeAI/tada-684390a52ed08a7717b5e702"><img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Collection-yellow" alt="Collection"></a>
18
+ <a href="https://pypi.org/project/hume-tada/"><img src="https://img.shields.io/badge/PyPI-hume--tada-3775A9.svg?logo=pypi&logoColor=white" alt="PyPI"></a>
19
+ <a href="https://github.com/HumeAI/tada/blob/main/LICENSE"><img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License"></a>
20
+ </p>
21
+
22
+ <img width="2400" height="1260" alt="image" src="https://github.com/user-attachments/assets/800eb8c5-eb6f-4e03-b8f3-150055a6cdfc" />
23
+
24
+ <p align="center"><br/><em>A unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment.</em></p>
25
+
26
+ ---
27
+
28
  # Text-Acoustic Dual-Alignment Large Language Model
29
 
30
  TADA is a unified speech-language model that synchronizes speech and text into a single, cohesive stream via 1:1 alignment. By leveraging a novel tokenizer and architectural design, TADA achieves high-fidelity synthesis and generation with a fraction of the computational overhead required by traditional models.