AGI-Eval/Auto-ATT

@@ -5,8 +5,8 @@ language:
 base_model:
 - Qwen/Qwen2-Audio-7B-Instruct
 datasets:
-- AudioTuring/Audio-Turing-Test-Audios
-- AudioTuring/Audio-Turing-Test-Corpus
 pipeline_tag: audio-classification
 library_name: transformers
 ---
@@ -21,16 +21,16 @@ library_name: transformers
 ## About Audio Turing Test (ATT)
-ATT is an evaluation framework with a standardized human evaluation protocol and an accompanying dataset, aiming to resolve the lack of unified protocols in TTS evaluation and the difficulty in comparing multiple TTS systems. To further support the training and iteration of TTS systems, we utilized additional private evaluation data to train Auto-ATT model based on Qwen2-Audio-7B, enabling a model-as-a-judge approach for rapid evaluation of TTS systems on the ATT dataset. The datasets and Auto-ATT model can be cound in [ATT Collection](https://huggingface.co/collections/AudioTuring/audio-turing-test-6826e24d2197bf91fae6d7f5).
 ## Usage
-[Inference Code](https://github.com/AudioTuring/Auto-ATT-Inference)
 ## Datasets & Benchmarks
-See [ATT Collection](https://huggingface.co/collections/AudioTuring/audio-turing-test-6826e24d2197bf91fae6d7f5).
@@ -41,7 +41,7 @@ See [ATT Collection](https://huggingface.co/collections/AudioTuring/audio-turing
   author = {Wang, Xihuai and Zhao, Ziyi and Ren, Siyu and Zhang, Shao and Li, Song and Li, Xiaoyu and Wang, Ziwen and Qiu, Lin and Wan, Guanglu and Cao, Xuezhi and Cai, Xunliang and Zhang, Weinan},
   title = {Audio Turing Test: Benchmarking the Human-likeness and Naturalness of Large Language Model-based Text-to-Speech Systems in Chinese},
   year = {2025},
-  url = {https://huggingface.co/AudioTuring/Auto-ATT},
   publisher = {huggingface},
 }
 ```

 base_model:
 - Qwen/Qwen2-Audio-7B-Instruct
 datasets:
+- Meituan/Audio-Turing-Test-Audios
+- Meituan/Audio-Turing-Test-Corpus
 pipeline_tag: audio-classification
 library_name: transformers
 ---
 ## About Audio Turing Test (ATT)
+ATT is an evaluation framework with a standardized human evaluation protocol and an accompanying dataset, aiming to resolve the lack of unified protocols in TTS evaluation and the difficulty in comparing multiple TTS systems. To further support the training and iteration of TTS systems, we utilized additional private evaluation data to train Auto-ATT model based on Qwen2-Audio-7B, enabling a model-as-a-judge approach for rapid evaluation of TTS systems on the ATT dataset. The datasets and Auto-ATT model can be found in [ATT Collection](https://huggingface.co/collections/meituan/audio-turing-test-682446320368164faeaf38a4).
 ## Usage
+[Inference Code](https://github.com/Meituan/Auto-ATT-Inference)
 ## Datasets & Benchmarks
+See [ATT Collection](https://huggingface.co/collections/meituan/audio-turing-test-682446320368164faeaf38a4).
   author = {Wang, Xihuai and Zhao, Ziyi and Ren, Siyu and Zhang, Shao and Li, Song and Li, Xiaoyu and Wang, Ziwen and Qiu, Lin and Wan, Guanglu and Cao, Xuezhi and Cai, Xunliang and Zhang, Weinan},
   title = {Audio Turing Test: Benchmarking the Human-likeness and Naturalness of Large Language Model-based Text-to-Speech Systems in Chinese},
   year = {2025},
+  url = {https://huggingface.co/Meituan/Auto-ATT},
   publisher = {huggingface},
 }
 ```