Add pipeline tag, library name, link to paper and code

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +10 -7
README.md CHANGED
@@ -1,18 +1,21 @@
  ---
- license: apache-2.0
+ base_model:
+ - Qwen/Qwen2.5-7B-Instruct
  language:
  - zh
  - en
- base_model:
- - Qwen/Qwen2.5-7B-Instruct
+ license: apache-2.0
+ pipeline_tag: text-to-audio
+ library_name: transformers
  ---

+ ## Paper

+ The model was presented in the paper [VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model](https://huggingface.co/papers/2505.03739).

+ ## Github repository

-
-
-
+ The code for this model can be found at [https://github.com/VITA-MLLM/VITA-Audio](https://github.com/VITA-MLLM/VITA-Audio).

  ## ACCEPTABLE USE POLICY

@@ -31,7 +34,7 @@ Tencent endeavors to promote safe and fair use of its tools and features, includ
  10. To generate or disseminate personal identifiable information with the purpose of harming others;
  11. To generate or disseminate information (including images, code, posts, articles), and place the information in any public context (including –through the use of bot generated tweets), without expressly and conspicuously identifying that the information and/or content is machine generated;
  12. To impersonate another individual without consent, authorization, or legal right;
- 13. To make high-stakes automated decisions in domains that affect an individual’s safety, rights or wellbeing (e.g., law enforcement, migration, medicine/health, management of critical infrastructure, safety components of products, essential services, credit, employment, housing, education, social scoring, or insurance);
+ 13. To make high-stakes automated decisions in domains that affect an individual’s safety, rights or wellbeing (e.g., law enforcement, migration, medicine/health, management of critical infrastructure, safety components of products, essential services, credit, employment, housing, education, social scoring, or insurance);\
  14. In a manner that violates or disrespects the social ethics and moral standards of other countries or regions;
  15. To perform, facilitate, threaten, incite, plan, promote or encourage violent extremism or terrorism;
  16. For any use intended to discriminate against or harm individuals or groups based on protected characteristics or categories, online or offline social behavior or known or predicted personal or personality characteristics;
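
For reviewers who want to sanity-check the metadata block this change introduces, here is a minimal sketch in plain Python. It is not part of the PR or of any Hugging Face tooling: `parse_front_matter` and `NEW_FRONT_MATTER` are hypothetical names chosen for illustration, and the parser only handles the flat `key: value` and `key:` plus `- item` shapes that appear in this README, not YAML in general.

```python
# Hypothetical helper for illustration only; handles flat scalars and flat lists.
NEW_FRONT_MATTER = """\
---
base_model:
- Qwen/Qwen2.5-7B-Instruct
language:
- zh
- en
license: apache-2.0
pipeline_tag: text-to-audio
library_name: transformers
---
"""


def parse_front_matter(text):
    """Return the front-matter block of a README as a dict.

    Scalars become strings; a bare `key:` followed by `- item` lines
    becomes a list of strings. Anything after the closing `---` is ignored.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    meta = {}
    current_key = None  # key whose list items are currently being collected
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_key is not None:
            meta[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                meta[key] = value
                current_key = None
            else:
                meta[key] = []
                current_key = key
    return meta


meta = parse_front_matter(NEW_FRONT_MATTER)
print(meta)
```

The reordering of `base_model` and `license` in the diff does not affect the parsed result, since the keys are read into a mapping; only `pipeline_tag` and `library_name` are new.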