root committed on
Commit 220fc0c · 1 Parent(s): 722557b

update readme

Files changed (2)
  1. README.md +11 -5
  2. img/output.png +3 -0
README.md CHANGED
@@ -9,9 +9,7 @@ library_name: tencent-song-generation
 # SongGeneration
 
 <p align="center"><img src="img/logo.jpg" width="40%"></p>
-<p align="center">
-<a href="https://levo-demo.github.io/">Demo</a> &nbsp;|&nbsp; <a href="https://arxiv.org/abs/2506.07520">Paper</a> &nbsp;|&nbsp; <a href="https://github.com/tencent-ailab/songgeneration">Code</a> &nbsp;|&nbsp; <a href="https://huggingface.co/spaces/tencent/SongGeneration">Space Demo</a>
-</p>
+[![Project Page](https://img.shields.io/badge/Project%20Page-GitHub-blue)](https://github.com/tencent-ailab/songgeneration) [![Hugging Face](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Model-blue)](https://huggingface.co/tencent/SongGeneration) [![Live Playground](https://img.shields.io/badge/Live%20PlayGround-Demo-orange)](https://huggingface.co/spaces/waytan22/SongGeneration-LeVo) [![Samples](https://img.shields.io/badge/Audio%20Samples-Page-green)](https://levo-demo.github.io/levo_v2_demo/)
 
 
 This repository is the official weight repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment. In this repository, we provide the SongGeneration model, inference scripts, and the checkpoint that has been trained on the Million Song Dataset.
@@ -30,9 +28,17 @@ This repository is the official weight repository for LeVo: High-Quality Song Ge
 
 ## Overview
 
-We develop the SongGeneration model. It is an LM-based framework consisting of **LeLM** and a **music codec**. LeLM is capable of parallelly modeling two types of tokens: mixed tokens, which represent the combined audio of vocals and accompaniment to achieve vocal-instrument harmony, and dual-track tokens, which separately encode vocals and accompaniment for high-quality song generation. The music codec reconstructs the dual-track tokens into highfidelity music audio. SongGeneration significantly improves over the open-source music generation models and performs competitively with current state-of-the-art industry systems. For more details, please refer to our [paper](https://arxiv.org/abs/2506.07520).
+🚀 We introduce LeVo 2 (SongGeneration 2), an open-source music foundation model designed to shatter the ceiling of open-source AI music by achieving true commercial-grade generation.
 
-<img src="https://github.com/tencent-ailab/songgeneration/blob/main/img/over.jpg?raw=true" alt="img" style="zoom:100%;" />
+Through a large-scale, rigorous expert evaluation (20 industry professionals, 6 core dimensions, 100 songs per model), LeVo 2 has proven its superiority:
+
+- 🏆 Commercial-Grade Musicality: Comprehensively outperforms all open-source baselines across Overall Quality, Melody, Arrangement, Sound Quality, and Structure. Its subjective generation quality successfully rivals top-tier closed-source commercial systems (e.g., MiniMax 2.5).
+- 🎯 Precise Lyric Accuracy: Achieves an outstanding Phoneme Error Rate (PER) of 8.55%, effectively solving the lyrical hallucination problem. This remarkable accuracy significantly outperforms top commercial models like Suno v5 (12.4%) and Mureka v8 (9.96%).
+- 🎛️ Exceptional Controllability: Highly responsive to multi-modal instructions, including text descriptions and audio prompts, allowing for precise control over the generated music.
+
+📊 *For detailed experimental setups and comprehensive metrics, please refer to the [Evaluation Performance](#Evaluation-Performance) section below or our upcoming technical report.*
+
+<img src="img/output.png" alt="img" style="zoom:100%;" />
 
 ## License
 
img/output.png ADDED

Git LFS Details

  • SHA256: 82db2add9eef950a8e81b34bea6b4505ce427745f4c05adc86c8430f12eaefca
  • Pointer size: 131 Bytes
  • Size of remote file: 110 kB