Zhaoye Fei's picture

Zhaoye Fei

ngc7293

·

https://ngc7292.github.io/

AI & ML interests

NLP & Ro.

Recent Activity

authored a paper 13 days ago

MOSS-Audio Technical Report

updated a collection 13 days ago

upvoted a paper 13 days ago

MOSS-Audio Technical Report

View all activity

Organizations

authored a paper 13 days ago

MOSS-Audio Technical Report

Paper • 2606.01802 • Published 23 days ago • 2

updated a collection 13 days ago

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 9 items • Updated 13 days ago • 66

upvoted a paper 13 days ago

MOSS-Audio Technical Report

Paper • 2606.01802 • Published 23 days ago • 2

updated a collection 13 days ago

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 9 items • Updated 13 days ago • 66

liked a model 19 days ago

nex-agi/Nex-N2-Pro

Text Generation • 397B • Updated 14 days ago • 8.1k • 350

liked a model 23 days ago

OpenMOSS-Team/MOSS-VL-Instruct-0408

Video-Text-to-Text • 11B • Updated Apr 22 • 341 • 97

liked a model 29 days ago

OpenMOSS-Team/MOSS-TTS-v1.5

Text-to-Speech • 8B • Updated 30 days ago • 169k • 176

upvoted an article 30 days ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

kashif, stas

•

Mar 9

• 30

authored 2 papers about 1 month ago

MOSS-TTS Technical Report

Paper • 2603.18090 • Published Mar 18 • 16

World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published May 12 • 68

upvoted a paper about 1 month ago

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 54

liked a model about 1 month ago

OpenMOSS-Team/MOSS-Music-8B-Thinking

Audio-Text-to-Text • 9B • Updated May 1 • 359 • 31

upvoted 2 papers about 1 month ago

MOSS-TTS Technical Report

Paper • 2603.18090 • Published Mar 18 • 16

World Action Models: The Next Frontier in Embodied AI

Paper • 2605.12090 • Published May 12 • 68

upvoted a paper about 2 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 183

New activity in Soul-AILab/SoulX-Singer about 2 months ago

二维码过期了！！

#6 opened about 2 months ago by

upvoted a collection about 2 months ago

MOVA

3 items • Updated Apr 20 • 22

liked 2 models 2 months ago

openai/privacy-filter

Token Classification • 1B • Updated Apr 22 • 304k • • 1.67k

moonshotai/Kimi-K2.6

Image-Text-to-Text • 1.1T • Updated May 19 • 2.62M • • 1.48k

updated a collection 2 months ago

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 9 items • Updated 13 days ago • 66