HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 5 days ago • 71
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development Paper • 2601.11077 • Published 10 days ago • 63
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper • 2601.01554 • Published 22 days ago • 56 • 7
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 6 days ago • 34
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning Paper • 2402.06332 • Published Feb 9, 2024 • 19
Post • Happy birthday to me!!! • 2 replies
MOSS Transcribe Diarize Collection A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription. • 2 items • Updated 20 days ago • 2
DiRL: An Efficient Post-Training Framework for Diffusion Language Models Paper • 2512.22234 • Published Dec 23, 2025 • 20
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 28 days ago • 65
MOSS Transcribe Diarize Space • Transcribe audio/video files with speaker identification • Running • Featured • 48
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 213
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models Paper • 2511.15605 • Published Nov 19, 2025 • 24