Revisiting the Shape Convention of Transformer Language Models Paper • 2602.06471 • Published Feb 6 • 4
Revisiting the Shape Convention of Transformer Language Models Paper • 2602.06471 • Published Feb 6 • 4
Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition Paper • 2405.14259 • Published May 23, 2024 • 2
RAD-Bench: Evaluating Large Language Models Capabilities in Retrieval Augmented Dialogues Paper • 2409.12558 • Published Sep 19, 2024
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights Paper • 2501.17790 • Published Jan 29, 2025 • 3
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity Paper • 2505.11107 • Published May 16, 2025 • 29
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity Paper • 2505.11107 • Published May 16, 2025 • 29
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity Paper • 2505.11107 • Published May 16, 2025 • 29
TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling Paper • 2504.07053 • Published Apr 9, 2025 • 6
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Paper • 2501.13921 • Published Jan 23, 2025 • 3
Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning Paper • 2307.10274 • Published Jul 18, 2023