Cool Papers
updated
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane
Extrapolation
Paper
• 2401.17053
• Published • 33
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning
Tasks
Paper
• 2402.04248
• Published • 32
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open
Language Models
Paper
• 2402.03300
• Published • 143
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Paper
• 2402.05930
• Published • 39
Animated Stickers: Bringing Stickers to Life with Video Diffusion
Paper
• 2402.06088
• Published • 11
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Paper
• 2402.06149
• Published • 18
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model
on 100K hours of data
Paper
• 2402.08093
• Published • 61
Paper
• 2402.13144
• Published • 100
MobileLLM: Optimizing Sub-billion Parameter Language Models for
On-Device Use Cases
Paper
• 2402.14905
• Published • 134
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
• 2402.17764
• Published • 628