-
Can LLMs Follow Simple Rules?
Paper • 2311.04235 • Published • 13 -
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 83 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 190 -
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 88
Kevin-Brian N'Diaye
kevin-nd
·
AI & ML interests
- Computer Vision
- Vision-Language-Action Models
Organizations
None yet
Papers
-
Can LLMs Follow Simple Rules?
Paper • 2311.04235 • Published • 13 -
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 83 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 190 -
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 88