useful sharded checkpoints for users to run inference or fine-tuning on Google Colab without having to deal with CPU out-of-memory (OOM) issues.
Younes B
ybelkada
AI & ML interests
Large Language Models, Quantization, Vision, Multimodality, Diffusion models
Recent Activity
New activity, 1 day ago: tiiuae/Falcon-H1-Tiny-R-0.6B: Update README.md
Upvoted a paper, 2 days ago: SERA: Soft-Verified Efficient Repository Agents
Upvoted a collection, 3 days ago: Open Coding Agents