A wider Baby Berta Model trained using curriculum learning and layer stacking for the BabyLM Challenge Strict Small track.

Downloads last month
3
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support