AI & ML interests
None yet
Organizations
None yet
view article Scaling Pedagogical Pre-training: From Optimal Mixing to 10 Billion Tokens
upvoted a paper 5 months ago upvoted a paper 6 months ago upvoted a paper 10 months ago upvoted a paper about 1 year ago upvoted a paper almost 2 years ago