Egor Petrov
moderntalker
AI & ML interests
None yet
Recent Activity
upvoted a paper about 9 hours ago
One-Step Gradient Delay is Not a Barrier for Large-Scale Asynchronous Pipeline Parallel LLM Pretraining updated a model about 23 hours ago
moderntalker/efficient_pretrain_checkpoints published a model about 2 months ago
moderntalker/efficient_pretrain_checkpointsOrganizations
None yet