🔄 In a Training Loop

Ahmed Khaled Khamis

KickItLikeShika

34 36

https://kickitlikeshika.github.io/

AI & ML interests

NLP

Recent Activity

updated a model 2 days ago

KickItLikeShika/qwen3-1.7b-eopd-tooluse

published a model 2 days ago

KickItLikeShika/qwen3-1.7b-eopd-tooluse

updated a model 2 days ago

KickItLikeShika/qwen3-1.7b-eopd-science


Organizations

upvoted a paper about 2 months ago

Robots Need More than VLA and World Models

Paper • 2606.06556 • Published Jun 4 • 30

upvoted 2 papers 2 months ago

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

Paper • 2605.13779 • Published May 13 • 225

Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning

Paper • 2402.13669 • Published Feb 21, 2024 • 2

upvoted 17 papers 3 months ago

Heterogeneous Scientific Foundation Model Collaboration

Paper • 2604.27351 • Published Apr 30 • 222

UniSD: Towards a Unified Self-Distillation Framework for Large Language Models

Paper • 2605.06597 • Published May 7 • 16

Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages

Paper • 2406.12739 • Published Jun 18, 2024 • 2

Self-Distillation Zero: Self-Revision Turns Binary Rewards into Dense Supervision

Paper • 2604.12002 • Published Apr 13 • 12

Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing

Paper • 2604.02288 • Published Apr 2 • 34

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published Mar 25 • 57

Co-Evolving Policy Distillation

Paper • 2604.27083 • Published Apr 29 • 68

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published Mar 27 • 365

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction

Paper • 2508.03613 • Published Aug 5, 2025 • 16

DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition

Paper • 2504.21801 • Published Apr 30, 2025 • 7

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 510

MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving

Paper • 2503.03205 • Published Mar 5, 2025 • 5

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Paper • 2402.06332 • Published Feb 9, 2024 • 20

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 48
