Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published Apr 1 • 47
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21, 2025 • 116
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 501 • 9
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 501
Running on CPU Upgrade Featured 306 ML Intern 🤖 306 Chat with an AI‑powered ML Intern for quick answers
chenhaodev/unsloth_gemma-4-E4B-MedCaseReasoning-merged Image-Text-to-Text • Updated 16 days ago • 34
chenhaodev/unsloth_gemma-4-E4B-MedCaseReasoning-merged Image-Text-to-Text • Updated 16 days ago • 34
chenhaodev/unsloth_gemma-4-E4B-medical-o1-reasoning-SFT-merged Image-Text-to-Text • Updated 19 days ago • 87
chenhaodev/unsloth_gemma-4-E4B-medical-o1-reasoning-SFT-merged Image-Text-to-Text • Updated 19 days ago • 87