mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT Paper • 2603.21606 • Published 3 days ago • 34
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published Feb 19 • 56
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 181
thwannbe/Llama-3.1-8B-Instruct-GSM8K-Rlvr-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 11 • 220
thwannbe/Llama-3.1-8B-Instruct-GSM8K-PO-Distill-Persona-Mixed Text Generation • 8B • Updated Feb 9 • 292
thwannbe/Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill Text Generation • 8B • Updated Feb 5 • 194