4B SFT Experiments Collection Systematic SFT for Qwen3-4B. We explore diverse dataset compositions and training recipes to benchmark and improve performance across tasks. • 22 items • Updated 9 days ago
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs Paper • 2507.05686 • Published Jul 8, 2025 • 1
DNA 1.0 Collection 8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc. • 3 items • Updated Jan 26 • 1
DNA 1.0 Collection 8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc. • 3 items • Updated Jan 26 • 1
DNA 1.0 Collection 8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc. • 3 items • Updated Jan 26 • 1