view article Article Comparing sub 50GB Llama 4 Scout quants (KLD/Top P) bartowski • Apr 9, 2025 • 45
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM davidberenstein1957 • Jan 3, 2025 • 38
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs Paper • 2412.18925 • Published Dec 25, 2024 • 107
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated Mar 2 • 16
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token Paper • 2412.06676 • Published Dec 9, 2024 • 9