GPT-1900 Collection Pre-1900 LLMs for physics reasoning. RL models are physics-only; use the SFT model for general chat. Tune temperature (0.6-0.7). • 11 items • Updated Apr 2 • 9
science-of-finetuning/diff-mining-qwen3-14b-cross-method-intersection Viewer • Updated Mar 31 • 5 • 7
science-of-finetuning/diff-mining-qwen3-14b-cross-method-intersection Viewer • Updated Mar 31 • 5 • 7
science-of-finetuning/diff-mining-qwen3-14b-union-tulu-frac-fineweb-nmf Viewer • Updated Mar 31 • 20 • 15
science-of-finetuning/diff-mining-qwen3-14b-union-tulu-frac-fineweb-nmf Viewer • Updated Mar 31 • 20 • 15
science-of-finetuning/diff-mining-qwen3-14b-tulu-fraction-positive-diff Viewer • Updated Mar 31 • 5 • 14
science-of-finetuning/diff-mining-qwen3-14b-tulu-fraction-positive-diff Viewer • Updated Mar 31 • 5 • 14