mhla/gpt1900-d34-22btok
Updated
Pre-1900 LLMs for physics reasoning. RL models are physics-only; use the SFT model for general chat. Tune temperature (0.6-0.7).
Note Base model — pre-1900 text, 22B tokens
Note Base model — pre-1905 text, 40B tokens
Note Base model — 1900-1964 text
Note Instruct v3 (full) — default chat model
Note Instruct v3 (safe) — no opinions, physics focus
Note Contradiction RL v6 — physics eval 0.58
Note Contradiction RL v11 — BEST, physics eval 1.25
Note Pre-1900 English text corpus with metadata
Note Physics texts for continued pretraining
Note Instruction-tuning conversation pairs
Note Physics contradiction evaluation problems