TokenBPO
tonyshelby/mistral-QTBPO-merged • Updated Jan 12
tonyshelby/mistral-ATBPO-merged • Updated Jan 24
tonyshelby/llama-QTBPO-merged • Updated Jan 13
tonyshelby/llama-ATBPO-merged • Updated Jan 24
Format Distributional Reasoning FDR
tonyshelby/qwen2.5_3b_checkpoints • Updated Dec 24, 2025
tonyshelby/qwen2.5_7b_checkpoints • Updated Dec 24, 2025
tp140205/arm-router-base • 0.2B • Updated Dec 25, 2025 • 3