alphaXiv PRO

alphaXiv

1 3

·

https://www.alphaxiv.org

AI & ML interests

None yet

Recent Activity

updated a model 6 days ago

alphaXiv/sdpo-tau-retail-sft-qwen3-4b

published a model 6 days ago

alphaXiv/sdpo-tau-retail-sft-qwen3-4b

updated a dataset 29 days ago

alphaXiv/vpo-toolrl-sft-m5

View all activity

Organizations

None yet

alphaXiv 's models 44

alphaXiv/retrieve-4B-945

4B • Updated Feb 13 • 2

alphaXiv/retrieve-4B-1

4B • Updated Feb 13 • 2

alphaXiv/maths-Qwen-2.5-0.5B

0.6B • Updated Jan 21 • 3

alphaXiv/attention-is-not-all-you-need-models

alphaXiv/spurious-rewards-reasoning-traces

alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-400

2B • Updated Jan 1 • 2

alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-1000

2B • Updated Jan 1 • 2

alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-200

2B • Updated Jan 1 • 2

alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-50

2B • Updated Jan 1 • 4

alphaXiv/Qwen-2.5-1.5b-instruct-ppo

2B • Updated Dec 26, 2025 • 6

alphaXiv/Qwen-2.5-1.5b-instruct-grpo

2B • Updated Dec 26, 2025 • 2

alphaXiv/trm-model-arc-agi-1

Updated Oct 22, 2025 • 4

alphaXiv/trm-model-sudoku

Updated Oct 22, 2025 • 3

alphaXiv/trm-model-maze

Updated Oct 22, 2025 • 5