Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
1
3
alphaXiv
PRO
alphaXiv
Follow
rodolphogurgel's profile picture
hghalebi's profile picture
mmahdi-sz's profile picture
12 followers
·
0 following
https://www.alphaxiv.org
askalphaxiv
alphaXiv
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
alphaXiv/evidence-multi-rlm-sft-simplified-4b
published
a model
6 days ago
alphaXiv/evidence-multi-rlm-sft-simplified-4b
updated
a model
6 days ago
alphaXiv/entropy-collapse-sft-epoch-4
View all activity
Organizations
None yet
alphaXiv
's models
43
Sort: Recently updated
alphaXiv/retrieve-4B-1
4B
•
Updated
Feb 13
•
1
alphaXiv/maths-Qwen-2.5-0.5B
0.6B
•
Updated
Jan 21
•
1
alphaXiv/attention-is-not-all-you-need-models
Updated
Jan 12
alphaXiv/spurious-rewards-reasoning-traces
Updated
Jan 6
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-400
2B
•
Updated
Jan 1
•
3
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-1000
2B
•
Updated
Jan 1
•
2
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-200
2B
•
Updated
Jan 1
•
3
alphaXiv/spurious-rewards-rlvr-training-qwen-2.5-1.5b-math-ckpt-50
2B
•
Updated
Jan 1
•
2
alphaXiv/Qwen-2.5-1.5b-instruct-ppo
2B
•
Updated
Dec 26, 2025
•
3
alphaXiv/Qwen-2.5-1.5b-instruct-grpo
2B
•
Updated
Dec 26, 2025
•
2
alphaXiv/trm-model-arc-agi-1
Updated
Oct 22, 2025
•
4
alphaXiv/trm-model-sudoku
Updated
Oct 22, 2025
•
3
alphaXiv/trm-model-maze
Updated
Oct 22, 2025
•
5
Previous
1
2
Next