Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
Liv d'Aliberti
od2961
Follow
0 followers
·
1 following
https://liv-daliberti.github.io/
liv-daliberti
AI & ML interests
None yet
Recent Activity
authored
a paper
about 2 months ago
The Illusion of Insight in Reasoning Models
upvoted
a
paper
about 2 months ago
The Illusion of Insight in Reasoning Models
updated
a dataset
2 months ago
od2961/illusion-of-reasoning-main-traces
View all activity
Organizations
od2961
's models
44
Sort: Recently updated
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v4
2B
•
Updated
Aug 4, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v3
2B
•
Updated
Aug 3, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v2
Text Generation
•
2B
•
Updated
Jul 31, 2025
•
9
od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords
2B
•
Updated
Jul 15, 2025
od2961/Qwen2.5-7B-Open-R1-GRPO
8B
•
Updated
Jun 28, 2025
od2961/Qwen2.5-1.5B-Open-R1-GRPO
2B
•
Updated
Jun 21, 2025
•
1.24k
od2961/Qwen2.5-1.5B-Open-R1-SFT
Text Generation
•
2B
•
Updated
Jun 11, 2025
•
2
od2961/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Jun 7, 2025
od2961/Qwen2.5-1.5B-Open-R1-Math-GRPO
2B
•
Updated
Jun 7, 2025
od2961/Qwen2.5-1.5B-Instruct-SFT
Text Generation
•
2B
•
Updated
Jun 6, 2025
•
966
od2961/Qwen2.5-1.5B-Instruct-GRPO-vs-SFT
Updated
Jun 6, 2025
od2961/Qwen2.5-1.5B-Instruct-GRPO
2B
•
Updated
Jun 3, 2025
•
1
od2961/Qwen2.5-7B-Instruct-GRPO
8B
•
Updated
Apr 30, 2025
od2961/Qwen2.5-7B-Instruct-SFT
Text Generation
•
8B
•
Updated
Apr 19, 2025
•
1
Previous
1
2
Next