Hugging Face
Liv d'Aliberti

od2961
·
https://liv-daliberti.github.io/
  • liv-daliberti

Recent Activity

authored a paper about 2 months ago
The Illusion of Insight in Reasoning Models
upvoted a paper about 2 months ago
The Illusion of Insight in Reasoning Models
updated a dataset 2 months ago
od2961/illusion-of-reasoning-main-traces

Organizations

Princeton University

od2961's models (44)

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v4

2B • Updated Aug 4, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v3

2B • Updated Aug 3, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v2

Text Generation • 2B • Updated Jul 31, 2025 • 9

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords

2B • Updated Jul 15, 2025

od2961/Qwen2.5-7B-Open-R1-GRPO

8B • Updated Jun 28, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO

2B • Updated Jun 21, 2025 • 1.24k

od2961/Qwen2.5-1.5B-Open-R1-SFT

Text Generation • 2B • Updated Jun 11, 2025 • 2

od2961/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated Jun 7, 2025

od2961/Qwen2.5-1.5B-Open-R1-Math-GRPO

2B • Updated Jun 7, 2025

od2961/Qwen2.5-1.5B-Instruct-SFT

Text Generation • 2B • Updated Jun 6, 2025 • 966

od2961/Qwen2.5-1.5B-Instruct-GRPO-vs-SFT

Updated Jun 6, 2025

od2961/Qwen2.5-1.5B-Instruct-GRPO

2B • Updated Jun 3, 2025 • 1

od2961/Qwen2.5-7B-Instruct-GRPO

8B • Updated Apr 30, 2025

od2961/Qwen2.5-7B-Instruct-SFT

Text Generation • 8B • Updated Apr 19, 2025 • 1