Liv d'Aliberti PRO

od2961

https://liv-daliberti.github.io/

liv-daliberti

Recent Activity

updated a dataset 1 day ago

od2961/polymarket-full-market-dataset

published a dataset 1 day ago

od2961/polymarket-full-market-dataset

updated a model about 2 months ago

od2961/adaptive-entropy-mad-td-gym8-public-3m

od2961's models (46)

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v7

2B • Updated Aug 8, 2025 • 2

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v8

2B • Updated Aug 8, 2025 • 1

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v6

2B • Updated Aug 7, 2025 • 2

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v5

2B • Updated Aug 5, 2025 • 1

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v4

2B • Updated Aug 4, 2025 • 1

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v3

2B • Updated Aug 3, 2025

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords-v2

Text Generation • 2B • Updated Jul 31, 2025 • 4

od2961/Qwen2.5-1.5B-Open-R1-GRPO-Crosswords

2B • Updated Jul 15, 2025 • 2

od2961/Qwen2.5-7B-Open-R1-GRPO

8B • Updated Jun 28, 2025 • 1

od2961/Qwen2.5-1.5B-Open-R1-GRPO

2B • Updated Jun 21, 2025 • 1

od2961/Qwen2.5-1.5B-Open-R1-Code-GRPO

Updated Jun 7, 2025

od2961/Qwen2.5-1.5B-Open-R1-Math-GRPO

2B • Updated Jun 7, 2025 • 1

od2961/Qwen2.5-1.5B-Instruct-GRPO-vs-SFT

Updated Jun 6, 2025

od2961/Qwen2.5-1.5B-Instruct-GRPO

2B • Updated Jun 3, 2025 • 2 • 1

od2961/Qwen2.5-7B-Instruct-GRPO

8B • Updated Apr 30, 2025 • 1

od2961/Qwen2.5-7B-Instruct-SFT

Text Generation • 8B • Updated Apr 19, 2025 • 24 •