albagon/CS3244-Group-10-models
Updated
albagon/grpo-countdown-grpo-L512-1760317693
Text Generation
• 2B • Updated • 9
albagon/grpo-countdown-dr_grpo-L512-1760317457
Text Generation
• 2B • Updated • 5
albagon/grpo-countdown-dr_grpo-L256-1760317440
Text Generation
• 2B • Updated • 6
albagon/grpo-countdown-grpo-L512-1760297395
Text Generation
• 2B • Updated • 5
albagon/grpo-countdown-dr_grpo-L512-1760296928
Text Generation
• 2B • Updated • 6
albagon/grpo-countdown-dr_grpo-L256-1760296914
Text Generation
• 2B • Updated • 13
albagon/grpo-countdown-grpo-L256-1760297318
Text Generation
• 2B • Updated • 4
albagon/grpo-countdown-grpo-L256-1760293744
Text Generation
• 2B • Updated • 7
albagon/grpo-countdown-grpo-L512-1760289811
Text Generation
• 2B • Updated • 8
albagon/grpo-countdown-grpo-L512-1760289047
Text Generation
• 2B • Updated • 5
albagon/grpo-countdown-grpo-L256-1760289031
Text Generation
• 2B • Updated • 5
albagon/grpo-countdown-grpo-L256-1760287363
Text Generation
• 2B • Updated • 3
albagon/grpo-countdown-grpo-L256-1760286946
Text Generation
• 2B • Updated • 3
albagon/Reinforce-CartPole-v1
Reinforcement Learning
• Updated
albagon/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated • 1
Reinforcement Learning
• Updated
albagon/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated
Reinforcement Learning
• Updated
albagon/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 2