AI & ML interests
None yet
Organizations
None yet
albagon/CS3244-Group-10-models
Updated
albagon/grpo-countdown-grpo-L512-1760317693
Text Generation
• 2B • Updated
albagon/grpo-countdown-dr_grpo-L512-1760317457
Text Generation
• 2B • Updated
albagon/grpo-countdown-dr_grpo-L256-1760317440
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L512-1760297395
Text Generation
• 2B • Updated
albagon/grpo-countdown-dr_grpo-L512-1760296928
Text Generation
• 2B • Updated
albagon/grpo-countdown-dr_grpo-L256-1760296914
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L256-1760297318
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L256-1760293744
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L512-1760289811
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L512-1760289047
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L256-1760289031
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L256-1760287363
Text Generation
• 2B • Updated
albagon/grpo-countdown-grpo-L256-1760286946
Text Generation
• 2B • Updated
albagon/Reinforce-CartPole-v1
Reinforcement Learning
• Updated
albagon/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated
• 3
Reinforcement Learning
• Updated
albagon/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated
Reinforcement Learning
• Updated
• 9
albagon/ppo-LunarLander-v2
Reinforcement Learning
• Updated
• 1