AI & ML interests
Offline RL datasets
farama-minari/HumanoidStandup-v5-SAC-simple
Reinforcement Learning
• Updated • 6
farama-minari/HumanoidStandup-v5-SAC-medium
Reinforcement Learning
• Updated • 4
farama-minari/HumanoidStandup-v5-SAC-expert
Reinforcement Learning
• Updated • 30
farama-minari/Ant-v5-SAC-expert-fine-tuned
Updated
farama-minari/Humanoid-v5-TQC-simple
Reinforcement Learning
• Updated • 2
farama-minari/Humanoid-v5-TQC-medium
Reinforcement Learning
• Updated • 4
farama-minari/Humanoid-v5-TQC-expert
Reinforcement Learning
• Updated • 8
farama-minari/Swimmer-v5-PPO-medium
Reinforcement Learning
• Updated • 6
farama-minari/Swimmer-v5-PPO-expert
Reinforcement Learning
• Updated • 48
farama-minari/Ant-v5-SAC-simple
Reinforcement Learning
• Updated • 7
farama-minari/HalfCheetah-v5-TQC-simple
Reinforcement Learning
• Updated • 3
farama-minari/Hopper-v5-SAC-simple
Reinforcement Learning
• Updated • 5
farama-minari/Hopper-v5-SAC-medium
Reinforcement Learning
• Updated • 5
farama-minari/Hopper-v5-SAC-expert
Reinforcement Learning
• Updated • 35
farama-minari/HalfCheetah-v5-TQC-medium
Reinforcement Learning
• Updated • 3
farama-minari/HalfCheetah-v5-TQC-expert
Reinforcement Learning
• Updated • 11
farama-minari/Hopper-v5-TQC-expert
Reinforcement Learning
• Updated • 2
farama-minari/Pusher-v5-SAC-medium
Reinforcement Learning
• Updated • 2
farama-minari/Pusher-v5-SAC-expert
Reinforcement Learning
• Updated • 44
farama-minari/Reacher-v5-SAC-medium
Reinforcement Learning
• Updated • 36
farama-minari/Reacher-v5-SAC-expert
Reinforcement Learning
• Updated • 7
farama-minari/Ant-v5-SAC-medium
Reinforcement Learning
• Updated • 150
farama-minari/Walker2d-v5-SAC-simple
Reinforcement Learning
• Updated • 3
farama-minari/Walker2d-v5-SAC-medium
Reinforcement Learning
• Updated • 56
farama-minari/Walker2d-v5-SAC-expert
Reinforcement Learning
• Updated • 143
farama-minari/Reacher-v5-SAC-simple
Reinforcement Learning
• Updated • 24
farama-minari/HumanoidStandup-v5-PPO-medium
Reinforcement Learning
• Updated • 1
farama-minari/HumanoidStandup-v5-PPO-simple
Reinforcement Learning
• Updated • 1
farama-minari/Humanoid-v5-SAC-medium
Reinforcement Learning
• Updated • 5
farama-minari/Humanoid-v5-SAC-simple
Reinforcement Learning
• Updated • 4