Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
1
Languages
Licenses
Other
Reset Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Safetensors
Transformers
PEFT
TensorBoard
GGUF
Diffusers
ONNX
stable-baselines3
sentence-transformers
ml-agents
MLX
Keras
TF-Keras
Adapters
Joblib
Transformers.js
setfit
timm
sample-factory
OpenVINO
Flair
fastai
Core ML
ESPnet
NeMo
BERTopic
LiteRT
spaCy
fastText
Rust
OpenCLIP
Scikit-learn
KerasHub
Asteroid
ExecuTorch
speechbrain
AllenNLP
llamafile
Fairseq
PaddlePaddle
PaddleOCR
Stanza
pyannote.audio
Habana
Graphcore
SpanMarker
paddlenlp
unity-sentis
DDUF
univa
Apply filters
Models
25,812
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
stable-baselines3
Clear all
Chris1/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 24, 2022
•
10
Chris1/a2c-HalfCheetahBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 25, 2022
•
10
Chris1/a2c-Walker2DBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 25, 2022
•
8
osanseviero/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
Jan 17, 2023
•
11
HumanCompatibleAI/ppo-seals-MountainCar-v0
Reinforcement Learning
•
Updated
Sep 19, 2023
•
39
•
1
HumanCompatibleAI/ppo-seals-Ant-v0
Reinforcement Learning
•
Updated
Dec 29, 2022
•
28
HumanCompatibleAI/ppo-seals-Swimmer-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
22
HumanCompatibleAI/ppo-seals-Hopper-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
26
HumanCompatibleAI/ppo-seals-Humanoid-v0
Reinforcement Learning
•
Updated
Jan 2, 2023
•
20
HumanCompatibleAI/ppo-seals-Walker2d-v0
Reinforcement Learning
•
Updated
Jan 2, 2023
•
24
HumanCompatibleAI/ppo-seals-HalfCheetah-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
19
HumanCompatibleAI/ppo-Pendulum-v1
Reinforcement Learning
•
Updated
Sep 19, 2023
•
62.6k
•
5
IPPK/LunarLander-v0.1
Reinforcement Learning
•
Updated
Jul 25, 2022
•
8
Chris1/ppo-CarRacing-v0
Reinforcement Learning
•
Updated
Jul 25, 2022
•
14
th1s1s1t/dqn-SpaceInvadersNoFrameskip-v1
Reinforcement Learning
•
Updated
Jul 26, 2022
•
18
th1s1s1t/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Jul 26, 2022
•
16
ntinosmg/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Dec 7, 2022
•
11
r3sist/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jul 26, 2022
•
9
butchland/rl-ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Jul 27, 2022
•
18
AlexChe/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 26, 2022
•
9
suvadityamuk/ppo-LunarLander-v2-practicecourse-1
Reinforcement Learning
•
Updated
Jul 27, 2022
•
10
HumanCompatibleAI/sac-seals-Walker2d-v0
Reinforcement Learning
•
Updated
Jan 2, 2023
•
21
HumanCompatibleAI/sac-seals-Hopper-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
20
HumanCompatibleAI/sac-seals-HalfCheetah-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
26
HumanCompatibleAI/sac-seals-Ant-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
24
HumanCompatibleAI/sac-seals-Humanoid-v0
Reinforcement Learning
•
Updated
Jan 2, 2023
•
20
•
1
HumanCompatibleAI/sac-seals-Swimmer-v0
Reinforcement Learning
•
Updated
Dec 31, 2022
•
18
heriosousa/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 27, 2022
•
8
jaybeeja/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Dec 17, 2022
•
13
dbarbedillo/a2c-AntBulletEnv-v0
Reinforcement Learning
•
Updated
Jul 27, 2022
•
10
Previous
1
...
39
40
41
42
43
...
100
Next