Hugging Face
Models
8,223
Sort: Trending
Active filters:
deep-rl-class
ManarAli/Reinforce-pixelcopter • Reinforcement Learning • Updated Mar 30, 2023
BoschAI/Reinforce-pixelcopter • Reinforcement Learning • Updated Mar 30, 2023
joe-hug/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Mar 29, 2023
feratur/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Mar 29, 2023
kenzo4433/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Mar 29, 2023
kenzo4433/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Mar 29, 2023
stelladk/Reinforce-PixelCopter-PLE-v0 • Reinforcement Learning • Updated Apr 19, 2023
JamesEJarvis/Reinforce-CartPole-v1 • Reinforcement Learning • Updated Mar 29, 2023
mobiusmatt/Reinforce-CartPole-v1initial • Reinforcement Learning • Updated Mar 29, 2023
JamesEJarvis/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Mar 30, 2023
adavies25/Reinforce-Cartpole-1 • Reinforcement Learning • Updated Mar 29, 2023
mobiusmatt/Reinforce-Pixelcopter-PLE-v0initial • Reinforcement Learning • Updated Mar 29, 2023
sofiapecora/Reinforce-cartpole2 • Reinforcement Learning • Updated Mar 29, 2023
gf2rl/david1 • Reinforcement Learning • Updated Mar 29, 2023
gf2rl/david2 • Reinforcement Learning • Updated Mar 29, 2023
gf2rl/david3 • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/david4 • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/h_size_2 • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/h_size_16_standard • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/h_size_100_fail • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/h_size_100_success_with_training_5000_episodes • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/max_t_50_fail • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/lr_1e-1_fail • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/lr_1e-3_not_perfect_but_not_a_complete_fail • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/gamma_0_05_fail • Reinforcement Learning • Updated Mar 30, 2023
OMARS200/Cartpole-v1 • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/gamma_0_5_Partial_fail • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/partial_observability_pole_pose_only • Reinforcement Learning • Updated Mar 30, 2023
Isaac009/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated Mar 30, 2023
gf2rl/partial_observability_cart_pose_only • Reinforcement Learning • Updated Mar 30, 2023