DERL_Group

non-profit

AI & ML interests

None defined yet.

Papers

Differentiable Evolutionary Reinforcement Learning

View all Papers

Organization Card

Community About org cards

This repo presents the models of the paper Differentiable Evolutionary Reinforcement Learning. The models are trained by our bi-level evolutionary training loop.

We release the base initial Meta-Optimizer and the best policy model for each task.

models 9

DifferentiableEvolutionaryRL/DERL-ALFWorld-L2-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 8 • 1

DifferentiableEvolutionaryRL/DERL-ALFWorld-L1-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 6

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L0-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 2

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L1-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 3

DifferentiableEvolutionaryRL/DERL-ScienceWorld-L2-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 2

DifferentiableEvolutionaryRL/DERL-ALFWorld-L0-Qwen2.5-1.5B

2B • Updated Dec 25, 2025 • 3 • 1

DifferentiableEvolutionaryRL/DERL-Meta-Optimizer-Init-Qwen2.5-0.5B-Instruct

Text Generation • 0.5B • Updated Dec 21, 2025 • 2 • 1

DifferentiableEvolutionaryRL/DERL-GSM8k-Math-Qwen2.5-3B

3B • Updated Dec 19, 2025 • 2

DifferentiableEvolutionaryRL/DERL-MATH-Qwen-2.5-3B

3B • Updated Dec 19, 2025 • 3

datasets 0

None public yet