# EVChargeEnv: An OpenEnv Benchmark for EV Charging Optimization
## 1. Motivation

As AI agents move from static prediction to acting autonomously in dynamic environments, we need richer environments than toy grids and games. One domain that naturally combines uncertainty, long-horizon planning, and multi-objective decision-making is electric vehicle (EV) charging.

EVChargeEnv is my contribution to the OpenEnv Challenge. It simulates a simplified but realistic EV charging process in which an agent must decide how much to charge at each timestep while adapting to fluctuating electricity prices and grid load.

The core objective is:

> Reach a full battery while minimizing cost and avoiding grid overload.

This makes EVChargeEnv a clean, interpretable environment that still contains meaningful complexity.

---
## 2. Environment Design

EVChargeEnv exposes a continuous-control RL task with a 4-dimensional state:

- charge_level ∈ [0, 1] – battery state of charge
- price ∈ [0, 1] – dynamic energy price
- grid_load ∈ [0, 1] – current grid stress/instability
- time_step_norm ∈ [0, 1] – normalized timestep

Agents output a continuous charging rate between 0 and 1.
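In gymnasium terms, this interface could be declared roughly as follows. This is an interface-only sketch: the class name is hypothetical, and the real definition lives in env/ev_charge_env.py.

```python
import gymnasium as gym
import numpy as np
from gymnasium import spaces

class EVChargeEnvSketch(gym.Env):
    """Interface-only sketch; see env/ev_charge_env.py for the real logic."""

    def __init__(self):
        # 4-d state: charge_level, price, grid_load, time_step_norm, all in [0, 1]
        self.observation_space = spaces.Box(low=0.0, high=1.0, shape=(4,), dtype=np.float32)
        # Action: a continuous charging rate in [0, 1]
        self.action_space = spaces.Box(low=0.0, high=1.0, shape=(1,), dtype=np.float32)
```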
### Scenarios

To test robustness, EVChargeEnv includes three difficulty modes, selectable as shown below:

- easy – smooth price/load curves and short episodes
- medium – balanced volatility (the default scenario)
- hard – noisy price/load dynamics and slower charging
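A scenario could be picked at construction time; the `scenario` keyword and the class name are assumptions about the constructor's API, not confirmed by the repo.

```python
from env.ev_charge_env import EVChargeEnv  # class name assumed from the repo layout

# "easy" | "medium" | "hard"; the keyword argument is an assumption.
env = EVChargeEnv(scenario="hard")
```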
---
## 3. Reward Function

The reward balances several competing factors:

- progress_reward – reward for increasing charge
- cost_penalty – charging during high prices costs more
- overload_penalty – charging when grid load is high is discouraged
- time_penalty – each step costs a small penalty to encourage faster completion

The penalties are subtracted from the progress term, so the per-step reward is progress_reward − cost_penalty − overload_penalty − time_penalty.
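A minimal sketch of how these terms could combine, assuming linear penalty terms and unit weights; the environment's actual coefficients and functional forms may differ.

```python
def compute_reward(delta_charge, price, grid_load, rate,
                   w_cost=1.0, w_load=1.0, time_penalty=0.01):
    """Illustrative reward; weights and exact terms are assumptions."""
    progress_reward = delta_charge                  # charge gained this step
    cost_penalty = w_cost * price * rate            # dearer to charge at high prices
    overload_penalty = w_load * grid_load * rate    # discourage charging under grid stress
    return progress_reward - cost_penalty - overload_penalty - time_penalty
```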
This encourages agents to:

- charge aggressively during low-price, low-load periods
- slow or stop charging during peak price/load
- finish charging efficiently

---
## 4. Implementation and OpenEnv Integration

The environment is implemented with gymnasium and structured to follow the OpenEnv specification. Key files include:

- env/ev_charge_env.py – main environment logic
- openenv.yaml – metadata describing observation/action spaces, rewards, and termination criteria
- run_evaluation.py – produces standardized JSON outputs for assessment
Example output:

```json
{
  "avg_reward": -1.23,
  "avg_steps": 31.2,
  "episodes": 5
}
```
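For reference, the aggregation behind this output could look like the following; the field names mirror the example above, everything else is an assumption about run_evaluation.py.

```python
import json

def summarize(episode_rewards, episode_steps):
    """Aggregate per-episode stats into the JSON shape shown above."""
    return {
        "avg_reward": sum(episode_rewards) / len(episode_rewards),
        "avg_steps": sum(episode_steps) / len(episode_steps),
        "episodes": len(episode_rewards),
    }

print(json.dumps(summarize([-1.5, -0.96], [30, 32]), indent=2))
```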
Published on the Hugging Face Hub:

https://huggingface.co/oozan/EVChargeEnv

---
## 5. Baseline Agents and Training

Three baselines are included:

- Random Baseline – acts randomly, ignoring price and load.
- Price-Aware Baseline – charges more when the price is low (sketched below).
- PyTorch Policy-Gradient Agent – a small neural policy trained with REINFORCE.
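The price-aware heuristic could be as simple as the following; the exact rule shipped in run_price_aware_evaluation.py may differ.

```python
def price_aware_policy(obs):
    """Charge at full rate when electricity is cheap, taper off as price rises."""
    charge_level, price, grid_load, _ = obs
    if charge_level >= 1.0:
        return 0.0                # battery full: stop charging
    return max(0.0, 1.0 - price)  # cheap power -> high rate, pricey power -> low rate
```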
The learned agent shows:

- improved reward over the heuristic baselines
- sensible charging patterns that track price and load
- adaptation to the default medium scenario
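For readers curious about the learning side, a policy of this shape could look like the sketch below. The architecture and distribution choices are assumptions; train_evchargeenv_pg.py is the reference implementation.

```python
import torch
import torch.nn as nn

class TinyPolicy(nn.Module):
    """Gaussian policy over the charging rate; layer sizes are assumptions."""

    def __init__(self, obs_dim=4, hidden=32):
        super().__init__()
        self.body = nn.Sequential(nn.Linear(obs_dim, hidden), nn.Tanh())
        self.mean_head = nn.Linear(hidden, 1)
        self.log_std = nn.Parameter(torch.zeros(1))

    def dist(self, obs):
        mean = torch.sigmoid(self.mean_head(self.body(obs)))  # keep the mean in [0, 1]
        return torch.distributions.Normal(mean, self.log_std.exp())

def reinforce_step(policy, optimizer, obs, actions, returns):
    """One REINFORCE update: weight action log-probs by episode returns."""
    dist = policy.dist(obs)
    loss = -(dist.log_prob(actions).squeeze(-1) * returns).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```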
---

## 6. Running the Environment

Install the dependencies:

```bash
pip install -r requirements.txt
```

Run the standard evaluation:

```bash
python run_evaluation.py
```

Run the heuristic baseline:

```bash
python run_price_aware_evaluation.py
```

Train the policy-gradient agent:

```bash
python train_evchargeenv_pg.py
```

A notebook is also included for Colab execution.
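To interact with the environment directly, a standard Gymnasium rollout loop should work; the class name and import path are assumptions based on the repo layout above.

```python
from env.ev_charge_env import EVChargeEnv  # class name assumed from the file layout

env = EVChargeEnv()                        # constructor arguments are an assumption
obs, info = env.reset(seed=0)
total_reward, done = 0.0, False
while not done:
    action = env.action_space.sample()     # random policy, for illustration only
    obs, reward, terminated, truncated, info = env.step(action)
    total_reward += reward
    done = terminated or truncated
print(f"Episode return: {total_reward:.2f}")
```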
---

## 7. Future Improvements

Possible extensions:

- Day/night pricing cycles
- Renewable energy influence
- Emergency events / blackouts
- Battery degradation modeling
- PPO-, SAC-, or LLM-based agents
- Visualization tools

---
## Final Thoughts

EVChargeEnv is:

- simple
- realistic
- modular
- OpenEnv-compliant

It provides a practical environment for research on planning and resource optimization under uncertainty.

Repo: https://huggingface.co/oozan/EVChargeEnv