README.md · TensorAeroSpace/mpc-b747-step-response at main

	---
	license: mit
	language:
	- en
	tags:
	- model-predictive-control
	- mpc
	- pytorch
	- aerospace
	- flight-control
	- boeing-747
	- learned-dynamics
	- neural-network
	- continuous-control
	- gymnasium
	library_name: tensoraerospace
	pipeline_tag: reinforcement-learning
	model-index:
	- name: MPC-OneStepMLP-B747-PitchControl
	  results:
	  - task:
	      type: model-predictive-control
	      name: Pitch Angle Tracking Control
	    dataset:
	      type: custom
	      name: Boeing 747 Longitudinal Dynamics Simulation
	    metrics:
	    - type: overshoot
	      value: 0.27
	      name: Overshoot (%)
	    - type: settling_time
	      value: 1.40
	      name: Settling Time (s)
	    - type: rise_time
	      value: 0.80
	      name: Rise Time (s)
	    - type: peak_time
	      value: 1.70
	      name: Peak Time (s)
	    - type: static_error
	      value: 0.038
	      name: Static Error
	    - type: oscillation_count
	      value: 5
	      name: Oscillation Count
	    - type: performance_index
	      value: 72.62
	      name: Performance Index
	    - type: iae
	      value: 41.25
	      name: IAE
	    - type: ise
	      value: 147.43
	      name: ISE
	    - type: itae
	      value: 33.99
	      name: ITAE
	    - type: dynamics_loss
	      value: 8.69e-6
	      name: Dynamics Model MSE Loss
	---

	# TorchMPC with Learned Dynamics (OneStepMLP) for Boeing 747 Pitch Angle Control

	<div align="center">

	![TensorAeroSpace](https://raw.githubusercontent.com/TensorAeroSpace/TensorAeroSpace/main/img/logo-no-background.png)

	Model Predictive Control with Neural Network Dynamics for Longitudinal Aircraft Control

	[![TensorAeroSpace](https://img.shields.io/badge/%F0%9F%9A%80-TensorAeroSpace-blue)](https://github.com/TensorAeroSpace/TensorAeroSpace)
	[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
	[![PyTorch](https://img.shields.io/badge/PyTorch-2.0+-red.svg)](https://pytorch.org/)

	</div>

	## Model Description

	This model combines Model Predictive Control (MPC) with a learned neural network dynamics model (OneStepMLP) to control the pitch angle (θ) of a Boeing 747 aircraft in a longitudinal flight dynamics simulation. The approach first learns the aircraft dynamics from exploration data, then uses gradient-based MPC optimization to compute optimal control actions for reference tracking.

	![image](https://cdn-uploads.huggingface.co/production/uploads/602bf7c9c4f8038e9a1e0a65/yZYzPcK_PU7uFh_j6pZii.png)

	### Key Features

	- Data-driven dynamics: Learns one-step transition model f(x, u) → Δx from exploration data
	- Gradient-based MPC: Differentiable optimization through learned dynamics
	- Step response optimization: Custom cost function for overshoot/settling time minimization
	- Warm-starting: Efficient action sequence initialization across timesteps
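The delta-dynamics idea in the feature list can be sketched as a rollout: the model returns Δx, and the next state is x + Δx. This is a minimal illustration in plain Python, not the library's implementation. `toy_delta_model` is a hypothetical stand-in for the trained OneStepMLP, and its coefficients are made up for the example.

```python
from typing import Callable, List, Sequence

State = List[float]  # [u, w, q, theta]


def toy_delta_model(x: Sequence[float], u: float) -> State:
    """Hypothetical stand-in for the learned model: returns delta-x, not x_next."""
    q = x[2]
    # Made-up coefficients: elevator drives pitch rate, pitch rate drives pitch angle.
    return [0.0, 0.0, 0.1 * u, 0.1 * q]


def rollout(
    x0: Sequence[float],
    actions: Sequence[float],
    delta_model: Callable[[Sequence[float], float], State],
) -> List[State]:
    """Predict horizon+1 states by repeatedly applying x_next = x + delta_model(x, u)."""
    states = [list(x0)]
    for u in actions:
        x = states[-1]
        dx = delta_model(x, u)
        states.append([xi + dxi for xi, dxi in zip(x, dx)])
    return states


if __name__ == "__main__":
    for state in rollout([0.0, 0.0, 0.0, 0.0], [1.0, 1.0], toy_delta_model):
        print(state)
```

In the actual controller the rollout is differentiable, so the cost of the predicted trajectory can be back-propagated to the action sequence.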

	### Intended Uses

	- Primary Use: Automatic pitch angle tracking and stabilization for Boeing 747 aircraft simulation
	- Research Applications: Benchmarking learning-based MPC algorithms for aerospace control systems
	- Educational: Learning MPC concepts with neural network dynamics in aerospace applications
	- Hybrid Control: Can be combined with analytical models for robust flight control

	## Model Architecture

	### Dynamics Model (OneStepMLP)

	The dynamics model predicts state transitions using a multi-layer perceptron:

	| Layer | Configuration |
	|-------|---------------|
	| Input | 5 (state_dim=4 + action_dim=1) |
	| Hidden 1 | Linear(5, 256) + ReLU |
	| Hidden 2 | Linear(256, 256) + ReLU |
	| Output | Linear(256, 4) |
	| Mode | Predict Δx (delta dynamics) |

	Total Parameters: ~68K (68,356 weights and biases for the layers listed above)
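The parameter total can be checked directly from the layer table. The short sketch below counts weights and biases for a fully connected 5 → 256 → 256 → 4 network. It assumes every layer has a bias term and that nothing else (for example normalization statistics) is counted as a parameter.

```python
from typing import Sequence

# Layer widths from the architecture table: input, two hidden layers, output.
LAYER_SIZES = [5, 256, 256, 4]


def count_mlp_parameters(sizes: Sequence[int]) -> int:
    """Count weights and biases of a fully connected network."""
    total = 0
    for fan_in, fan_out in zip(sizes[:-1], sizes[1:]):
        total += fan_in * fan_out  # weight matrix
        total += fan_out           # bias vector (assumed present in every layer)
    return total


total_params = count_mlp_parameters(LAYER_SIZES)
print(total_params)  # 68356
```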

	### MPC Controller

	| Parameter | Value |
	|-----------|-------|
	| Horizon | 20 steps |
	| Iterations per step | 60 |
	| Optimizer | Adam |
	| MPC Learning Rate | 0.02 |
	| Warm Start | Enabled |
	| Track Best | Enabled |
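The warm-start and track-best options can be illustrated with a small sketch. It assumes that warm start means shifting the previous plan forward by one step, and that track best means returning the lowest-cost plan seen during the iterations. Both are readings of the table, not confirmed library behaviour. Plain gradient descent is used here in place of Adam to keep the example dependency-free, and `cost_and_grad` is a hypothetical callback.

```python
from typing import Callable, List, Sequence, Tuple

CostAndGrad = Callable[[Sequence[float]], Tuple[float, List[float]]]


def warm_start(prev_plan: Sequence[float]) -> List[float]:
    """Shift the previous plan one step forward and repeat its last action."""
    if not prev_plan:
        raise ValueError("previous plan must not be empty")
    return list(prev_plan[1:]) + [prev_plan[-1]]


def optimize_plan(
    cost_and_grad: CostAndGrad,
    init_plan: Sequence[float],
    iterations: int = 60,
    lr: float = 0.02,
) -> Tuple[List[float], float]:
    """Gradient descent over an action sequence, keeping the best iterate."""
    plan = list(init_plan)
    best_plan, best_cost = list(plan), float("inf")
    for _ in range(iterations):
        cost, grad = cost_and_grad(plan)
        if cost < best_cost:  # track best: remember the cheapest plan so far
            best_plan, best_cost = list(plan), cost
        plan = [u - lr * g for u, g in zip(plan, grad)]
    final_cost, _ = cost_and_grad(plan)  # also score the last update
    if final_cost < best_cost:
        best_plan, best_cost = list(plan), final_cost
    return best_plan, best_cost
```

At each control step, the first action of the returned plan would be applied and the shifted plan used as the next initial guess.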

	### State Space

	The observation vector consists of 4 states representing the longitudinal dynamics:

	| Index | State | Description | Units |
	|-------|-------|-------------|-------|
	| 0 | u | Forward velocity perturbation | m/s (rad internally) |
	| 1 | w | Vertical velocity perturbation | m/s (rad internally) |
	| 2 | q | Pitch rate | rad/s |
	| 3 | θ | Pitch angle (tracking target) | rad |

	### Action Space

	| Dimension | Description | Range | Rate Limit |
	|-----------|-------------|-------|------------|
	| 1 | Elevator deflection | [-25°, 25°] | ±10°/step |
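The two actuator constraints in the table can be combined as shown below. This is a sketch only: it assumes the rate limit is applied before the position limit and that both are expressed in degrees. Where the library enforces them (in the environment or in the controller) is not stated here.

```python
def limit_elevator(
    u_cmd_deg: float,
    u_prev_deg: float,
    u_max_deg: float = 25.0,
    rate_max_deg: float = 10.0,
) -> float:
    """Apply the per-step rate limit, then the position limit, to an elevator command."""
    # Rate limit: the command may move at most rate_max_deg per control step.
    delta = max(-rate_max_deg, min(rate_max_deg, u_cmd_deg - u_prev_deg))
    u_limited = u_prev_deg + delta
    # Position limit: saturate at +/- u_max_deg.
    return max(-u_max_deg, min(u_max_deg, u_limited))


if __name__ == "__main__":
    print(limit_elevator(30.0, 0.0))   # rate-limited
    print(limit_elevator(30.0, 20.0))  # saturated at the position limit
```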

	## Training Details

	### Data Collection

	| Parameter | Value |
	|-----------|-------|
	| Collection Episodes | 1500 |
	| Transitions Collected | 297,000 |
	| Exploration Strategy | Multi-signal exploration |
	| Signal Types | random_steps, unit_step, multi_step, ramp, sinusoid, multisine, chirp, square_wave, triangular_wave, sawtooth, doublet, pulse, gaussian_pulse, exponential, damped_sinusoid |
	| Action Amplitude | 100% of action space |

	### Dynamics Training

	| Parameter | Value |
	|-----------|-------|
	| Epochs | 120 |
	| Batch Size | 2048 |
	| Learning Rate | 1e-4 |
	| Loss Function | MSE |
	| Final Loss | 8.69e-6 |
	| Normalization | Enabled |

	### MPC Cost Weights

	| Weight | Value | Description |
	|--------|-------|-------------|
	| W_θ | 2000.0 | Pitch tracking weight |
	| W_q | 0.2 | Pitch rate weight |
	| W_action | 0.01 | Control effort weight |
	| W_Δu | 5.0 | Control rate weight |
	| Terminal | 10.0 | Terminal cost multiplier |
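One plausible way these weights enter the objective is a quadratic tracking cost summed over the horizon. The sketch below is an assumption about the cost's form, not the library's exact formula: the quadratic terms, the placement of the terminal multiplier on the last predicted state, and the function name `tracking_cost` are all illustrative.

```python
from typing import Sequence

# Weights from the table above.
W_THETA = 2000.0
W_Q = 0.2
W_ACTION = 0.01
W_DU = 5.0
TERMINAL_MULT = 10.0


def tracking_cost(
    states: Sequence[Sequence[float]],  # horizon + 1 states, each [u, w, q, theta]
    actions: Sequence[float],           # horizon elevator commands
    theta_ref: float,
    u_prev: float = 0.0,
) -> float:
    """Assumed quadratic MPC cost over a predicted trajectory."""
    cost = 0.0
    last = len(actions) - 1
    for k, u in enumerate(actions):
        _, _, q, theta = states[k + 1]  # state reached after applying action k
        state_cost = W_THETA * (theta - theta_ref) ** 2 + W_Q * q ** 2
        if k == last:
            state_cost *= TERMINAL_MULT  # heavier penalty on the final state
        cost += state_cost + W_ACTION * u ** 2 + W_DU * (u - u_prev) ** 2
        u_prev = u
    return cost
```

The large ratio between W_θ and W_action indicates that tracking accuracy is prioritised far above control effort.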

	### Step Response Cost Configuration

	| Parameter | Value |
	|-----------|-------|
	| W_overshoot | 8,000 |
	| W_settle | 8,000 |
	| W_sse_steady | 40,000 |
	| W_time | 800 |
	| W_osc | 500 |
	| W_jerk | 50 |
	| Overshoot limit | 0.05° |
	| Settle band | 0.10° |
	| Settle time target | 1.0 s |
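To give a feel for how limits like these can shape a cost, the sketch below implements two of the terms as hinge penalties: one for exceeding the overshoot limit, and one for being outside the settle band after the settle time target. The functional form, the use of radians, and the restriction to a positive step are assumptions made for illustration. The remaining weights (W_sse_steady, W_time, W_osc, W_jerk) are not modelled.

```python
import math

W_OVERSHOOT = 8000.0
W_SETTLE = 8000.0
OVERSHOOT_LIMIT = math.radians(0.05)  # 0.05 deg, converted to rad
SETTLE_BAND = math.radians(0.10)      # 0.10 deg, converted to rad
SETTLE_TIME = 1.0                     # seconds after the step


def step_response_penalty(theta: float, theta_ref: float, t_since_step: float) -> float:
    """Assumed hinge-style penalty for a positive step in the pitch reference."""
    # Overshoot term: only the part above the allowed overshoot is penalised.
    excess = max(0.0, (theta - theta_ref) - OVERSHOOT_LIMIT)
    penalty = W_OVERSHOOT * excess ** 2
    # Settle term: after the target time, penalise any error outside the band.
    if t_since_step >= SETTLE_TIME:
        outside = max(0.0, abs(theta - theta_ref) - SETTLE_BAND)
        penalty += W_SETTLE * outside ** 2
    return penalty
```

A hinge form leaves the cost flat inside the allowed band, so the optimizer is not pushed to trade tracking speed for precision it does not need.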

	### Environment Configuration

	| Parameter | Value |
	|-----------|-------|
	| Environment | `LinearLongitudinalB747-v0` |
	| Time Step (dt) | 0.1 s |
	| Episode Duration | 20 s |
	| Initial State | [0, 0, 0, 0] |
	| Reference Signal | Step function |
	| Step Amplitude | 1.0° |
	| Step Time | 5.0 s |
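The reference described by this table can be reproduced without the library as a quick sanity check. The sketch assumes the step switches on at the first sample with t ≥ 5.0 s and that the signal is expressed in radians. The library's `unit_step` may treat the boundary sample differently.

```python
import math
from typing import List

DT = 0.1                  # s
DURATION = 20.0           # s
STEP_TIME = 5.0           # s
STEP_AMPLITUDE_DEG = 1.0  # deg


def step_reference(dt: float, duration: float, step_time: float, amplitude_deg: float) -> List[float]:
    """Build a step reference in radians, sampled every dt seconds."""
    n_steps = int(round(duration / dt)) + 1  # include both t=0 and t=duration
    step_index = int(round(step_time / dt))  # integer index avoids float comparisons
    amplitude_rad = math.radians(amplitude_deg)
    return [amplitude_rad if k >= step_index else 0.0 for k in range(n_steps)]


reference = step_reference(DT, DURATION, STEP_TIME, STEP_AMPLITUDE_DEG)
print(len(reference), reference[49], reference[50])
```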

	### Training Infrastructure

	- Hardware: CUDA GPU (recommended) / CPU
	- Framework: PyTorch 2.0+
	- Compile Mode: reduce-overhead (CUDA only)

	## Evaluation Results

	### Performance Metrics

	| Metric | Value |
	|--------|-------|
	| Overshoot | 0.27% |
	| Settling Time (±5%) | 1.40 s |
	| Rise Time | 0.80 s |
	| Peak Time | 1.70 s |
	| Static Error | 0.038 |
	| Oscillation Count | 5 |
	| Performance Index | 72.62 |
	| Damping Degree | -0.002 |
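For readers who want to compute comparable numbers on their own trajectories, the sketch below uses common textbook definitions: overshoot relative to the target, 10% to 90% rise time, and settling time as the first sample after which the response stays inside the ±5% band. These definitions are assumptions and may differ from the ones used to produce the table. Times are measured from the step onset, and a positive, non-zero target is assumed.

```python
from typing import Dict, Optional, Sequence


def step_metrics(
    t: Sequence[float], y: Sequence[float], target: float, band: float = 0.05
) -> Dict[str, Optional[float]]:
    """Overshoot (%), peak time, 10-90% rise time and settling time of a step response."""
    peak = max(y)
    peak_time = t[list(y).index(peak)]
    overshoot_pct = max(0.0, (peak - target) / target * 100.0)

    # First crossings of 10% and 90% of the target value.
    t10 = next((ti for ti, yi in zip(t, y) if yi >= 0.1 * target), None)
    t90 = next((ti for ti, yi in zip(t, y) if yi >= 0.9 * target), None)
    rise_time = None if t10 is None or t90 is None else t90 - t10

    # Settling time: first sample after the last excursion outside the band.
    tol = band * abs(target)
    outside = [k for k, yi in enumerate(y) if abs(yi - target) > tol]
    if not outside:
        settling_time: Optional[float] = t[0]
    elif outside[-1] == len(y) - 1:
        settling_time = None  # never settles within the recorded window
    else:
        settling_time = t[outside[-1] + 1]

    return {
        "overshoot_pct": overshoot_pct,
        "peak_time": peak_time,
        "rise_time": rise_time,
        "settling_time": settling_time,
    }
```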

	### Integral Criteria

	| Criterion | Value |
	|-----------|-------|
	| IAE (Integral Absolute Error) | 41.25 |
	| ISE (Integral Squared Error) | 147.43 |
	| ITAE (Integral Time-weighted Absolute Error) | 33.99 |
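The three criteria are the integrals of |e|, e² and t·|e| over the episode, where e is the tracking error. A minimal sketch using a rectangle-rule approximation follows. The units and scaling behind the reported values are not stated here, so this shows the definitions rather than a way to reproduce the exact numbers.

```python
from typing import Dict, Sequence


def integral_criteria(t: Sequence[float], error: Sequence[float], dt: float) -> Dict[str, float]:
    """Rectangle-rule approximations of IAE, ISE and ITAE for a sampled error signal."""
    iae = sum(abs(e) for e in error) * dt                        # integral of |e|
    ise = sum(e * e for e in error) * dt                         # integral of e^2
    itae = sum(ti * abs(e) for ti, e in zip(t, error)) * dt      # integral of t*|e|
    return {"iae": iae, "ise": ise, "itae": itae}


if __name__ == "__main__":
    print(integral_criteria([0.0, 0.1, 0.2], [1.0, 0.5, 0.0], dt=0.1))
```

ITAE weights late errors more heavily, so it is the most sensitive of the three to residual oscillation and static error.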

	### Step Response Characteristics

	On the 1.0° step reference, the MPC controller shows:
	- ✅ Very low overshoot (~0.27%)
	- ✅ Fast settling time (1.4 s)
	- ✅ Quick rise time (0.8 s)
	- ⚠️ Some oscillations (5 cycles)
	- ⚠️ Small static error (0.038)

	## Usage

	### Installation

	```bash
	pip install tensoraerospace
	```

	### Quick Start

	```python
	import numpy as np
	import gymnasium as gym
	import torch
	from tensoraerospace.signals.standart import unit_step
	from tensoraerospace.agent.mpc import MPCAgent

	def pick_device() -> str:
	    if torch.cuda.is_available():
	        return "cuda"
	    if hasattr(torch.backends, "mps") and torch.backends.mps.is_available():
	        return "mps"
	    return "cpu"

	# Setup environment
	DT = 0.1
	TN = 20.0
	N_STEPS = int(TN / DT) + 1
	T = np.arange(N_STEPS, dtype=np.float32) * DT

	# Create step reference signal (1 degree step at t=5s)
	reference_signal = unit_step(
	    tp=T,
	    degree=1.0,
	    time_step=5.0,
	    output_rad=True,
	).reshape(1, -1)

	env = gym.make(
	    "LinearLongitudinalB747-v0",
	    number_time_steps=N_STEPS,
	    initial_state=np.array([[0.0], [0.0], [0.0], [0.0]], dtype=np.float32),
	    reference_signal=reference_signal,
	    dt=DT,
	)

	# Load pretrained agent
	agent = MPCAgent.from_pretrained("TensorAeroSpace/torchmpc-mlp-b747-step-response")
	agent.env = env
	agent.to_device(pick_device())

	# Run evaluation
	_ = env.reset()
	agent.reset()

	ref_theta_rad = reference_signal[0]
	x_ref = np.zeros((21, 4), dtype=np.float32) # horizon + 1

	for step in range(N_STEPS - 2):
	    k = int(env.unwrapped.current_step)
	    x0 = np.asarray(env.unwrapped.model.xt, dtype=np.float32).reshape(-1)

	    # Set reference for horizon
	    ref_k = float(ref_theta_rad[min(k, len(ref_theta_rad) - 1)])
	    x_ref[:, 3] = ref_k

	    action = agent.select_action(x0, x_ref=x_ref)
	    obs, reward, terminated, truncated, info = env.step(action)

	    if terminated or truncated:
	        break
	```

	### Custom Dynamics Training

	```python
	# Assumes `agent` is an MPCAgent attached to an environment, as in the Quick Start.
	# Collect exploration data
	agent.collect_data(
	    num_episodes=1500,
	    max_steps=199,
	    exploration="signals",
	    signal_kinds=["random_steps", "sinusoid", "chirp", ...],
	    dt=0.1,
	    action_amplitude_frac=1.0,
	)

	# Train dynamics model
	metrics = agent.train_dynamics(
	    epochs=120,
	    batch_size=2048,
	    loss="mse",
	)
	print(f"Final dynamics loss: {metrics['loss']:.2e}")
	```

	## Comparison with Other Methods

	| Method | Overshoot | Settling Time | Rise Time | Static Error |
	|--------|-----------|---------------|-----------|--------------|
	| MPC-MLP | 0.27% | 1.40 s | 0.80 s | 0.038 |
	| DSAC | 0.99% | 0.40 s | 0.40 s | 0.0002 |
	| PID (tuned) | ~5% | ~2.0 s | ~1.0 s | ~0 |

	## Limitations

	- Fixed Aircraft Model: Trained specifically on Boeing 747 longitudinal dynamics; may not generalize to other aircraft
	- Step Reference Focus: Optimized for step reference tracking; performance on other signal types may vary
	- Simulation Gap: Trained in simulation; real-world deployment would require additional validation
	- Computational Cost: MPC optimization at each step requires more computation than pure RL policies
	- Linear Dynamics: Based on linearized aircraft model around trim conditions
	- Some Oscillations: The controller exhibits 5 oscillation cycles during settling

	## Ethical Considerations

	- Not for Real Flight Control: This model is for research and educational purposes only. It should NOT be used for actual aircraft control systems without extensive testing, certification, and regulatory approval.
	- Simulation Only: All training and evaluation performed in simulation environments.

	## Citation

	If you use this model in your research, please cite:

	```bibtex
	@software{tensoraerospace2024,
	  title = {TensorAeroSpace: Advanced Aerospace Control Systems \& Reinforcement Learning Framework},
	  author = {TensorAeroSpace Team},
	  year = {2024},
	  url = {https://github.com/TensorAeroSpace/TensorAeroSpace},
	  license = {MIT}
	}
	```

	## Model Card Authors

	TensorAeroSpace Team

	## Model Card Contact

	- GitHub: [TensorAeroSpace/TensorAeroSpace](https://github.com/TensorAeroSpace/TensorAeroSpace)
	- Documentation: [tensoraerospace.readthedocs.io](https://tensoraerospace.readthedocs.io/)
	- Hugging Face: [TensorAeroSpace](https://huggingface.co/TensorAeroSpace)