NASA Aviation Incident Classifier

This model is a fine-tuned version of microsoft/deberta-v3-large on the NASA Aviation Incident Reports dataset.

It achieves the following results on the evaluation set:

Loss: 0.3248
Accuracy: 95%
F1 Score: 0.9504

Model Description

A state-of-the-art text classification model designed to categorize aviation incident reports into five distinct categories. Built on DeBERTa-v3-large (435M parameters), this model leverages disentangled attention mechanisms for superior natural language understanding of technical aviation terminology.

Classification Categories

Label	Description	Examples
`mechanical_failure`	Equipment/system malfunctions	Engine failures, hydraulic leaks, electrical issues
`human_error`	Crew mistakes or procedural errors	Wrong settings, missed checklists, miscommunication
`weather_related`	Weather-induced incidents	Turbulence, icing, windshear, storms
`bird_strike`	Wildlife collisions	Bird ingestion, flock encounters
`ground_incident`	Ground operations issues	Runway incursions, vehicle collisions, ramp incidents

Intended Uses & Limitations

Intended Uses

Automated classification of aviation safety reports (ASRS-style)
Safety data analysis and trend identification
Training data augmentation for aviation NLP research
Educational demonstrations of transformer-based text classification

Limitations

Trained on synthetic data modeled after NASA ASRS reports
Best suited for English-language incident narratives
May struggle with highly ambiguous multi-cause incidents
Not validated for operational safety-critical decisions

Training and Evaluation Data

Dataset: neurontorch/nasa_aviation_incident_reports

Split	Samples
Train	400
Eval	100
Total	500

Each category contains 100 balanced samples with professional aviation terminology and root-cause-based labeling.

Training Procedure

Training Hyperparameters

The following hyperparameters were used during training:

learning_rate: 2e-05
train_batch_size: 8
eval_batch_size: 8
seed: 42
gradient_accumulation_steps: 2
total_train_batch_size: 16
optimizer: AdamW (fused) with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_ratio: 0.1
num_epochs: 20 (early stopping at epoch 10)
mixed_precision_training: Native AMP (FP16)

Training Results

Training Loss	Epoch	Step	Validation Loss	Accuracy	F1
1.626	1.0	25	1.5861	0.28	0.1813
1.2794	2.0	50	1.1491	0.57	0.4570
0.9383	3.0	75	0.8447	0.71	0.7000
0.2942	4.0	100	0.3566	0.90	0.9007
0.1401	5.0	125	0.3002	0.94	0.9386
0.1607	6.0	150	0.3129	0.94	0.9402
0.0038	7.0	175	0.3071	0.95	0.9504
0.0022	8.0	200	0.3248	0.95	0.9504
0.0015	9.0	225	0.3409	0.95	0.9504
0.0012	10.0	250	0.3545	0.95	0.9504

Usage

from transformers import pipeline

classifier = pipeline("text-classification", model="neurontorch/nasa_incident_classifier")

# Example inference
text = "Engine oil pressure warning illuminated during cruise at FL350. Crew followed QRH procedures and diverted."
result = classifier(text)
print(result)  # [{'label': 'mechanical_failure', 'score': 0.98}]

Downloads last month: 5

Safetensors

Model size

0.4B params

Tensor type

F32

Model tree for neurontorch/nasa_incident_classifier

Base model

microsoft/deberta-v3-large

Finetuned

(283)

this model

Dataset used to train neurontorch/nasa_incident_classifier

Space using neurontorch/nasa_incident_classifier 1

Evaluation results

accuracy on NASA Aviation Incident Reports
self-reported

0.950
f1 on NASA Aviation Incident Reports
self-reported

0.950