NASA Aviation Incident Classifier

This model is a fine-tuned version of microsoft/deberta-v3-large on the NASA Aviation Incident Reports dataset.

It achieves the following results on the evaluation set:

  • Loss: 0.3248
  • Accuracy: 95%
  • F1 Score: 0.9504

Model Description

A state-of-the-art text classification model designed to categorize aviation incident reports into five distinct categories. Built on DeBERTa-v3-large (435M parameters), this model leverages disentangled attention mechanisms for superior natural language understanding of technical aviation terminology.

Classification Categories

Label Description Examples
mechanical_failure Equipment/system malfunctions Engine failures, hydraulic leaks, electrical issues
human_error Crew mistakes or procedural errors Wrong settings, missed checklists, miscommunication
weather_related Weather-induced incidents Turbulence, icing, windshear, storms
bird_strike Wildlife collisions Bird ingestion, flock encounters
ground_incident Ground operations issues Runway incursions, vehicle collisions, ramp incidents

Intended Uses & Limitations

Intended Uses

  • Automated classification of aviation safety reports (ASRS-style)
  • Safety data analysis and trend identification
  • Training data augmentation for aviation NLP research
  • Educational demonstrations of transformer-based text classification

Limitations

  • Trained on synthetic data modeled after NASA ASRS reports
  • Best suited for English-language incident narratives
  • May struggle with highly ambiguous multi-cause incidents
  • Not validated for operational safety-critical decisions

Training and Evaluation Data

Dataset: neurontorch/nasa_aviation_incident_reports

Split Samples
Train 400
Eval 100
Total 500

Each category contains 100 balanced samples with professional aviation terminology and root-cause-based labeling.

Training Procedure

Training Hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 16
  • optimizer: AdamW (fused) with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_ratio: 0.1
  • num_epochs: 20 (early stopping at epoch 10)
  • mixed_precision_training: Native AMP (FP16)

Training Results

Training Loss Epoch Step Validation Loss Accuracy F1
1.626 1.0 25 1.5861 0.28 0.1813
1.2794 2.0 50 1.1491 0.57 0.4570
0.9383 3.0 75 0.8447 0.71 0.7000
0.2942 4.0 100 0.3566 0.90 0.9007
0.1401 5.0 125 0.3002 0.94 0.9386
0.1607 6.0 150 0.3129 0.94 0.9402
0.0038 7.0 175 0.3071 0.95 0.9504
0.0022 8.0 200 0.3248 0.95 0.9504
0.0015 9.0 225 0.3409 0.95 0.9504
0.0012 10.0 250 0.3545 0.95 0.9504

Usage

from transformers import pipeline

classifier = pipeline("text-classification", model="neurontorch/nasa_incident_classifier")

# Example inference
text = "Engine oil pressure warning illuminated during cruise at FL350. Crew followed QRH procedures and diverted."
result = classifier(text)
print(result)  # [{'label': 'mechanical_failure', 'score': 0.98}]
Downloads last month
19
Safetensors
Model size
0.4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for neurontorch/nasa_incident_classifier

Finetuned
(237)
this model

Dataset used to train neurontorch/nasa_incident_classifier

Space using neurontorch/nasa_incident_classifier 1

Evaluation results