omaryasserhassan commited on
Commit
6421445
·
verified ·
1 Parent(s): 83d34e0

Update app/data/analysis_summary.txt

Browse files
Files changed (1) hide show
  1. app/data/analysis_summary.txt +304 -56
app/data/analysis_summary.txt CHANGED
@@ -1,83 +1,331 @@
1
- Question: What is the purpose of the Flight Delay Dashboard?
2
- Answer: The dashboard provides analytical insights into flight delays across U.S. domestic airlines, helping users explore causes, trends, carrier behavior, and risk levels using interactive visualizations and machine learning models.
3
 
4
- Question: What are the main tabs in the Flight Delay Dashboard?
5
- Answer: The main tabs are: Explore Data (interactive plotting), Machine Learning Models (classifier and regressor), and Final Insights (business conclusions and strategic recommendations).
6
 
7
- Question: What does the Explore Data page offer?
8
- Answer: It allows users to create custom visualizations using univariate and bivariate plots, including bar charts, scatterplots, and heatmaps. A filterable data viewer is also provided.
9
 
10
- Question: Which seasons have the highest flight traffic?
11
- Answer: Flight volume typically peaks during summer and winter, according to traffic trend analyses in the Explore Data section.
12
 
13
- Question: Which carriers dominate the the U.S. domestic flight market?
14
- Answer: The dominant carriers include OO (SkyWest), DL (Delta), and MQ (Envoy). About 26% of flights fall under the “Other” category, indicating high market concentration.
15
 
16
- Question: What causes most delays according to the data?
17
- Answer: More than 70% of delays are caused by carrier-related issues and late-arriving aircraft, making internal airline operations the leading sources of disruption.
18
 
19
- Question: Which seasons have the most and least delays?
20
- Answer: Summer shows the highest delay rates due to operational stress, while fall is the most efficient season. Winter also sees high delays, mainly due to weather.
21
 
22
- Question: Which airports are the most and least efficient?
23
- Answer: Chicago (ORD) and San Francisco (SFO) experience high delays, while Atlanta (ATL) and Dallas (DFW) maintain high traffic with relatively low delay rates.
24
 
25
- Question: How is risk level classified in the dashboard?
26
- Answer: Risk levels are categorized into low, medium, and high based on delay prediction models. Around 40% of flights fall into medium or high-risk categories.
27
 
28
- Question: What model is used to predict delay risk levels?
29
- Answer: The classifier uses an XGBoost model, optimized using Optuna. It predicts risk levels as 0 (Low), 1 (Moderate), or 2 (High) and supports threshold tuning based on user priorities.
30
 
31
- Question: What are the three thresholding strategies for the classifier?
32
- Answer: The options are: Maximize Precision (reduce false positives), Maximize Recall (capture all delays), and Balanced (best trade-off between accuracy and sensitivity).
33
 
34
- Question: What does the regressor model predict?
35
- Answer: The regressor estimates the arrival delay in minutes for each flight. It uses an XGBoost Regressor and reports metrics such as MAE, RMSE, and R².
36
 
37
- Question: How is the dataset preprocessed before training?
38
- Answer: The preprocessing includes dropping leakage columns, encoding categorical features, scaling numeric features, aligning with training-time column order, and filling missing values with zeros.
39
 
40
- Question: What recommendations are made for airlines based on analysis?
41
- Answer: Airlines are advised to improve internal turnaround processes, benchmark against high-performing peers, and adopt seasonal strategies to reduce disruptions and improve service.
42
 
43
- Question: What insights are provided about carrier behavior?
44
- Answer: JetBlue and Envoy show the worst delay ratios, while Delta and Southwest maintain strong performance. Delay causes differ significantly between carriers.
45
 
46
- Question: Why is winter the most disruptive season despite low traffic?
47
- Answer: Winter sees the highest disruption rate due to severe weather and system vulnerabilities, highlighting that low volume does not ensure efficiency.
48
 
49
- Question: What are the most common delay types in peak seasons?
50
- Answer: During peak seasons, late aircraft delays surge to over 40%, indicating that carriers struggle to maintain efficient turnaround under stress.
51
 
52
- Question: What delay types dominate per airport?
53
- Answer: Airports like ORD face mostly NAS-related delays, while ATL and DFW face more internal delays from carriers and late aircraft.
54
 
55
- Question: How are features engineered in the model pipeline?
56
- Answer: Feature engineering includes delay rates, seasonality flags, dominant delay cause flags, and interactions between carriers and airports.
57
 
58
- Question: What is the main prediction target for the classifier?
59
- Answer: The classifier predicts the disruption risk level as a categorical variable: 0 (Low), 1 (Moderate), or 2 (High).
60
 
61
- Question: What is the main prediction target for the regressor?
62
- Answer: The regressor predicts the arrival delay in minutes as a continuous variable.
63
 
64
- Question: How does the model handle class imbalance?
65
- Answer: Class imbalance is addressed using a combination of class weighting and post-training threshold tuning strategies.
66
 
67
- Question: What does the dashboard recommend for airport planners?
68
- Answer: Airport planners are encouraged to implement adaptive demand forecasting, improve turnaround logistics, and tailor disruption policies by airport.
69
 
70
- Question: How is delay risk scored for individual flights?
71
- Answer: Delay risk scoring is performed by the classifier, which considers engineered features and outputs one of three risk levels using thresholding.
72
 
73
- Question: What performance metrics are reported for the regressor?
74
- Answer: The regressor reports Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and R² score, indicating how well it predicts delay duration.
75
 
76
- Question: What business impact can be achieved using this dashboard?
77
- Answer: Airlines and airport authorities can reduce costs, improve service reliability, and optimize crew and asset deployment by acting on the insights provided.
78
 
79
- Question: Why is fall considered the most reliable travel season?
80
- Answer: Fall has the lowest delay and disruption rates, making it a reference point for benchmarking operational best practices.
81
 
82
- Question: What do stakeholders gain from understanding delay cause distribution?
83
- Answer: It helps prioritize internal vs. external causes, revealing where operational improvements can have the most impact on delay mitigation.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ SECTION 1: Dataset Overview
 
2
 
3
+ This dataset contains monthly-level flight arrival statistics across U.S. airports and carriers.
4
+ Each row represents a unique (carrier, airport, month, year) combination and includes delay causes, volumes, and outcomes.
5
 
6
+ Temporal Features:
 
7
 
8
+ year, month: Track seasonal and yearly trends.
 
9
 
10
+ Airline Info:
 
11
 
12
+ carrier: IATA code (e.g., AA)
 
13
 
14
+ carrier_name: Full airline name (used for display)
 
15
 
16
+ Airport Info:
 
17
 
18
+ airport: IATA code (e.g., LAX)
 
19
 
20
+ airport_name: Full airport name (used for display)
 
21
 
22
+ Flight & Delay Counts:
 
23
 
24
+ arr_flights: Total arriving flights in the month
 
25
 
26
+ arr_del15: Flights delayed over 15 minutes
 
27
 
28
+ arr_cancelled: Canceled flights
 
29
 
30
+ arr_diverted: Flights diverted to other destinations
 
31
 
32
+ Delay Causes (Counts per Flight):
 
33
 
34
+ carrier_ct: Delays caused by airline-side issues (e.g., crew, maintenance)
 
35
 
36
+ weather_ct: Delays caused by weather conditions
 
37
 
38
+ nas_ct: Delays caused by National Airspace System congestion
 
39
 
40
+ security_ct: Delays caused by security-related disruptions
 
41
 
42
+ late_aircraft_ct: Delays caused by late incoming aircraft
 
43
 
44
+ Delay Duration (Minutes):
 
45
 
46
+ arr_delay: Total delay time in minutes
 
47
 
48
+ carrier_delay, weather_delay, nas_delay, security_delay, late_aircraft_delay: Breakdown of total delay by cause
 
49
 
50
+ Additional Features (engineered):
 
51
 
52
+ delay_ratio: Proportion of delayed flights over total flights (engineered)
 
53
 
54
+ cancellation_rate: Canceled flights divided by total flights (engineered)
 
55
 
56
+ diversion_rate: Diversions divided by total flights (engineered)
57
+
58
+ disrupted: Indicates whether a flight was disrupted (1 = delayed or canceled, 0 = on-time) (engineered)
59
+
60
+ carrier_delay_pct, weather_delay_pct, etc.: Proportion of each delay cause relative to total delay (engineered)
61
+
62
+ carrier_total_flights: Total flights per carrier across dataset (engineered)
63
+
64
+ airport_delay_rate: Total airport delay rate based on delay volume and traffic (engineered)
65
+
66
+ year_month: Concatenated feature combining year and month (e.g., "2022-07") (engineered)
67
+
68
+ season: Categorical season derived from month (e.g., Summer, Winter) (engineered)
69
+
70
+ season_airport_combo: Combined feature of season and airport code (engineered)
71
+
72
+ delay_risk_level: Categorized delay risk score (0 = low, 1 = medium, 2 = high) (engineered)
73
+
74
+ mean_delay_per_flight: Average delay minutes per flight (engineered)
75
+
76
+ dominant_delay_cause: Most significant delay cause on each row (engineered)
77
+
78
+ SECTION 2: Analytical Insights
79
+
80
+ Yearly Flight Arrivals
81
+ Description: Displays the total number of flight arrivals per year to identify long-term trends.
82
+ Insight: The airline industry experienced strong growth from 2013 to 2019, followed by a sharp collapse in 2020 due to the COVID-19 pandemic. The industry recovered in 2021 and 2022, but the 2023 drop signals continuing operational and economic volatility. This highlights the need for resilient forecasting and adaptive capacity planning.
83
+
84
+ Proportion of Carrier
85
+ Description: Shows the market share of each airline carrier.
86
+ Insight: Flight activity is highly concentrated among a few dominant carriers such as OO, DL, and MQ. Around 26 percent of flights are grouped under "Other," indicating the need for detailed carrier-level analysis when assessing delay risks.
87
+
88
+ Yearly Delay Ratio
89
+ Description: Percentage of delayed flights (15+ minutes) per year.
90
+ Insight: Delay ratios rebounded after 2020, surpassing pre-pandemic levels by 2023. This suggests that operational bottlenecks and increased demand now outweigh pre-2020 system efficiency.
91
+
92
+ Yearly Cancellation Rate
93
+ Description: Annual trend in flight cancellations.
94
+ Insight: Apart from the 2020 spike during the COVID crisis, cancellation rates have remained consistently low, demonstrating strong schedule execution across airlines.
95
+
96
+ Yearly Diversion Rate
97
+ Description: Share of flights diverted from their original destination.
98
+ Insight: Diversions remain rare but have increased since the pandemic, peaking in 2023. This indicates rising instability in airport and airspace operations.
99
+
100
+ Average Delay Percentage by Cause
101
+ Description: Average share of total delay minutes for each cause.
102
+ Insight: Carrier and late aircraft delays are the leading contributors, accounting for more than 70 percent of total delay time. Improving internal operations offers the highest potential return on investment.
103
+
104
+ Delay Causes During Peak Season
105
+ Description: Breakdown of delay causes during high-traffic months.
106
+ Insight: Late aircraft delays surge to over 40 percent during peak seasons, indicating a scalability issue in airline operations.
107
+
108
+ Dominant Delay Cause per Flight
109
+ Description: Most common primary delay cause for each flight.
110
+ Insight: Carrier and late aircraft delays are the most frequent causes, reflecting internal inefficiencies and ripple effects from tight scheduling.
111
+
112
+ Average Delay per Flight by Season
113
+ Description: Mean delay duration across different seasons.
114
+ Insight: Summer has the longest average delays, followed by winter. Fall is the most efficient season and can serve as a benchmark for best practices.
115
+
116
+ Total Flights per Season
117
+ Description: Traffic volume by season.
118
+ Insight: Traffic volume does not correlate directly with delay performance. For example, winter has the lowest traffic but experiences some of the highest disruption rates.
119
+
120
+ Delay Causes by Season
121
+ Description: Aggregated delay causes across each season.
122
+ Insight: Summer is dominated by carrier and late aircraft delays, while winter is driven by weather. NAS-related delays are more prominent in spring and summer.
123
+
124
+ Disruption Rate by Season
125
+ Description: Percentage of disrupted (delayed or canceled) flights by season.
126
+ Insight: Winter has the highest disruption rate despite its low traffic. Fall consistently performs best across key metrics.
127
+
128
+ Average Delay Ratio per Carrier
129
+ Description: Ranks airlines by their average delay ratio.
130
+ Insight: JetBlue and Envoy have the highest average delay ratios, while Delta and Southwest demonstrate consistent reliability.
131
+
132
+ Carrier Delay Ratio by Season
133
+ Description: Seasonal changes in delay ratios across airlines.
134
+ Insight: Airlines like JetBlue and Frontier show a sharp increase in delays during summer. Other carriers maintain consistent performance across seasons.
135
+
136
+ Top 10 Carriers: Delay Cause Breakdown
137
+ Description: Total delay minutes by cause across the top 10 airlines.
138
+ Insight: Late aircraft and carrier-related delays dominate most carriers. Some also experience significant NAS and weather delays.
139
+
140
+ Disruption Rate vs Delay Cause (Carrier Level)
141
+ Description: Correlation between disruption rate and delay cause.
142
+ Insight: Carrier and late aircraft delays are the most predictive of system-wide disruption.
143
+
144
+ Average Delay vs Delay Rate (Top Airports)
145
+ Description: Comparison of delay frequency and severity at top airports.
146
+ Insight: Airports such as Chicago and San Francisco face frequent and severe delays, while Atlanta and Dallas manage high volumes with greater efficiency.
147
+
148
+ Delay Ratio vs Flight Volume (All Airports)
149
+ Description: Delay rate in relation to flight volume for all airports.
150
+ Insight: Efficiency is not necessarily dependent on traffic volume. Large airports can still maintain low delay ratios.
151
+
152
+ Heatmap: Delay Cause by Airport
153
+ Description: Geographic distribution of delay types across major airports.
154
+ Insight: Chicago and Newark suffer from NAS-related delays, while Atlanta and Dallas are impacted more by carrier-side issues. Location-specific delay profiles enable more effective intervention strategies.
155
+
156
+ Distribution of Delay Risk Levels
157
+ Description: Share of flights categorized as low, medium, or high delay risk.
158
+ Insight: Nearly 40 percent of flights are in the medium-to-high risk category. This classification provides a foundation for proactive resource planning and risk mitigation.
159
+
160
+ Univariate Analysis Summary
161
+
162
+ Yearly arrivals showed steady growth until the COVID-19 crash in 2020.
163
+
164
+ Carrier market share is concentrated among 3–4 dominant players, with the rest grouped as "Other."
165
+
166
+ Time Trend Summary
167
+
168
+ Delay ratios, cancellation rates, and diversion rates have fluctuated since 2020, with disruptions worsening despite lower flight volumes.
169
+
170
+ Delay Cause Summary
171
+
172
+ Carrier and late aircraft delays dominate.
173
+
174
+ Delays peak in high-travel seasons and are largely caused by internal airline processes.
175
+
176
+ Seasonal Summary
177
+
178
+ Summer has the worst performance; fall is the most efficient.
179
+
180
+ Winter delays are mostly weather-related.
181
+
182
+ Carrier Behavior Summary
183
+
184
+ Envoy and JetBlue have poor delay ratios.
185
+
186
+ Delta and Southwest remain consistently reliable.
187
+
188
+ Airport Insights Summary
189
+
190
+ Chicago and San Francisco suffer frequent delays.
191
+
192
+ Atlanta and Dallas perform well despite high volumes.
193
+
194
+ Delay patterns vary by airport and region.
195
+
196
+ Risk Level Summary
197
+
198
+ Over 40 percent of flights are at moderate or high risk of delay.
199
+
200
+ Understanding risk level is critical for operational planning.
201
+
202
+ SECTION 3: Application Overview
203
+
204
+ This Streamlit-based dashboard provides an end-to-end analysis of U.S. domestic flight delays using a unified, modular interface. It is designed for analysts, planners, airline stakeholders, and decision-makers to explore delay causes, trends, risk levels, and model-driven predictions. The interface is interactive, visually dynamic, and structured around five core tabs.
205
+
206
+ Available Tabs:
207
+
208
+ Home Page
209
+ The homepage provides a welcome message, key visual summaries, and interactive KPI cards that display core metrics such as total flights, delay ratios, cancellation rates, and average delay duration. A brief dataset description and high-level descriptive analysis offer users an immediate understanding of the dataset's scope.
210
+
211
+ Explore Data
212
+ This tab allows users to create their own visualizations through a fully modular plotting engine. Users can select any variable or combination of variables to generate univariate, bivariate, or heatmap plots using various chart types (bar, line, histogram, scatter, box, KDE, pie). A built-in table viewer lets users inspect raw data with filtering and selection tools. This tab is ideal for exploratory analysis and hypothesis testing.
213
+
214
+ Data Analysis
215
+ The Data Analysis page presents a curated, structured set of visual insights based on a complete EDA (Exploratory Data Analysis) of the dataset. The page is divided into thematic sections, such as Univariate Distributions, Seasonal Trends, Delay Cause Breakdown, Carrier Behavior, and Airport Performance. Each section includes a brief description, a stylized plot, and a professional insight box summarizing the key takeaway.
216
+
217
+ Machine Learning Models
218
+ This tab features two predictive pipelines: a classifier (for delay risk level prediction) and a regressor (for arrival delay estimation in minutes). Each model is explained with summary metrics, feature importance, and visual diagnostics. Users can enter new data or select from sample rows to receive real-time predictions. The section highlights interpretability, data preparation, and modeling techniques.
219
+
220
+ Business Insights
221
+ This page translates analytical findings into operational recommendations for airlines, airports, and policymakers. It includes strategic suggestions for addressing seasonal disruptions, optimizing carrier performance, improving airport efficiency, and managing delay risks. The content is designed in a professional tone and focused on real-world applications.
222
+
223
+ Key User Interface Features:
224
+
225
+ A chatbot is embedded in the sidebar and available on all pages. It can answer questions about the dataset, dashboard features, delay causes, seasonal effects, model performance, and more.
226
+
227
+ A dynamic dark mode toggle adjusts the interface theme and background video. Each mode has its own looping animated video to enhance the user experience.
228
+
229
+ A real-time clock is displayed at the top of the page.
230
+
231
+ The sidebar provides seamless navigation and user interaction, including access to chatbot assistance and customization controls.
232
+
233
+ Purpose and Intended Audience:
234
+
235
+ The dashboard is built to support data-driven decision-making across airline operations, airport management, and public transport policy. It empowers users to explore data freely, understand underlying delay patterns, evaluate predictive models, and apply insights toward improving scheduling efficiency, passenger experience, and system resilience.
236
+
237
+ GLOSSARY OF TERMS
238
+
239
+ arr_flights:
240
+ Total number of arriving flights for a given (carrier, airport, month, year) record.
241
+
242
+ arr_del15:
243
+ Number of flights delayed by more than 15 minutes. Used to compute delay ratio.
244
+
245
+ arr_cancelled:
246
+ Total number of canceled flights during the period.
247
+
248
+ arr_diverted:
249
+ Number of flights that were diverted to a different destination airport.
250
+
251
+ carrier_ct:
252
+ Count of delays attributed to the carrier, such as crew issues, maintenance, or scheduling errors.
253
+
254
+ weather_ct:
255
+ Count of delays caused by weather-related conditions such as storms or low visibility.
256
+
257
+ nas_ct:
258
+ Delays from National Airspace System issues, including air traffic congestion or routing problems.
259
+
260
+ security_ct:
261
+ Delays caused by security-related issues such as threats or screenings.
262
+
263
+ late_aircraft_ct:
264
+ Count of delays caused by an incoming aircraft arriving late, affecting its turnaround.
265
+
266
+ arr_delay:
267
+ Total arrival delay time (in minutes) for all flights in that row.
268
+
269
+ carrier_delay, weather_delay, nas_delay, security_delay, late_aircraft_delay:
270
+ Total delay duration for each delay type, expressed in minutes.
271
+
272
+ delay_ratio (engineered):
273
+ The proportion of delayed flights: arr_del15 divided by arr_flights.
274
+
275
+ cancellation_rate (engineered):
276
+ The proportion of canceled flights: arr_cancelled divided by arr_flights.
277
+
278
+ diversion_rate (engineered):
279
+ The proportion of diverted flights: arr_diverted divided by arr_flights.
280
+
281
+ disrupted (engineered):
282
+ A binary indicator (1 or 0) showing whether the flight group was either delayed or canceled.
283
+
284
+ total_delay (engineered):
285
+ The sum of all delay minutes from carrier, weather, NAS, security, and late aircraft.
286
+
287
+ carrier_delay_pct, etc. (engineered):
288
+ The share of total_delay attributable to each cause. For example, carrier_delay_pct = carrier_delay / total_delay.
289
+
290
+ mean_delay_per_flight (engineered):
291
+ Average delay per flight, calculated as total_delay divided by arr_flights.
292
+
293
+ dominant_delay_cause (engineered):
294
+ The delay type with the largest value for a given row.
295
+
296
+ season (engineered):
297
+ A categorical value (Winter, Spring, Summer, Fall) derived from the month column.
298
+
299
+ season_airport_combo (engineered):
300
+ A composite feature representing the interaction between season and airport.
301
+
302
+ carrier_total_flights (engineered):
303
+ The total number of arriving flights for a given carrier across all months and airports.
304
+
305
+ airport_delay_rate (engineered):
306
+ The average delay rate for a specific airport, calculated across all months and carriers.
307
+
308
+ month_delay_rate (engineered):
309
+ The average delay rate for a specific month, calculated across all carriers and airports.
310
+
311
+ delay_risk_level (engineered):
312
+ A categorical label that classifies flights into 0 (low risk), 1 (medium risk), or 2 (high risk) based on delay_ratio thresholds.
313
+
314
+ year_month (engineered):
315
+ A string combining year and month into a single value, formatted as "YYYY-MM".
316
+
317
+ SECTION: Classifier Model Summary
318
+
319
+ The classifier is a multi-class XGBoost model designed to predict the delay risk level of a flight. The target variable is delay_risk_level, which has three classes: 0 for low risk, 1 for medium risk, and 2 for high risk. The classifier uses custom thresholds to adjust class boundaries for better performance. These thresholds are defined in a JSON file located in the models folder and can be loaded dynamically depending on the selected strategy.
320
+
321
+ The preprocessing pipeline includes data cleaning, feature engineering, encoding, and scaling. Features that leak future information such as arr_delay or carrier_name are dropped. The model is trained using OneHotEncoded categorical variables and standardized numerical features. All features are reindexed to match the training schema to avoid mismatches during inference.
322
+
323
+ The classifier supports two modes. In test mode, it accepts labeled data and returns prediction metrics such as precision, recall, and F1-score. In realtime mode, it accepts new, unlabeled data and returns the predicted risk level directly.
324
+
325
+ SECTION: Regressor Model Summary
326
+
327
+ The regressor is an XGBoost model trained to predict the total arrival delay in minutes. The target variable is arr_delay. The model uses the same core pipeline structure as the classifier. It includes cleaning, feature engineering, encoding, and scaling. All delay-related features such as arr_del15, carrier_delay, and total_delay are dropped to prevent data leakage.
328
+
329
+ The regressor also supports two modes. In test mode, it returns evaluation metrics including mean absolute error (MAE), mean squared error (MSE), root mean squared error (RMSE), and R-squared (R2). In realtime mode, it only outputs the predicted delay value.
330
+
331
+ Both models share a modular design, allowing consistent preprocessing and flexible integration with the dashboard and chatbot. The models are stored in the models directory and loaded during runtime for inference.