This project analyzes aviation accident data to identify trends, contributing factors, and patterns that impact flight safety. The goal is to derive insights that can help improve aviation safety and reduce the likelihood of accidents.
The dataset is sourced from the NTSB aviation accident database and contains accident records from 1962 to 2023. It includes details such as:
- Aircraft Type & Model
- Accident Date & Location
- Weather Conditions
- Phase of Flight at Accident
- Injury Severity & Fatalities

Dataset size:
- Total Records: 88,889
- Total Features: 31

Objectives:
- Trend Analysis: Identify patterns in accident frequency over time.
- Geographical Analysis: Determine accident-prone regions.
- Aircraft Safety Evaluation: Analyze which aircraft types and engine types are more prone to accidents.
- Weather & Flight Phase Impact: Understand how weather conditions and flight phases affect accident rates.
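
Before any of this analysis, the raw CSV has to be loaded and its size checked against the figures stated for the dataset. The following is a minimal sketch, not the notebook's actual code: the helper `load_accidents` is hypothetical, the column names (`Event.Date`, `Weather.Condition`, `Broad.phase.of.flight`) are assumptions about the file, and the three rows are made up so the example runs without the real data.

```python
import io

import pandas as pd


def load_accidents(source, encoding=None):
    """Read the accident CSV into a DataFrame (hypothetical helper).

    `encoding` is passed through to pandas in case the real file is not UTF-8.
    """
    # low_memory=False lets pandas infer each column's dtype from the whole file
    return pd.read_csv(source, encoding=encoding, low_memory=False)


# Made-up stand-in for AviationData.csv; the column names are assumptions.
sample_csv = io.StringIO(
    "Event.Date,Weather.Condition,Broad.phase.of.flight\n"
    "1982-06-14,VMC,Landing\n"
    "1990-07-02,IMC,Takeoff\n"
    "2001-08-21,VMC,Cruise\n"
)

accidents = load_accidents(sample_csv)
n_records, n_features = accidents.shape

# Share of missing values per column, useful before any trend analysis
missing_share = accidents.isna().mean()

print(f"{n_records} records, {n_features} features")
```

With the real file, `load_accidents("AviationData.csv")` should yield a shape matching the record and feature counts stated for the dataset, assuming the file in the repository is the same one those counts describe.
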
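
Accident Trends:
- Recorded accidents rise sharply after 1980. The analysis attributes this to mass production of aircraft, although sparse records for earlier years in the dataset may also contribute.
- Annual accident counts have since declined, but accidents still occur frequently.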

Monthly Trends:
- Most accidents occur in June, July, and August, which aligns with heavy summer travel and, in some regions, monsoon season.
- This suggests that extra safety measures should be implemented during these months.
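
The yearly and monthly patterns described above come down to counting accidents per calendar period. Here is a minimal sketch of that counting, assuming the date column is named `Event.Date` (an assumption about the dataset); the function name `accident_counts_by_period` and the sample rows are invented for illustration.

```python
import pandas as pd

# Made-up sample rows; the real notebook would use the full dataset instead.
accidents = pd.DataFrame(
    {
        "Event.Date": [
            "1982-06-14",
            "1982-07-02",
            "1990-07-19",
            "2001-08-21",
            "2001-12-05",
            "not a date",  # malformed value, dropped below
        ]
    }
)


def accident_counts_by_period(df, date_col="Event.Date"):
    """Return (accidents per year, accidents per month) as two Series."""
    # Unparseable dates become NaT and are excluded from the counts
    dates = pd.to_datetime(df[date_col], errors="coerce").dropna()
    per_year = dates.dt.year.value_counts().sort_index()
    per_month = dates.dt.month.value_counts().sort_index()
    return per_year, per_month


per_year, per_month = accident_counts_by_period(accidents)
print(per_year)
print(per_month)
```

These are raw counts, not rates: without the number of flights per year or month, they show when accidents were recorded, not how risky flying was in that period.
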

Accident Distribution by Aircraft Type:
- Airplanes have the highest number of accidents, followed by helicopters and gliders.
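
Impact of Weather Conditions:
- More accidents are recorded in Visual Meteorological Conditions (VMC) than in Instrument Meteorological Conditions (IMC).
- The analysis reads this as a sign that reliance on instrumentation can enhance safety. Because most flying likely takes place in VMC, raw counts alone cannot confirm this; accident rates per flight would be needed.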

Flight Phases Prone to Accidents:
- The landing and takeoff phases see the most accidents.
- This indicates that advanced automated landing systems could reduce risks.
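
The weather and flight-phase findings above are comparisons of how accidents are distributed across categories. A short sketch of that comparison follows. The column names `Weather.Condition` and `Broad.phase.of.flight` are assumptions about the dataset, `category_share` is a hypothetical helper, and the four rows are made up.

```python
import pandas as pd

# Made-up sample rows standing in for the real accident records
accidents = pd.DataFrame(
    {
        "Weather.Condition": ["VMC", "VMC", "VMC", "IMC"],
        "Broad.phase.of.flight": ["Landing", "Takeoff", "Landing", "Cruise"],
    }
)


def category_share(df, column):
    """Return each category's share of accidents (missing values excluded)."""
    return df[column].value_counts(normalize=True)


weather_share = category_share(accidents, "Weather.Condition")
phase_share = category_share(accidents, "Broad.phase.of.flight")

print(weather_share)
print(phase_share)
```

A share computed this way describes the accident records only. It does not account for how much flying happens in each weather condition or how long each flight phase lasts.
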
We plan to use predictive modeling to estimate accident probability based on historical patterns. Potential models include:
- Logistic Regression
- Random Forest Classifier
- Gradient Boosting
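
As a rough illustration of how the three candidate models could be compared, the sketch below cross-validates each one with scikit-learn. It is only a sketch under stated assumptions: the project does not yet specify the target or the features, so a synthetic binary label from `make_classification` stands in (for example, for whether an accident was fatal), and the hyperparameters are arbitrary.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data: NOT derived from the accident dataset
X, y = make_classification(
    n_samples=300, n_features=8, n_informative=4, random_state=0
)

models = {
    # Scaling helps logistic regression converge; tree models do not need it
    "Logistic Regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "Random Forest": RandomForestClassifier(n_estimators=50, random_state=0),
    "Gradient Boosting": GradientBoostingClassifier(n_estimators=50, random_state=0),
}

# Mean ROC AUC over 5 cross-validation folds for each model
scores = {
    name: cross_val_score(model, X, y, cv=5, scoring="roc_auc").mean()
    for name, model in models.items()
}

for name, score in scores.items():
    print(f"{name}: mean ROC AUC = {score:.3f}")
```

On the real data, the categorical columns (weather, flight phase, aircraft type) would need encoding first, and the choice of target would determine what the predicted probability actually means.
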
- AviationAccidents-EDA.ipynb → Jupyter Notebook with full analysis
- AviationData.csv → Raw dataset
- USState_Codes.csv → Mapping file for state-level analysis
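
The mapping file suggests that state-level analysis joins a state abbreviation from each record to a full state name. One way this could look is sketched below. Everything about the schema is an assumption: a `Location` column formatted as "CITY, ST" in the accident data, and `US_State` and `Abbreviation` columns in the mapping file. The helper `accidents_per_state` and all sample values are made up.

```python
import pandas as pd

# Made-up accident locations; the "CITY, ST" format is an assumption
accidents = pd.DataFrame(
    {
        "Location": [
            "SPRINGFIELD, CA",
            "RIVERTON, ID",
            "Lakeside, VA",
            "HILLTOP, CA",
            None,  # missing location, ignored in the counts
        ]
    }
)

# Made-up stand-in for USState_Codes.csv; the column names are assumptions
state_codes = pd.DataFrame(
    {
        "US_State": ["California", "Idaho", "Virginia"],
        "Abbreviation": ["CA", "ID", "VA"],
    }
)


def accidents_per_state(df, codes):
    """Count accidents per state name using the abbreviation mapping."""
    # Take the two-letter code after the last comma, e.g. "CA" from "X, CA"
    abbrev = df["Location"].str.extract(r",\s*([A-Z]{2})\s*$", expand=False)
    merged = abbrev.rename("Abbreviation").to_frame().merge(
        codes, on="Abbreviation", how="inner"
    )
    return merged["US_State"].value_counts()


per_state = accidents_per_state(accidents, state_codes)
print(per_state)
```

The inner join silently drops records whose location is missing or does not end in a known abbreviation, so it is worth checking how many rows are lost on the real data.
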
- Python (Pandas, NumPy, Matplotlib, Seaborn, Scikit-Learn)
- Jupyter Notebook
- Git & GitHub
- Clone the repository:

      git clone https://github.com/sayed-ashfaq/Aviation-Accident-EDA.git
      cd Aviation-Accident-EDA

- Install required libraries:

      pip install pandas numpy matplotlib seaborn scikit-learn

- Open the Jupyter Notebook:

      jupyter notebook

- Run AviationAccidents-EDA.ipynb to explore the analysis.
- Weather plays a significant role in accidents. Airlines should enforce stricter rules for flights during high-risk months.
- Fewer accidents are recorded in Instrument Meteorological Conditions (IMC), when pilots fly by reference to instruments. Encouraging the use of advanced avionics in small aircraft may improve safety.
- Takeoff and landing remain critical. Investing in better pilot training and automated landing assistance is crucial.
- Personal aircraft accidents are high. Stricter maintenance and licensing regulations for private aircraft can mitigate risks.
This project is a result of extensive independent analysis and research. The initial observations, trend analysis, and insights were derived through detailed EDA performed by me. To ensure clarity and a structured presentation, ChatGPT was used for fact-checking, formatting, and refining the observations into a well-organized report. This approach allowed me to focus on deep data exploration while leveraging AI to improve readability and structure.