This notebook describes the motivation for our project and provides the index for our following process notebooks.
Flight delays are a major problem. Almost everybody has experienced a delayed or cancelled flight before and knows how annoying it is to wait at airports and maybe even miss important meetings at the destination. Delays are not only a problem for individual customers, but also for the airlines and the US economy in general: In 2010, researchers at the University of California (Berkeley) found that flight delays lead to total costs of more than $32.9 billion!
However, usually customers do not know the reason for their delayed arrival. Therefore, we want to understand what causes the delays and if we are able to estimate the expected delay for a given flight in the future.
A few of the main questions we are planning to investigate in our project are:
To make the whole project more readable, we created four different notebooks – one for each of the main subsections.
01_Index_and_Motivation.ipynb
): Our motivation for the project.02_DataAcquisition_and_Preparation.ipynb
: Description of our data sources and data wrangling/preprocessing processes.03_DataExploration.ipynb
: Our exploratory data analysis.04_PredictiveAnalytics.ipynb
: All predictive models we built.