Member-only story
Use Excel to carry out an Exploratory Data Analysis on the Titanic dataset
The Titanic is perhaps one of the most famous ships in modern times because although it was said to be unsinkable, it actually hit an iceberg and sank on 15 April 1912. This dataset is a passenger list of people who boarded the ship, is very popular. The dataset is featured on Kaggle, a data science website, that allows machine learning enthusiasts an opportunity to make predictions on who the survivors might be.
The dataset that I have performed an Exploratory Data Analysis (EDA) on is the training dataset, as it appears on Kaggle. Therefore, it was not possible for me to perform a complete EDA because I do not have the testing dataset. Nevertheless, I will carry out an EDA on the information that has been made available to me in an effort to demonstrate how it is done.
The screenshot below is the training set as it appears on Excel:-
The first question that needs to be answered is what the survival rate of passengers of the Titanic is. In order to answer this question, I used the following formula in cell H1:-