Titanic Machine Learning from Disaster

Project Overview

This data science project aims to build a predictive model to answer the question: "What sorts of people were more likely to survive?" using passenger data from the Titanic disaster. The project will analyze various attributes of the passengers, such as their names, ages, genders, socio-economic classes, and more, to predict the likelihood of survival.

About the Dataset:

The dataset used in this project contains information about the passengers who were aboard the Titanic during its ill-fated voyage. It includes a range of features, including the passengers' names, ages, genders, socio-economic classes, cabin information, ticket details, and survival status.

Data Dictionary

Variable Definition Key
survival Survival 0 = No, 1 = Yes
pclass Ticket class 1 = 1st, 2 = 2nd, 3 = 3rd
sex Sex
Age Age in years
sibsp # of siblings / spouses aboard the Titanic
parch # of parents / children aboard the Titanic
ticket Ticket number
fare Passenger fare
cabin Cabin number
embarked Port of Embarkation C = Cherbourg, Q = Queenstown, S = Southampton
View Notebook Download Dataset