Titanic Machine Learning from Disaster
Project Overview
This data science project aims to build a predictive model to answer the question: "What sorts of people were more likely to survive?" using passenger data from the Titanic disaster. The project will analyze various attributes of the passengers, such as their names, ages, genders, socio-economic classes, and more, to predict the likelihood of survival.
About the Dataset:
The dataset used in this project contains information about the passengers who were aboard the Titanic during its ill-fated voyage. It includes a range of features, including the passengers' names, ages, genders, socio-economic classes, cabin information, ticket details, and survival status.
Data Dictionary
Variable | Definition | Key |
---|---|---|
survival | Survival | 0 = No, 1 = Yes |
pclass | Ticket class | 1 = 1st, 2 = 2nd, 3 = 3rd |
sex | Sex | |
Age | Age in years | |
sibsp | # of siblings / spouses aboard the Titanic | |
parch | # of parents / children aboard the Titanic | |
ticket | Ticket number | |
fare | Passenger fare | |
cabin | Cabin number | |
embarked | Port of Embarkation | C = Cherbourg, Q = Queenstown, S = Southampton |