Indian Used Car Price Prediction

Indian Used Car Price Prediction

Project Overview

The primary objective of this data science project is to predict the price of used cars in major Indian metro cities by analyzing a wide range of car features. These features include the car's manufacturer, model, variant, fuel type, color, kilometers driven, body style, transmission type, manufacture date, model year, CNG kit availability, ownership history, dealer details, and quality scores.

About the Dataset:

The "Indian IT Cities Used Car Dataset 2023" is a comprehensive collection of data that provides valuable insights into the used car market across major metro cities in India. This dataset encompasses details such as car models, variants, pricing, fuel types, dealer locations, warranty information, colors, kilometers driven, body styles, transmission types, ownership history, manufacture dates, model years, dealer names, CNG kit availability, and quality scores.

Data Dictionary:

Column Name Description
ID Unique ID for each listing
Company Name of the car manufacturer
Model Name of the car model
Variant Name of the car variant
Fuel Type Fuel type of the car
Color Color of the car
Kilometer Number of kilometers driven by the car
Body Style Body style of the car
Transmission Type Transmission type of the car
Manufacture Date Manufacture date of the car
Model Year Model year of the car
CngKit Whether the car has a CNG kit or not
Price Price of the car
Owner Type Number of previous owners of the car
Dealer State State in which the car is being sold
Dealer Name Name of the dealer selling the car
City City in which the car is being sold
Warranty Warranty offered by the dealer
Quality Score Quality score of the car

Conclusion

From the exploratory data analysis, several key insights have been derived regarding the Indian used car market. These insights include demand and price trends, the influence of factors such as fuel type and color, the impact of kilometer readings and body styles, geographical price variations, and the role of quality scores in determining car prices.

Machine learning models, specifically the Decision Tree Regressor and Random Forest Regressor, were utilized to predict car prices, with the Random Forest Regressor proving more effective. Feature importance analysis revealed that car age, body style, and car manufacturer are key factors affecting car prices.

This project offers valuable insights for both buyers and sellers in the Indian used car market, facilitating data-driven decision-making and price predictions.

View Notebook Download Dataset