Project Context

In this project, I used MySQL to clean Nashville Housing data and improve its quality for data exploration and analysis.

The dataset for Nashville housing from 2013 to 2016 is available here.

This project follows four basic steps for producing reliable data through data cleaning, and they are as follows:

data_cleaning_process.png

<aside> 💡

All the queries written below are compiled on GitHub

</aside>

Groundwork

Before I started this data cleaning project, I needed to have a better understanding of data cleaning and lay the groundwork for the steps ahead. Here’s what I learned:

How to Perform Data Cleaning

Step 1: Remove Duplicate or Irrelevant Observations

01. Inspecting a table for unwanted columns