This repository contains SQL scripts and documentation for cleaning and standardizing data in the NashvilleHousing
table within the sqlproject2
database. The project aims to prepare the dataset for analysis by addressing inconsistencies, filling missing values, standardizing formats, and removing duplicates.
- Database Creation: Initializes the
sqlproject2
database. - Data Standardization: Converts date formats and standardizes field values.
- Address Processing: Splits combined address fields into separate columns for easier analysis.
- Data Deduplication: Identifies and removes duplicate records to ensure data integrity.
- Column Cleanup: Removes unused columns to streamline the dataset.
- Setup: Create and populate the
NashvilleHousing
table in thesqlproject2
database. - Execution: Run the provided SQL scripts in SQL Server Management Studio (SSMS) to clean the data.
- Verification: Review the final cleaned dataset to confirm the changes.
- README: Provides an overview of the project, step-by-step instructions, and usage guidelines.
- SQL Scripts: Contains the SQL commands for each data cleaning step, including comments for clarity.