This dataset is a modified version of the California Housing dataset available from Luís Torgo’s page (University of Porto). Luís Torgo obtained it from the StatLib repository (which is closed now). The dataset may also be downloaded from StatLib mirrors.
This dataset appeared in the 1997 paper "Sparse Spatial Autoregressions" by R. Kelley Pace and Ronald Barry, published in Statistics & Probability Letters. They built it using the 1990 California census data. It contains one row per census block group. A block group is the smallest geographical unit for which the U.S. Census Bureau publishes sample data (a block group typically has a population of 600 to 3,000 people).
A typical performance measure for regression problems is the Root Mean Square Error (RMSE). It gives an idea of how much error the system typically makes in its predictions, with a higher weight for large errors.
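Equation of RMSE:

$$\mathrm{RMSE} = \sqrt{\frac{1}{m} \sum_{i=1}^{m} \left( \hat{y}_i - y_i \right)^2}$$

where $m$ is the number of instances, $\hat{y}_i$ is the predicted value for instance $i$, and $y_i$ is its actual value.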
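As a rough illustration of the RMSE described above, here is a minimal sketch in plain Python. The function name `rmse` and the sample house values are assumptions made up for this example; they are not taken from the project code.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one value is required")
    # Square each error so that large errors weigh more than small ones.
    squared_errors = [(p - t) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical house values in dollars, purely for illustration.
actual = [200_000, 150_000, 320_000]
predicted = [210_000, 140_000, 300_000]
print(rmse(actual, predicted))
```

For these made-up values the RMSE is about 14,142, while the plain average of the absolute errors is about 13,333. The gap comes from the single 20,000 error, which squaring weights more heavily.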
Data pre-processing is an important step in building a machine learning model. Raw data is often not clean or not in the format the model requires, which can lead to misleading results. Pre-processing transforms the data into the required format and deals with noise, duplicates, and missing values. Typical activities include importing the dataset, splitting it into training and test sets, and scaling attributes. Careful pre-processing generally improves the accuracy of the model.
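The pre-processing activities listed above can be sketched with the standard library alone. This is only an illustration under stated assumptions: the column names (`median_income`, `total_bedrooms`, `median_house_value`), the sample rows, the 80/20 split, and the median fill strategy are all chosen for this example, and the project itself may use different tools and choices.

```python
import random
import statistics

# Hypothetical rows: (median_income, total_bedrooms, median_house_value).
rows = [
    (8.3, 129.0, 452600.0),
    (8.3, 129.0, 452600.0),  # exact duplicate of the first row
    (7.2, None, 352100.0),   # missing total_bedrooms
    (5.6, 235.0, 341300.0),
    (3.8, 280.0, 342200.0),
    (4.0, 213.0, 269700.0),
]

# 1. Drop exact duplicates while preserving the original order.
unique_rows = list(dict.fromkeys(rows))

# 2. Fill missing total_bedrooms with the median of the known values.
known = [bed for _, bed, _ in unique_rows if bed is not None]
median_bedrooms = statistics.median(known)
filled = [
    (inc, median_bedrooms if bed is None else bed, val)
    for inc, bed, val in unique_rows
]

# 3. Shuffle with a fixed seed, then split 80/20 into train and test sets.
rng = random.Random(42)
shuffled = filled[:]
rng.shuffle(shuffled)
split = int(len(shuffled) * 0.8)
train, test = shuffled[:split], shuffled[split:]


# 4. Standardize a feature using statistics from the training set only,
#    so that no information from the test set leaks into training.
def standardize(column, mean, std):
    return [(x - mean) / std for x in column]


train_income = [row[0] for row in train]
mean_inc = statistics.mean(train_income)
std_inc = statistics.pstdev(train_income)
train_income_scaled = standardize(train_income, mean_inc, std_inc)
test_income_scaled = standardize([row[0] for row in test], mean_inc, std_inc)
```

Computing the scaling statistics on the training set and reusing them for the test set keeps the evaluation honest, since the model never sees test-set information during training.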
The dataset is now available here.
All the dependencies and required libraries are listed in the file requirements.txt.
See here
The code is written in Python 3.7. If you don’t have Python installed, you can find it here. If you are using an older version of Python, upgrade it first, and make sure you have the latest version of pip. To install the required packages and libraries, run the following commands after cloning the repository:
- Clone the repo
git clone https://github.com/Chaganti-Reddy/House-Price-Prediction.git
- Change your directory to the cloned repo
cd House-Price-Prediction
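- Create and activate a virtual environment, then install the required libraries by running the following commands in your Terminal/Command Prompt (on Windows, activate the environment with my_env\Scripts\activate instead of the source command)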
python3 -m virtualenv my_env
source my_env/bin/activate
pip3 install -r requirements.txt
- From the cloned project directory, run the script with the following command:
python3 Housing.py
You may cite any part of the code or the dataset, and you may use them in your research work or project.
