After about a decade of scientific data analysis, this repository is my first repository for an EDA project in business context for a stakeholder in a bootcamp in 2022.
Thank you to neuefische and all participants for this journey!
In the workflow and assignment (workflow.md, assignment.md), the project and tasks are outlined.
The dataset is the well known "King County House Data" and the task included selecting a stakeholder and finding, analysing and extracting information in the data in line with the stakeholders needs and requirements.
My specific project can be characterized as follows:
Dataset:
"King County House Data" on home sales in King County (USA)
The available columns of this dataset are described in column_names.md.
Stakeholder choice
* Person chosen: Erin Robinson
* Stakeholder type: Buyer
* Stakeholder Description: Invest in poor neighborhood, buying & selling, costs back + little profit, socially responsible.
Idea for my analysis:
I focussed on sales types. In particular, I eventually gained insights from multi-sale houses in King County as compared to single-sale houses for both: financial and social gain.
This matches the stakeholders description as a buyer who is buying houses and aims to get money back plus a little profit from the respective sale in a socially responsible way.
A minimal notebook was provided as a template for the assignment. The optional material was not used in this assignment.
The jupyter notebook contains the analysis conducted during this 2 day assignment of a bootcamp.
In parallel a presentation was prepared.
The project ended with presentations of all participants in front of the respective stakeholder.
Before inspecting the jupyter notebook, please make sure that all requirements are fulfilled:
- pyenv
- python==3.9.8
- several packages as provided in the requirements file
To use the jupyter notebook in this repository, please create a virtual environment.
- setting the python version locally to 3.9.8
- creating a virtual environment using the
venv
module - activating your newly created environment
- upgrading
pip
(This step is not absolutely necessary, but will save you trouble when installing some packages.) - installing the required packages via
pip
- install the required packaged e.g. via using the requirements file (requirements.txt)
$ pyenv local 3.9.8
$ python -m venv .venv
$ source .venv/bin/activate
$ pip install --upgrade pip
$ pip install -r ./requirements.txt
For all material form the neuefische template see LICENSE. Work on this assignment: CC-BY 4.0 - Andrea Schrader