Airbnb New York City Data Analysis

Project Overview

This project involves an in-depth analysis of Airbnb listings in New York City. The dataset contains various attributes such as listing ID, name, host ID, host name, neighborhood group, neighborhood, latitude, longitude, room type, price, minimum nights, number of reviews, last review, and reviews per month. The main objective is to uncover insights related to revenue generation by neighborhood and room type.

Technologies Used

Python
Pandas
Matplotlib
Seaborn

Tasks and Analysis

Data Preprocessing

Loaded the Airbnb dataset into a Pandas DataFrame.
Explored the dataset to understand its structure and the type of data it contains.

Exploratory Data Analysis

Count of Listings by Neighborhood Group

Calculated the number of Airbnb listings in each of the five Neighborhood Groups (Manhattan, Brooklyn, Queens, Bronx, Staten Island).

Percentage Distribution of Listings

Computed the percentage distribution of listings across the Neighborhood Groups.

Revenue Calculation

Created a new field named "Revenue," calculated as the product of the "Price" and "Number_Of_Reviews" columns.

Average Revenue by Neighborhood Group

Plotted a bar chart to show which Neighborhood Group has the highest average revenues.

Top Revenue-Generating Neighborhoods

Filtered the dataset to include only listings from Manhattan, Brooklyn, and Queens.
Identified the top 3 revenue-generating neighborhoods in these three Neighborhood Groups.

Top Revenue-Generating Room Types

Further filtered the dataset to include only the top 3 neighborhoods in each of the three main Neighborhood Groups.
Identified the top average revenue-generating room type in each of these neighborhoods and visualized this using a bar chart.

Key Insights

Manhattan has the highest number of Airbnb listings and also generates the highest average revenue.
Entire homes/apartments are generally the top revenue-generating room types across most neighborhoods.
Among the neighborhoods analyzed, Williamsburg in Brooklyn and Harlem in Manhattan are the top revenue generators.

Future Work

This analysis can be extended to include time-series data to understand seasonal trends, as well as incorporating additional data such as property amenities, host ratings, and guest reviews for a more comprehensive analysis.

Author

Christine Baxter

Acknowledgements

Data for this project was sourced from Airbnb listings in New York City.

Version Control Workflow with Git/GitHub

The project leveraged Git and GitHub for version control, ensuring a systematic and collaborative approach to code development. The version control strategy is outlined below:

Branching Strategy

Initialization: Created an initial develop branch as the base for development.
Feature Branches: For each major stage of the project (Initial Setup, Data Loading, Exploratory Data Analysis), a dedicated feature branch was created off of the develop branch.

Commit and Merge Workflow

Commit Changes: After the completion of each stage, all changes were committed to the respective feature branch, encapsulating the progress in a version-controlled manner.
Code Reviews and Merges: The feature branch was then merged into the develop branch post-review. This ensures that the develop branch always contains the most recent, stable version of the project.
Creating New Feature Branches: A new feature branch was created from the updated develop branch for the next stage of the project.
Final Merge: Upon completion of all stages, the develop branch was merged into the main branch, signifying the conclusion of the project.

Documentation

Comprehensive comments were added at each stage for clarity and future reference.

By adhering to this workflow, the project maintained a high level of code integrity, streamlined collaboration, and enabled seamless tracking of project milestones.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
Airbnb_Exercise_Student_Facing_170221.ipynb		Airbnb_Exercise_Student_Facing_170221.ipynb
LICENSE		LICENSE
README.md		README.md
airbnb_2.csv		airbnb_2.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Airbnb New York City Data Analysis

Project Overview

Technologies Used

Tasks and Analysis

Data Preprocessing

Exploratory Data Analysis

Count of Listings by Neighborhood Group

Percentage Distribution of Listings

Revenue Calculation

Average Revenue by Neighborhood Group

Top Revenue-Generating Neighborhoods

Top Revenue-Generating Room Types

Key Insights

Future Work

Author

Acknowledgements

Version Control Workflow with Git/GitHub

Branching Strategy

Commit and Merge Workflow

Documentation

About

Languages

License

christinedbaxter/sb-caseStudy-airBnb

Folders and files

Latest commit

History

Repository files navigation

Airbnb New York City Data Analysis

Project Overview

Technologies Used

Tasks and Analysis

Data Preprocessing

Exploratory Data Analysis

Count of Listings by Neighborhood Group

Percentage Distribution of Listings

Revenue Calculation

Average Revenue by Neighborhood Group

Top Revenue-Generating Neighborhoods

Top Revenue-Generating Room Types

Key Insights

Future Work

Author

Acknowledgements

Version Control Workflow with Git/GitHub

Branching Strategy

Commit and Merge Workflow

Documentation

About

Topics

Resources

License

Stars

Watchers

Forks

Languages