A basic installation of the Anaconda distribution of Python is sufficient to run the notebook with Python 3.*. The only modules imported are: NumPy, Pandas and Seaborn.
This projects intends to analyze the subjective differences between Data Analysts, Data Scientists and Data Engineers stated on the 2020 Stack Overflow Developer Survey.
Analysis.ipynb
is the main notebook used for the analysisassets/
contains the files for the charts.charts.pptx
is the PowerPoint file where the plots were made and thecharts/
folder contain the exported.png
images.data/
is the folder containing all the files downloaded from the 2020 Stack Overflow Developer Survey, withsurvey_results_public.csv
containing the survey results of the public questions andsurvey_results_schema.csv
containing the correspondence of column name and question posed in the survey.
The main findings are presented in my Medium blog post.
Credit goes to Stack Overflow for making the survey data available. The Public 2020 Stack Overflow Developer Survey Results is made available under the Open Database License (ODbL). Any rights in individual contents of the database are licensed under the Database Contents License.