In this report I analyze National Parks Service data on endangered species in different parks and their conservation status.
- pandas
- numpy
- seaborn
- matplotlib.pyplot
- re
- Data manipulation:
- merge()
- For loop
- Group by
- lambda
- loc
- isna()
- value_counts()
- dropna()
- min()
- mean()
- sum()
- max()
- size()
- duplicated()
- list
- compile
- reset_index
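As a rough illustration of how several of the data manipulation calls listed above fit together, here is a minimal sketch. The two small DataFrames and their column names (`scientific_name`, `category`, `conservation_status`, `park_name`, `observations`) are hypothetical toy data chosen for the example, not the project's actual dataset or results.

```python
import pandas as pd

# Hypothetical toy data: names and numbers are illustrative assumptions only.
species = pd.DataFrame({
    "scientific_name": ["Canis lupus", "Ursus americanus", "Myotis lucifugus"],
    "category": ["Mammal", "Mammal", "Mammal"],
    "conservation_status": ["Endangered", None, "Species of Concern"],
})
observations = pd.DataFrame({
    "scientific_name": ["Canis lupus", "Canis lupus",
                        "Ursus americanus", "Myotis lucifugus"],
    "park_name": ["Park A", "Park B", "Park A", "Park B"],
    "observations": [30, 45, 120, 80],
})

# isna() flags species that have no recorded conservation status.
missing_status = species["conservation_status"].isna().sum()

# value_counts() tallies species per status (missing values are skipped).
status_counts = species["conservation_status"].value_counts()

# merge() attaches the species attributes to every observation row.
merged = observations.merge(species, on="scientific_name", how="left")

# dropna() keeps only rows with a status, then groupby/sum/reset_index
# produce one row per (park, status) pair with the total observations.
per_park = (
    merged.dropna(subset=["conservation_status"])
    .groupby(["park_name", "conservation_status"])["observations"]
    .sum()
    .reset_index()
)

print(status_counts)
print(per_park)
```

A left merge is used here so that observations are kept even when a species has no status; whether the notebook makes the same choice is not stated in this README.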
- Data visualization:
- crosstab
- unstack
- autopct
- marker
- edgecolor
- bins
- linestyle
- label
- alpha
- plt.grid
- plt.axvline
- plt.ylim
- ascending
- pivot
- Bar chart
- plt.hist (histogram plot)
- pie chart
- crosstab
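The plotting options and chart types listed above could be combined roughly as follows. This is only a sketch on hypothetical toy data: the column names and values are assumptions made for the example, and the notebook's real figures may be built differently.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical toy data: illustrative values only.
df = pd.DataFrame({
    "category": ["Mammal", "Mammal", "Bird", "Bird", "Fish"],
    "conservation_status": ["Endangered", "Threatened", "Endangered",
                            "Species of Concern", "Threatened"],
    "observations": [30, 45, 120, 80, 60],
})

# crosstab counts species per (category, status) combination.
table = pd.crosstab(df["category"], df["conservation_status"])

fig, (ax_bar, ax_hist) = plt.subplots(1, 2, figsize=(10, 4))

# Stacked bar chart of the crosstab, one bar per category.
table.plot(kind="bar", stacked=True, ax=ax_bar, edgecolor="black")
ax_bar.set_ylabel("Number of species")

# Histogram of observation counts with a dashed line at the mean.
mean_obs = df["observations"].mean()
ax_hist.hist(df["observations"], bins=5, alpha=0.7, edgecolor="black")
ax_hist.axvline(mean_obs, linestyle="--", color="red", label="mean")
ax_hist.set_xlabel("Observations")
ax_hist.grid(True)
ax_hist.legend()

fig.tight_layout()
```

In a notebook, the `matplotlib.use("Agg")` line can be dropped so the figure renders inline; `fig.savefig(...)` writes it to a file instead.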
- Protected species
- Most protected species, both within their own category and across all species
- Endangered species
- Threatened species
- Species of concern
- Species and parks for every conservation status
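One possible way to approach the "most protected" question from the list above is to compare each category's protected count against its own size and against all species. The sketch below assumes that "protected" means any species with a non-missing conservation status, and it uses hypothetical toy data; both are assumptions for illustration, not the notebook's confirmed definitions or results.

```python
import pandas as pd

# Hypothetical toy data: illustrative values only.
species = pd.DataFrame({
    "category": ["Mammal", "Mammal", "Bird", "Bird", "Bird", "Fish"],
    "conservation_status": ["Endangered", None, "Threatened", None, None, None],
})

# Assumption: a species counts as protected if it has any recorded status.
species["is_protected"] = species["conservation_status"].notna()

# Count protected species and total species per category.
summary = (
    species.groupby("category")["is_protected"]
    .agg(protected="sum", total="size")
    .reset_index()
)

# Share within the category itself, and share relative to all species.
summary["pct_of_category"] = 100 * summary["protected"] / summary["total"]
summary["pct_of_all_species"] = 100 * summary["protected"] / len(species)

# Rank categories by how protected they are within their own group.
summary = summary.sort_values("pct_of_category", ascending=False)
print(summary)
```

The two percentage columns can rank categories differently, which is why both are computed side by side.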
- Clone this repository.
- Install the required libraries: pip install pandas numpy seaborn matplotlib (re is part of the Python standard library and does not need to be installed)
- Open and run the main notebook: biodiversity.ipynb
- Explore the generated visualizations to gain insights into the data.
- Modify the code to experiment with different visualizations and analyses.
Feel free to submit issues or pull requests for improvements or additions.