A compendium of useful resources for statistics, data science and visualization.
Popular repositories in the data-science topic on GitHub
- Modern Dive (Ismay and Kim) A new online text focused at teaching, but with some nice examples.
- Broadening Your Statistical Horizons Generalized Linear Models and Multilevel Models
- Principles of Econometrics in R
- Library of Statistical Techniques
- Analysing Data using Linear Models
- Bayes Rules Book
- Statistical Rethinking Course Materials
- Statistical Rethinking Recoded
- Bayesian Methods for Hackers
- A programmers guide to Bayesian methods for data science using Python.
- Dartistics Awesome resource for web analytics with R!
- What They Forgot to Teach You About R
- R for Data Science (Grolemund and Wickham)
- R Markdown: The Definitive Guide
- Tidyverse Style Guide
- Tidyverse An opinionated collection of R packages designed for data science.
- Tidymodels
- Tinyverse An opinionated single package in R.
- Efficient R programming
- Handling Strings in R
- R Packages
- Mastering Shiny
- Mastering Spark with R
- Docker for R Users
- Engineering Production-Grade Shiny Apps
- Tidy Modeling in R
- Practical Python Programming Foundational aspects of Python programming with an emphasis on script writing, data manipulation, and program organization
- Python Data Science Handbook
- Technical Notes for Python/Pandas (Chris Albon)
- Think Stats Introduction to Probability and Statistics for Python Programmers
- Machine Learning Cheat Sheet
- Cookie Cutter Data Science
- Python-based framework for data science.
- Git for Humans
- Git for Data Science DATACAMP
- Version control. A must-have.
- Happy Git with R
- Git Flow
- Command Line Tricks
- Docker for R Users
- awk
- Terraform -- tool to manage cloud infrastructure
- Regex101 -- regular expressions