Skip to content

ninadgawad/data-science

Repository files navigation

Data Science

Data Science is an inter-disciplinary field that uses scientific methods, processes, algorithms and systems to extract knowledge and insights from many structural and unstructured data. It continues evolve as one of the most promising and in-demand career paths for skilled professionals

The term was coined as recently as 2008 when companies realized the need for data professionals who are skilled in organizing and analyzing massive amounts of data. In a 2009 McKinsey&Company article, Hal Varian, Google’s chief economist and UC Berkeley professor of information sciences, business, and economics, predicted the importance of adapting to technology’s influence and reconfiguration of different industries

Key technical tools and skills, including:

  • Python
  • Apache Spark
  • NoSQL databases
  • Jupyter Notebooks
  • Apache Hadoop
  • S3
  • MIS Dashboards

Data Science Lifecycle

Data Science Lifecycle

Components of Data Science

  • Data Strategy
  • Data Engineering
  • Data Analysis and Models
  • Data Visualization and Operationalization

Reference Links:

https://www.edx.org/course/data-science-inference-and-modeling https://www.edx.org/course/data-science-inference-and-modeling https://jupyter.org/

About

Data Science Journey

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published