Skip to content

Latest commit

 

History

History
 
 

Big Data and Analytics

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 

Big Data and Analytics

This folder contains hands-on labs introducing some of the essential services that Azure provides for handling big-data workloads and visualizing the results. Learn how to spin up Apache Spark clusters in the cloud using Azure HDInsight, use Apache Hadoop to extract information from large datasets, use Microsoft Power BI to explore and visualize data, and more.

Lab Scenario Technology/Language Cost
Azure Data Lake Import data from disparate sources into an Azure Data Lake Store and use Azure Data Lake Analytics to perform federated queries with U-SQL. Azure Data Lake
Azure SQL Database
U-SQL
$$
Hadoop on Azure HDInsight Deploy an Hadoop cluster on Azure and use MapReduce to analyze a text file and Hive to analyze a log file. Apache Hadoop
Azure HDInsight
Hive
Python
$$$
Spark on Azure HDInsight Deploy a Spark cluster on Azure and use Jupyter notebooks to analyze food-inspection data from the city of Chicago, build a machine-learning model around it, and visualize the results. Apache Spark
Azure HDInsight
Jupyter
Python
$$$
Microsoft Power BI Use Microsoft Power BI to view sales data for a fictitious company and create reports and dashboards containing visualizations of that data. Microsoft Power BI $