-
Notifications
You must be signed in to change notification settings - Fork 28
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
4815706
commit d83f203
Showing
1 changed file
with
41 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,41 @@ | ||
--- | ||
title: Using Databricks with R | ||
author: | ||
- name: Edgar Ruiz | ||
affiliations: | ||
- name: Posit PBC | ||
description: | | ||
Overview of the latests methods to connect, and interact with | ||
Databricks services. | ||
categories: [databricks, databases, Spark, dplyr] | ||
--- | ||
|
||
# Description | ||
|
||
As most organization's data migrate to the cloud, the ability to analyze data in-place becomes more important. This workshop will walk you through how to think about remote data, how to access it, and how to analyze it efficiently. We will review the latest in integrations between R and Databricks. The two integrations that we will review are: | ||
|
||
- [Spark via Databricks Connect](https://spark.rstudio.com/deployment/databricks-connect.html) | ||
|
||
- [ODBC connection](https://solutions.posit.co/connections/db/databases/databricks/) | ||
|
||
During the workshop, we will discuss best practices for when to use which integration, as well as techniques to take your analysis into production. | ||
|
||
# Audience | ||
|
||
This course is for you if you: | ||
|
||
- Are an R user | ||
|
||
- Plan, or are currently, using Databricks services | ||
|
||
- Need to learn how to access and analyze data in Databricks | ||
|
||
# Instructor(s) | ||
|
||
+----------------------------+--------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ | ||
| ![](images/edgar-ruiz.jpg) | | Edgar co-authored a book called “Mastering Spark with R,” and is currently the maintainer of the `sparklyr` package. Edgar has also authored multiple articles, blog posts sharing analytics insights, and server infrastructure for data science. He has a background in deploying enterprise reporting, and business intelligence solutions. | | ||
| | | | | ||
| | | <https://www.linkedin.com/in/edgararuiz/> | | ||
+----------------------------+--------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+ | ||
|
||
: {tbl-colwidths="\[25,5,70\]"} |