Skip to content

francoisjehl/sparkworker

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

sparkworker: R Worker for Apache Spark

library(sparkworker)
library(sparklyr)

sc <- spark_connect(master = "local", version = "2.0.1")
iris_tbl <- copy_to(sc, iris)

spark_lapply(iris_tbl, function(rows) {
  rows$Petal_Width <- rows$Petal_Width + 1
  rows
})
## Source:     table<sparklyr_tmp_e8341338123e> [?? x 5]
## Database:   spark connection master=local[8] app=sparklyr local=TRUE
## 
## # ... with 5 variables: Sepal_Length <dbl>, Sepal_Width <dbl>,
## #   Petal_Length <dbl>, Petal_Width <dbl>, Species <chr>
spark_disconnect(sc)

About

sparkworker: R Worker for Apache Spark

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Scala 59.1%
  • R 40.9%