Releases: datainsightat/BigDatDevEnv_Docker
Fix R Environment
Merge branch 'master' of https://github.com/datainsightat/bigdata_dev… …elopment_environment
Spark ETL Template
You can upload data to Hadoop, transform it with Spark, and save the result to a Postgres database. Spark runs either locally inside the IDE or on a specified cluster.
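The flow described above could look roughly like the following PySpark sketch. It is not the template shipped in this release. The host names (`hadoop`, `postgres`), ports, file path, table name, and credentials are all hypothetical placeholders, and the transformation (dropping incomplete and duplicate rows) is only a stand-in for real logic. Writing to Postgres also assumes the PostgreSQL JDBC driver is available on Spark's classpath.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EtlConfig:
    # All defaults below are hypothetical; adjust them to the actual containers.
    hdfs_path: str = "hdfs://hadoop:9000/data/input.csv"
    jdbc_url: str = "jdbc:postgresql://postgres:5432/etl"
    table: str = "public.result"
    user: str = "postgres"
    password: str = "postgres"
    # "local[*]" runs Spark inside the current process (e.g. the IDE);
    # a value such as "spark://spark:7077" would target a standalone cluster.
    master: str = "local[*]"


def jdbc_properties(cfg: EtlConfig) -> dict:
    """Connection properties passed to Spark's JDBC writer."""
    return {
        "user": cfg.user,
        "password": cfg.password,
        "driver": "org.postgresql.Driver",
    }


def run_etl(cfg: EtlConfig) -> None:
    """Read a CSV file from HDFS, clean it, and write it to Postgres."""
    # Imported lazily so the configuration helpers work without pyspark.
    from pyspark.sql import SparkSession

    spark = (
        SparkSession.builder
        .appName("etl-sketch")
        .master(cfg.master)
        .getOrCreate()
    )
    try:
        df = spark.read.csv(cfg.hdfs_path, header=True, inferSchema=True)
        # Placeholder transformation: drop incomplete and duplicate rows.
        cleaned = df.dropna().dropDuplicates()
        cleaned.write.jdbc(
            url=cfg.jdbc_url,
            table=cfg.table,
            mode="overwrite",
            properties=jdbc_properties(cfg),
        )
    finally:
        spark.stop()
```

Calling `run_etl(EtlConfig())` would run the job locally. Passing `EtlConfig(master="spark://spark:7077")` would instead submit it to a standalone cluster, assuming one is reachable under that (hypothetical) address.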
Initial Release
The cluster consists of:
- Hadoop/Hive docker
- Spark docker
- Postgres docker
You can interact with these services using:
- Jupyterlab
- Theia
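To confirm that the services listed above are up, a small reachability check can help. The sketch below only tests whether a TCP port accepts connections. The host names and port numbers are assumptions based on common defaults, not values taken from this repository, so they should be replaced with the ports actually published by the containers.

```python
import socket

# Hypothetical endpoints; replace with the ports the containers actually expose.
SERVICES = {
    "hadoop-namenode": ("localhost", 9000),
    "spark-master": ("localhost", 7077),
    "postgres": ("localhost", 5432),
    "jupyterlab": ("localhost", 8888),
    "theia": ("localhost", 3000),
}


def is_reachable(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_services(services: dict = SERVICES) -> dict:
    """Map each service name to whether its port accepts connections."""
    return {
        name: is_reachable(host, port)
        for name, (host, port) in services.items()
    }
```

Printing `check_services()` gives a quick overview of which containers respond. An open port does not guarantee the service behind it is fully initialised.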