Cloud Pak for Data 4.5 on Red Hat OpenShift on IBM Cloud

IBM Cloud Pak for Data is an end-to-end platform that helps organizations in their journey to AI. It enables data engineers, data stewards, data scientists, and business analysts to collaborate on an integrated multicloud platform. Cloud Pak for Data uses IBM's deep analytics portfolio to help organizations meet data and analytics challenges. The required building blocks (collect, organize, analyze, infuse) for an information architecture are available using Cloud Pak for Data on IBM Cloud.

This deployment guide provides instructions for deploying Cloud Pak for Data on managed Red Hat OpenShift on IBM Cloud (also known as ROKS) using Terraform.

Costs and licenses

These scripts create resources on IBM Cloud. For cost estimates, see the pricing pages for each IBM Cloud service that will be enabled. This deployment lets you use the OpenShift license bundled with your Cloud Pak entitlement. Portworx Enterprise is installed from the IBM Cloud catalog and a separate subscription from Portworx is not required.

You must have a Cloud Pak for Data entitlement API key to download images from the IBM entitled Cloud Pak registry. If you don't have a paid entitlement, you can create a 60-day trial subscription key. You can retrieve your entitlement key from the container software library.

Note: After 60 days, contact IBM Cloud Pak for Data sales.

You will also need an IBM Cloud API key. If you don't have one, follow the steps at https://cloud.ibm.com/docs/account?topic=account-userapikey&interface=ui#create_user_key to create one.

Deployment topology

(Architecture diagram)

The deployment creates the resources shown in the architecture diagram above.

Refer to Quotas and service limits and ensure that your IBM Cloud account has sufficient resource quotas available.

Refer to User access permissions to verify that your account has been assigned the necessary IBM Cloud IAM permissions to create the resources in the deployment.

Cloud Pak for Data services

As part of the deployment, any of the following services can be installed by enabling the corresponding variable (an illustrative sketch follows this list). For more information about available services, visit the Cloud Pak for Data services catalog.

  • Cloud Pak for Data Bedrock Services
  • Data Virtualization
  • Watson Knowledge Catalog
  • Watson Studio
  • Watson Machine Learning
  • Watson OpenScale
  • Cognos Dashboard Engine
  • Analytics Engine powered by Apache Spark
  • DataStage
  • Db2 Warehouse
  • Db2 OLTP
  • Decision Optimization
  • Cognos Analytics
  • SPSS Modeler
  • Master Data Management
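
Each of these services is switched on or off through its own variable in terraform.tfvars. The variable names below are illustrative assumptions only, not necessarily the repository's actual names; consult VARIABLES.md and vars.tf for the authoritative names and accepted values.

    # Illustrative sketch only -- verify the exact variable names in vars.tf
    watson_knowledge_catalog = "yes"
    watson_studio            = "yes"
    watson_machine_learning  = "yes"
    data_virtualization      = "no"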

Instructions

Link to video walkthrough

Building the Terraform environment container

It is recommended that these scripts be executed from a Docker container so that the required tools and packages are available. Docker can be installed for your system by following the Docker documentation. This deployment has been tested using Docker version 19.03.13.

To create multiple clusters, clone the repo again in a new directory and create a new container (with a --name other than 'my-container'). Do not bind multiple containers to the same host template directory.

  1. Clone this repo.

  2. Navigate to the repository's root directory - cd cp4d-deployment

  3. Run docker build . -t cpd-roks-terraform.

  4. Run docker run -d --name my-container --mount type=bind,source="$(pwd)",target=/root/templates cpd-roks-terraform.

The current directory on the host has been bind-mounted to ~/templates in the container. This allows file changes made on the host to be reflected in the container and vice versa.

Deploying Cloud Pak for Data

  1. Copy the contents of terraform.tfvars.template to terraform.tfvars. This file can be used to define values for variables. Refer to VARIABLES.md and vars.tf for a list of available variables and their usage; an illustrative sketch of this file follows these steps.

  2. Log in to your container with docker exec -it my-container bash --login.

  3. Navigate to the IBM Cloud templates directory - cd managed-openshift/ibmcloud/

  4. Run terraform init.

  5. Run terraform apply.
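
For reference, a minimal terraform.tfvars might look like the sketch below. The API-key, entitlement, region, and cluster-name variable names are assumptions for illustration; multizone and no_of_zones appear later in this README. Verify every name against VARIABLES.md and vars.tf before running terraform apply.

    # Illustrative sketch -- names marked "assumed" may differ in vars.tf
    ibmcloud_api_key = "<your IBM Cloud API key>"          # assumed name
    entitlement_key  = "<your Cloud Pak entitlement key>"  # assumed name
    region           = "us-east"                           # assumed name
    cluster_name     = "cp4d-cluster"                      # assumed name
    multizone        = true
    no_of_zones      = 3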

Securing your VPC and cluster

This deployment installs an OpenShift cluster in a VPC with permissive network access policies. To control traffic to your cluster, see Securing the cluster network.

Deploying in an existing VPC

These templates can be used to install Cloud Pak for Data in an existing VPC on your account by providing values for the following variables.

  • existing_vpc_id
  • existing_vpc_subnets — A list of subnet IDs in your VPC in which to install the cluster. Every subnet must belong to a different zone, so for a single-zone deployment supply a list containing only one subnet. Public gateways must be enabled on all provided subnets.
  • multizone and no_of_zones — Set these to match the number and zones of the subnets you provide.

When installing in an existing VPC, all other VPC configuration variables, such as enable_public_gateway, allowed_cidr_range, and acl_rules, are ignored.
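
For example, targeting an existing multizone VPC might look like the following fragment of terraform.tfvars, where all IDs are placeholders:

    existing_vpc_id      = "r014-xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"   # placeholder VPC ID
    existing_vpc_subnets = ["0717-aaaa", "0727-bbbb", "0737-cccc"]       # placeholder subnet IDs, one per zone
    multizone            = true
    no_of_zones          = 3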

Deploying in an existing OpenShift cluster

These templates can also deploy Cloud Pak for Data on an existing VPC Gen 2 OpenShift on IBM Cloud cluster. In addition to the values in the "Deploying in an existing VPC" section, provide values for the following variables.

  • existing_roks_cluster — Name of the cluster to deploy in. It is assumed that Portworx has not already been installed on this cluster. All worker nodes will be used.
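
Building on the existing-VPC fragment above, deploying into an existing cluster adds a single variable; the cluster name here is a placeholder:

    existing_roks_cluster = "my-existing-roks-cluster"   # placeholder cluster name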

Replacing or upgrading worker nodes

In VPC Gen 2 clusters, if you want to replace or upgrade a worker node that has block storage attached, use the following script instead of the IBM Cloud console: https://github.com/mumutyal/px-utils/blob/master/px_vpc_upgrade/vpc_upgrade_util.sh

usage: ./vpc_upgrade_util.sh <cluster-name> replace|upgrade <worker-id>

If you replace or upgrade a worker node from the IBM Cloud console, its block storage is detached and Portworx considers that node storageless; the data stored on the block storage is no longer available to the application.

IBM Cloud documentation for this limitation - https://cloud.ibm.com/docs/openshift?topic=openshift-portworx#portworx_limitations

Portworx volume detachment after destroy

The current implementation does not support automated detachment of Portworx volumes. Delete them from the IBM Cloud console after the cluster is deleted.

Troubleshooting

Open an Issue in this repo that describes the error.

Common errors

Resource provisioning can fail due to transient errors such as latency timeouts, tokens expiring prematurely, or cloud resources failing to stabilize. To let Terraform retry the failed resource and continue with the remainder of the deployment, run terraform apply again. Alternatively, you can run terraform destroy and start from the beginning.

  • Errors with docker commands

    Ensure that your system has the latest version of Docker installed.

  • Error: timeout while waiting for state to become 'Ready' for the resource module.roks.ibm_container_vpc_cluster.this

    This error happens when it takes longer than usual for the cluster ingress domain to be created.

    1. Open the IBM Cloud OpenShift clusters console and verify that the state of your cluster is "Normal". If there is an error, contact support or run terraform destroy and try again.
    2. Run terraform untaint module.roks.ibm_container_vpc_cluster.this to mark the resource as successful.
    3. Run terraform apply again. The deployment will continue from where it left off.
  • Unable to connect to the server: dial tcp: lookup c100-e.us-east.containers.cloud.ibm.com on 192.168.65.1:53: read udp 172.17.0.2:33170->192.168.65.1:53: i/o timeout for the resource module.portworx.ibm_resource_instance.portworx

    1. Run portworx/scripts/portworx_wait_until_ready.sh. If the script prints an error, run terraform destroy and try the deployment again.
    2. Else, run terraform untaint module.portworx.ibm_resource_instance.portworx to mark the resource as successful.
    3. Run terraform apply. The deployment will continue from where it left off.

Coming soon

  • Support for ODF (previously OCS) Storage
  • Support for application access restrictions based on allowed_cidr_range
  • Support for additional Cloud Pak for Data services
  • Support for IBM Key Protect volume encryption at deploy time