The NRM Data Catalogue is a service that helps users find and access relevant metadata for data assets within the Natural Resource Ministries (NRM). Relevant metadata includes information about lineage, storage, transformation logic, security classification, and availability of the data. The NRM Data Catalogue also generates data insights, column-level data profiling, and data quality reports, and links data assets to associated data models in the NRM Data Model library. Users are able to search for data based on specific criteria such as Ministry, business area, data custodian, schema/table/column names, keywords, or publication date.
Visit the DEV NRM Data Catalogue here (VPN connection required): http://nr-data-catalogue-dev.apps.emerald.devops.gov.bc.ca/
Visit the TEST NRM Data Catalogue here (VPN connection required): http://nr-data-catalogue-test.apps.emerald.devops.gov.bc.ca/
The NRM Data Catalogue leverages an open-source data catalogue tool called OpenMetadata. OpenMetadata is deployed in the Emerald cluster of the BCGov OpenShift Container Platform. Deployment details below.
Prerequisites:
- OpenShift CLI
- Helm CLI
Add OpenMetadata to your local Helm repositories:
helm repo add open-metadata https://helm.open-metadata.org/
Navigate to the 'openshift' folder then:
oc apply -f .
oc create secret generic airflow-db-secrets --from-literal=airflow-db-password=airflow_pass
oc create secret generic db-secrets --from-literal=openmetadata-db-password=openmetadata_password
oc create secret generic airflow-secrets --from-literal=openmetadata-airflow-password=admin
Note: Do not use the actual values above! See Vault for secrets. If in a new environment, you will also need to create oidc-secrets and postgres-secrets.
Source: https://github.com/open-metadata/openmetadata-helm-charts/tree/main/charts/deps
Navigate to the 'deps' chart folder then:
helm install openmetadata-dependencies .
Source: https://github.com/open-metadata/openmetadata-helm-charts/tree/main/charts/openmetadata
Once all the dependencies are running, navigate to the 'openmetadata' chart folder then:
helm install openmetadata .
Note: Sometimes old PVC and volumes will break new pods.
OpenSearch and PostgreSQL require modified Dockerfiles to work within the OpenShift restricted security context. The Dockerfiles can be found under charts/. These images are built automatically and pushed to the GHCR any time there is a push or PR to the main branch.
Usage examples:
docker pull ghcr.io/bcgov/nr-openmetadata-opensearch:main
docker pull ghcr.io/bcgov/nr-openmetadata-postgresql:main
Prerequisites:
- OpenShift CLI
- Helm CLI
helm repo update open-metadata
helm pull open-metadata/openmetadata-dependencies
helm pull open-metadata/openmetadata
To review all Helm chart modifications (i.e. differences between the OpenMetadata default config and the OpenShift restricted security context config), search this repo for "DF-NOTE:" annotations. Each of these files has been customized to work in OpenShift Emerald environment:
Note: Make sure to include all 4 values.yaml files in new deployment, match the environment for SSO config and callback URLs, and match the size of the PVCs to the persistence.
Source: https://github.com/open-metadata/openmetadata-helm-charts/tree/main/charts/deps
Navigate to the 'deps' chart folder then:
helm upgrade openmetadata-dependencies .
Source: https://github.com/open-metadata/openmetadata-helm-charts/tree/main/charts/openmetadata
Once all the dependencies are running, navigate to the 'openmetadata' chart folder then:
helm upgrade openmetadata .