These tutorial materials are for the 2018 Research Data & Access Summit.
Chicago Hilton, March 21-23, 2018
Presentation: "NDS Labs Workbench: A Scalable Platform for Research Data Access, Education, and Training".
This section will walk you through the basics of Labs Workbench, including signing in, viewing the catalog, and adding and starting applications.
Open a browser and go to https://www.workshop1.nationaldataservice.org
On the landing page, select one of the Sign In links. Enter the provided credentials. After logging in, you'll be taken to an empty dashboard. Select "Add an Application" or the Catalog link to view the available applications.
This catalog contains a number of applications. Clowder, Mongo, and RabbitMQ are all building blocks for the Clowder application. Jupyter, PostgreSQL Studio, RStudio Server, and Visualization tools are all example environments for accessing the DataDrivenAg data.
From the catalog page, select the "Add" link on the Jupyter card, then select "View" or "Applications" to go to your dashboard.
You will see a stopped instance of a Jupyter environment. Select the "Launch" button to start it.
This will start a private instance of Jupyter with some tutorial materials and sample data. Browse to "work / tutorials / height_prediction" and load the "Height_Prediction_Example.ipynb" notebook.
This is a sample notebook provided by a machine learning collaborator on the TERRA-REF project to demonstrate how to build predictive models for plant heights from sample data.
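The notebook contains the full workflow; as a rough illustration of the general idea only (this is not the notebook's actual code, and the file name and column names below are made up), a minimal height-prediction model in Python might look like this:

```python
# A minimal sketch of a height-prediction model, for illustration only.
# The CSV name and column names are hypothetical, not taken from the notebook.
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

df = pd.read_csv("sample_heights.csv")       # hypothetical sample data
X = df[["day_of_year", "canopy_cover"]]      # hypothetical predictor columns
y = df["canopy_height"]                      # hypothetical target column

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
model = LinearRegression().fit(X_train, y_train)
print("R^2 on held-out data:", model.score(X_test, y_test))
```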
Feel free to explore the Jupyter environment. Browse back to "work / shared / terraref / sites / ua-mac" to view some of the data available to users. Beware -- some of the files and directories are quite large!
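You can also explore the same directory from a notebook cell. A small sketch, assuming the "work" directory is mounted under your home directory as in the tutorial environment:

```python
# List the top-level contents of the shared TERRA-REF data from a notebook cell.
# Assumes the "work" directory is mounted under your home directory.
from pathlib import Path

data_dir = Path.home() / "work" / "shared" / "terraref" / "sites" / "ua-mac"
for entry in sorted(data_dir.iterdir()):
    print(entry.name)
```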
Clowder is an active data management system, intended to support collaboration and sharing while collecting metadata along the way. Clowder has an extensible "extractor" framework that can be used to extract metadata or process files automatically. We'll just scratch the surface with this example.
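If you're curious what an extractor looks like under the hood, extractors are typically small Python services built on the pyclowder library. A simplified skeleton, roughly following pyclowder's extractor pattern (details vary by version; the class name and metadata content below are placeholders):

```python
# Rough skeleton of a Clowder extractor using pyclowder (details vary by version).
import logging

import pyclowder.files
from pyclowder.extractors import Extractor


class ExampleExtractor(Extractor):
    def __init__(self):
        Extractor.__init__(self)
        self.setup()
        logging.getLogger("pyclowder").setLevel(logging.INFO)

    def process_message(self, connector, host, secret_key, resource, parameters):
        # Called whenever a matching file is uploaded; resource describes the file.
        file_id = resource["id"]
        local_path = resource["local_paths"][0]

        # ... analyze local_path, then attach the results as metadata ...
        content = {"example_key": "example_value"}  # placeholder result
        metadata = self.get_metadata(content, "file", file_id, host)
        pyclowder.files.upload_metadata(connector, host, secret_key, file_id, metadata)


if __name__ == "__main__":
    ExampleExtractor().start()
```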
From the Catalog, select "Add" on the Clowder card.
On the dashboard select "Image Preview extractor" and "Add service".
You'll be prompted to configure the service, but you can accept the defaults and save.
Select "Launch" and wait for the services to start. This will take a minute (or 3). Once started, select the endpoint link.
This is now a private Clowder sandbox for you to explore. Select the "Sign Up" link and enter your email (you'll go through a validation step, so check your mail).
Click on the link in your email and enter your user information to Sign Up.
Sign in and select "Create > Dataset" to create a sample dataset.
Enter some sample metadata.
Since we've enabled the preview extractor, select "Add Files" to upload an image (feel free to try the one in this repo).
View the dataset. Note the ability for users to add custom metadata at the dataset or file level.
Select the file link and note the extraction events. The extractors leave a provenance trail of all actions taken.
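The same extraction metadata is also available programmatically through Clowder's REST API. A hedged sketch using the requests library (the URL, API key, and file id below are placeholders, and field names may differ slightly across Clowder versions):

```python
# Fetch the metadata that extractors have attached to a file via Clowder's REST API.
# The endpoint URL, API key, and file id below are placeholders.
import requests

CLOWDER_URL = "https://your-clowder-endpoint"  # the endpoint link from your dashboard
API_KEY = "your-api-key"                       # generated under your Clowder profile
FILE_ID = "your-file-id"                       # visible in the file page URL

resp = requests.get(
    f"{CLOWDER_URL}/api/files/{FILE_ID}/metadata.jsonld",
    params={"key": API_KEY},
)
resp.raise_for_status()
for record in resp.json():
    agent = record.get("agent", {})
    print(agent.get("name", "unknown agent"), "->", record.get("content"))
```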