Skip to content

nds-org/rdap2018

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Labs Workbench tutorial for RDAP 2018

These tutorial materials are for the 2018 Research Data & Access Summit.

Chicago Hilton, March 21-23, 2018

Presentation NDS Labs Workbench: A Scalable Platform for Research Data Access, Education, and Training.

Workbench basics

This section will walk you through the basics of Labs Workbench including signing in, viewing the catalog, adding and starting applications.

Open a browser and go to https://www.workshop1.nationaldataservice.org

On the landing page, select one of the Sign In links. Enter the provided credentials. After logging in, you'll be taken to an empty dashboard. Select "Add an Application" or the Catalog link to view the available applications.

This catalog contains a number of applications. Clowder, Mongo, and RabbitMQ are all building blocks for the Clowder application. Jupyter, PostgresSQL Studio, RStudio Server, and Visualization tools are all example environments for accessing the DataDrivenAg data.

Using Jupyter

From the catalog page, select the "Add" link on the Jupyter card, then select "View" or "Applications" to go to your dashboard.

You will see a stopped instance of a Jupyter environment. Select the "Launch" button to start it.

This will start a private instance of Jupyter with some tutorial materials and sample data. Browse to "work / tutorials / height_prediction" and load the "Height_Prediction_Example.ipynb" notebook.

This is a sample notebook provided by a machine learning collaborator on the TERRA-REF project to demonstrate how to build predictive models for plant heights from sample data.

Feel free to explore the Jupyter environment. Browse back to "work / shared / terraref / sites / ua-mac" to view some of the data available to users. Beware -- some of the files and directories are quite large!

Exploring Clowder

Clowder is an active data management, intended to support collaboration and sharing while collecting metadata along the way. Clowder has an extensible "extractor" framework that can be used to extract metadata or process files automatically. We'll just touch the surface with this example.

From the Catalog, select "Add" on the Clowder card. Landing

On the dashboard select "Image Preview extractor" and "Add service". Landing

You'll be prompted to configure the service, but you can accept the defaults and save.

Landing

Select launch and wait for the services to start. This will take a minute (or 3). Once started, select the endpoint link.

Landing

This is now a private Clowder sandbox for you to explore. Select the "Sign Up" link and enter your email (you'll go through a validation step, so check your mail)

Landing

Click on the link in your email and enter your user information to Sign Up. Landing

Sign in and select "Create > Dataset" to create a sample dataset.

Landing

Enter some sample metadata.

Landing

Since we've enable the preview extractor, select "Add Files" to upload an image (feel free to try the one in this repo). Landing

View the dataset. Note the ability for users to add custom metadata at the dataset or file level. Landing

Select the file link and note the extraction events. The extractors leave a provenance trail of all actions taken.

Landing

About

Tutorial materials for RDAP2018

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published