These tutorial materials are for the 2018 Research Data & Access Summit.
Chicago Hilton, March 21-23, 2018
Presentation: "NDS Labs Workbench: A Scalable Platform for Research Data Access, Education, and Training".
This section will walk you through the basics of Labs Workbench, including signing in, viewing the catalog, and adding and starting applications.
Open a browser and go to https://www.workshop1.nationaldataservice.org
On the landing page, select one of the Sign In links. Enter the provided credentials. After logging in, you'll be taken to an empty dashboard. Select "Add an Application" or the Catalog link to view the available applications.
This catalog contains a number of applications. Clowder, Mongo, and RabbitMQ are all building blocks for the Clowder application. Jupyter, PostgreSQL Studio, RStudio Server, and Visualization tools are all example environments for accessing the DataDrivenAg data.
From the catalog page, select the "Add" link on the Jupyter card, then select "View" or "Applications" to go to your dashboard.
You will see a stopped instance of a Jupyter environment. Select the "Launch" button to start it.
This will start a private instance of Jupyter with some tutorial materials and sample data. Browse to "work / tutorials / height_prediction" and load the "Height_Prediction_Example.ipynb" notebook.
This is a sample notebook provided by a machine learning collaborator on the TERRA-REF project to demonstrate how to build predictive models for plant heights from sample data.
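The notebook contains the full workflow; as a rough illustration of the general idea only (this is not the notebook's actual code, and the file name and column names below are made up), a minimal height-prediction model in Python might look like this:

```python
# A minimal sketch of a height-prediction model, for illustration only.
# The CSV name and column names are hypothetical, not taken from the notebook.
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

df = pd.read_csv("sample_heights.csv")       # hypothetical sample data
X = df[["day_of_year", "canopy_cover"]]      # hypothetical predictor columns
y = df["canopy_height"]                      # hypothetical target column

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
model = LinearRegression().fit(X_train, y_train)
print("R^2 on held-out data:", model.score(X_test, y_test))
```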
Feel free to explore the Jupyter environment. Browse back to "work / shared / terraref / sites / ua-mac" to view some of the data available to users. Beware -- some of the files and directories are quite large!
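You can also explore the same directory from a notebook cell. A small sketch, assuming the "work" directory is mounted under your home directory as in the tutorial environment:

```python
# List the top-level contents of the shared TERRA-REF data from a notebook cell.
# Assumes the "work" directory is mounted under your home directory.
from pathlib import Path

data_dir = Path.home() / "work" / "shared" / "terraref" / "sites" / "ua-mac"
for entry in sorted(data_dir.iterdir()):
    print(entry.name)
```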
Clowder is an active data management system, intended to support collaboration and sharing while collecting metadata along the way. Clowder has an extensible "extractor" framework that can be used to extract metadata or process files automatically. We'll just scratch the surface with this example.
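If you're curious what an extractor looks like under the hood, extractors are typically small Python services built on the pyclowder library. A simplified skeleton, roughly following pyclowder's extractor pattern (details vary by version; the class name and metadata content below are placeholders):

```python
# Rough skeleton of a Clowder extractor using pyclowder (details vary by version).
import logging

import pyclowder.files
from pyclowder.extractors import Extractor


class ExampleExtractor(Extractor):
    def __init__(self):
        Extractor.__init__(self)
        self.setup()
        logging.getLogger("pyclowder").setLevel(logging.INFO)

    def process_message(self, connector, host, secret_key, resource, parameters):
        # Called whenever a matching file is uploaded; resource describes the file.
        file_id = resource["id"]
        local_path = resource["local_paths"][0]

        # ... analyze local_path, then attach the results as metadata ...
        content = {"example_key": "example_value"}  # placeholder result
        metadata = self.get_metadata(content, "file", file_id, host)
        pyclowder.files.upload_metadata(connector, host, secret_key, file_id, metadata)


if __name__ == "__main__":
    ExampleExtractor().start()
```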
From the Catalog, select "Add" on the Clowder card.
On the dashboard select "Image Preview extractor" and "Add service".
You'll be prompted to configure the service, but you can accept the defaults and save.
Select "Launch" and wait for the services to start. This will take a minute (or 3). Once started, select the endpoint link.
This is now a private Clowder sandbox for you to explore. Select the "Sign Up" link and enter your email (you'll go through a validation step, so check your mail).
Click on the link in your email and enter your user information to Sign Up.
Sign in and select "Create > Dataset" to create a sample dataset.
Enter some sample metadata.
Since we've enabled the preview extractor, select "Add Files" to upload an image (feel free to try the one in this repo).
View the dataset. Note the ability for users to add custom metadata at the dataset or file level.
Select the file link and note the extraction events. The extractors leave a provenance trail of all actions taken.
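The same extraction metadata is also available programmatically through Clowder's REST API. A hedged sketch using the requests library (the URL, API key, and file id below are placeholders, and field names may differ slightly across Clowder versions):

```python
# Fetch the metadata that extractors have attached to a file via Clowder's REST API.
# The endpoint URL, API key, and file id below are placeholders.
import requests

CLOWDER_URL = "https://your-clowder-endpoint"  # the endpoint link from your dashboard
API_KEY = "your-api-key"                       # generated under your Clowder profile
FILE_ID = "your-file-id"                       # visible in the file page URL

resp = requests.get(
    f"{CLOWDER_URL}/api/files/{FILE_ID}/metadata.jsonld",
    params={"key": API_KEY},
)
resp.raise_for_status()
for record in resp.json():
    agent = record.get("agent", {})
    print(agent.get("name", "unknown agent"), "->", record.get("content"))
```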