Skip to content

Latest commit

 

History

History
141 lines (124 loc) · 8.11 KB

software.md

File metadata and controls

141 lines (124 loc) · 8.11 KB
layout title
default
Software

Welcome to ESGF Software and Services

The Earth System Grid Federation (ESGF) is a collaboration that develops, deploys and maintains software infrastructure for the management, dissemination, and analysis of model output and observational data. Below are the different data access interfaces and software tools. You can install and configure all the tools or a subset depending on your needs.

Search data

ESGF Software Stack: for the Node Administrator

ESGF Data and Index/Identity Node

  • Description:
    • The ESGF Data Node software stack enables sites hosting earth system data to make it available to the community over several transfer protocols including http(s). Index nodes enable search for hosted data via data publishing to the index, and these nodes include a search API and web frontend. Identity nodes manage user accounts. All these services together consitute a "Full" ESGF installation. These nodes are installed using the popular Ansible automation platform using our esgf-ansible collection of playbooks.
  • Use case:
    • I want to install a data and/or index/IdP node software stack using the current architecture
    • I want to upgrade my existing node software stack to the latest supported service versions
  • New and returning installations:
  • Requirements, Setup and Usage documentation * https://esgf.github.io/esgf-ansible/intro/intro.html
  • Basic Prerequisite:
    • The ESGF software stack requires Linux RedHat Enterprise or Centos 7 distributions and administrators have full sudo privileges to root access
    • The services are meant to run on webserver-grade hardware. For data nodes, storage holding your data to share must be mounted on the node.
    • See the main documentation site for more information
  • Source repository on github
  • Issues: (bug reporting)
  • Installation email list:

ESGF Docker (beta)

Metagrid User Interface

Deprecated User Interface (CoG) Frontend

ESGF data publisher

ESG publisher (esg-publisher)

  • Description:
    • The ESG Publisher (esgcet python package) enables data publishers to push references of their data on their site’s data node to an ESGF Index for use in search and retrieval
  • Use Cases:
    • I want to publish a new dataset to ESGF
    • I want to update an existing dataset that I published on ESGF
    • I want to retract/delete a dataset that I published from ESGF
  • Main Page: (user documentation)
  • Prerequisite:
    • Publishers to ESGF must have an existing Data Node installed at their site.
  • Next generation publisher: (v5 Alpha version)
    • This version is compatible with the current and next-generation ESGF Archtectures
    • https://esg-publisher.readthedocs.io/
    • The Next-gen (v5) Publisher can be run external to the Data Node, but the data to be published must be locally accessible on your linux file system.
  • Issues:
  • Publication working team mailing list:

For Data Preparation, our collaborators at IPSL provide the Pre-publication Tools for a number of ongoing ESGF data projects.

Data Analysis tools

Community Data Analysis Tools (CDAT)

ESGF Compute end-user API (esgf-compute-api)

ESGF Compute Node Web Processing Service (ECN WPS)

Misc software and documentation

CMIP6 administrators and publishers

PCMDI has produced the Data Node Managers guide for CMIP6 data specifically here: https://pcmdi.llnl.gov/CMIP6/Guide/dataManagers.html

ESGF bulk data download tools

  • https://github.com/ESGF/sproket Sproket Download tool for command line data search and download, where you can specify search criteria and download data files in a single command.
  • http://prodiguer.github.io/synda/index.html From IPSL, Synda is an automated download service to manage massive replica copies of ESGF datasets. (suited for server administrators to operate)