Darwin Core biodiversity data pipelines

This repository contains data production pipelines for building Darwin Core datasets for publication in the Global Biodiversity Information Facility, with permanent archiving in Zenodo

EcoTaxa

Datasets

Notice: These are pre-production URLs, for testing purposes only

Workflow

Export EcoTaxa data as TSV (using DOI export with images)
Publish untreated TSV and images to Zenodo
Create Darwin Core occurrences in NDJSON from EcoTaxa TSV, using ecotaxa-darwin-core
Create unique Darwin Core sampling events in NDJSON by reducing the occurrences
@todo Merge with other/authoritative event metadata (eg. sampling volumes)
Create lists of ignored (not-living) and rejected (non-Eukaryota) objects
Create lists of rejected events (non-unique or invalid/non-consistent metadata)
Finish local processing by executing Darwin Core pipelines below

gbif-no-darwin-core$ ./bin/ecotaxa-pipeline 1420

Darwin Core pipelines

Taxonomy

Create taxonomy NDJSON by extracting occurrence taxa and checking against GBIF Species API using WoRMS
Create lists of possible taxonomy issues (not found or incertae sedis)

Metadata

Extract time coverage (start/end, years, months, days, dates)
Extract space coverage (bounding box/depths)
Extract sampling protocols
@todo Create EML XML

Dependencies

@todo

Project

This project was co-funded by GBIF Norway, see Data management plan for further details.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bin		bin
config		config
.gitignore		.gitignore
LICENSE		LICENSE
dmp-zooplankton-gbif-2020.md		dmp-zooplankton-gbif-2020.md
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Darwin Core biodiversity data pipelines

EcoTaxa

Datasets

Workflow

Darwin Core pipelines

Taxonomy

Metadata

Archive

Dependencies

Project

About

Releases

Packages

Languages

License

akvaplan-niva/gbif-no-darwin-core

Folders and files

Latest commit

History

Repository files navigation

Darwin Core biodiversity data pipelines

EcoTaxa

Datasets

Workflow

Darwin Core pipelines

Taxonomy

Metadata

Archive

Dependencies

Project

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages