Skip to content

Commit

Permalink
Merge pull request #329 from ptsefton/use-cases-2024-07
Browse files Browse the repository at this point in the history
Use cases 2024 07
  • Loading branch information
elichad authored Jul 18, 2024
2 parents 7773eee + b1a400b commit daa558e
Show file tree
Hide file tree
Showing 11 changed files with 856 additions and 47 deletions.
2 changes: 1 addition & 1 deletion docs/_data/featured.yml
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
- page_id: life-monitor
- page_id: arkisto
- page_id: tlcmap
- page_id: ldaca
- page_id: autosubmit
- page_id: workflowhub
Expand Down
Binary file added docs/assets/img/ghap-screenshot.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/img/ldaca-screenshot.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/img/rrkive.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/img/tlcmap.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
14 changes: 7 additions & 7 deletions docs/pages/use_cases/arkisto.md
Original file line number Diff line number Diff line change
@@ -1,20 +1,20 @@
---
page_id: arkisto
redirect_from:
- /in-use/arkisto
title: Arkisto
description: Arkisto uses RO-Crate for packaging data objects in the 3 uses cases described below, Modern PARADISEC, UTS Research Data Repository and UTS Cultural Datasets. As part of these use-cases they have been developing or enhancing their tooling to facilitate their use of RO-Crate
description: Arkisto was a project website that aimed to outline a principled approach to research data management.
url: https://arkisto-platform.github.io/
image: arkisto.png
domains: []
tasks: [data_handling]
roles: [data_steward, information_architect, repository_manager]
---

## Arkisto

The basis of Arkisto is that the long-term preservability of well-described data is always the first consideration.
Data on an Arkisto deployment is alway available on disc (or object storage) with a complete description independently of any services such as websites or APIs. Once the data is safe and well described, Arkisto has a flexible model for how data can be accessed using a variety of services.
Arkisto is built on top of Research Object Crate (RO-Crate) and the Oxford Common File System Layout (OCFL).
With Arkisto there is no messy data migration.
Arkisto was a project website that aimed to outline a principled approach to research data management. The site is no longer being maintained.

The work done on Arkisto has been developed into a set of protocols: Protocols for Implementing Long-term Archival Repositories Services (PILARS) <http://w3id.org/ldac/pilars>. See the [RRKive](rrkive) and [PILARS](pilars) pages for more information.

Arkisto was built on top of Research Object Crate (RO-Crate) and the Oxford Common File System Layout (OCFL).

<img src="assets/img/arkisto.png" alt="Arkisto logo" width="300px">
431 changes: 397 additions & 34 deletions docs/pages/use_cases/ldaca.md

Large diffs are not rendered by default.

253 changes: 248 additions & 5 deletions docs/pages/use_cases/paradisec.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,14 +2,257 @@
page_id: paradisec
redirect_from:
- /in-use/paradisec
title: PARADISEC
description: Modern PARADISEC demonstrates the use of RO-Crate to describe the collections and items
url: https://arkisto-platform.github.io/case-studies/paradisec/
title: "PARADISEC"
description: Pacific and Regional Archive for Digital Sources in Endangered Cultures (PARADISEC) holds 16,100 hours of audio recordings and 2,800 hours of video recordings that might otherwise have been lost. These recordings are of performance, narrative, singing, and other oral tradition. This amounts to over 220 terabytes, and represents 1,370 languages, mainly from the Pacific region.
url: https://www.paradisec.org.au/
domains: [language_text]
tasks: [data_handling]
roles: [information_architect, repository_manager]
---

# Modern PARADISEC
# PARADISEC: Pacific and Regional Archive for Digital Sources in Endangered Cultures

[Modern PARADISEC](https://arkisto-platform.github.io/case-studies/paradisec/) demonstrates the use of RO-Crate to describe the collections and items. The demonstrator includes an elastic search service and a webserver but the key feature is that it keeps working with only the filesystem and a webserver.


## PARADISEC and the Language Data Commons of Australia

From 2022 - 2028 PARADISEC is working with the national project the [Language Data Commons of Australia (LDACA)] to build accessible language material from Australia and the Pacific region. This work allows us to plan more digitisation projects and to add textual transcripts and summaries to audio items in the collection.

PARADISEC has an architecture which ensures that all data is stored in commodity storage with metadata adjacent to the files. This way of working has proved to be a low-cost and robust way to maintain an archive and this has been an inspiration for the [PILARS: Protocols for Implementing Long-term Archival Repository Services]. PARADISEC is currently undergoing an update where metadata is being migrated to RO-Crate which will be stored alongside materials.

PARADISEC uses the same API as the [Language Data Commons of Australia (LDACA)] which can serve RO-Crate.

[Language Data Commons of Australia (LDACA)]: ldaca
[PILARS: Protocols for Implementing Long-term Archival Repository Services]: pilars


## Example Crate

```
{
"@context": [
"https://w3id.org/ro/crate/1.2-DRAFT/context",
{
"@vocab": "http://schema.org/"
},
"http://purl.archive.org/language-data-commons/context.json",
{
"Geometry": "http://www.opengis.net/ont/geosparql#Geometry",
"asWKT": "http://www.opengis.net/ont/geosparql#asWKT"
},
"https://w3id.org/ldac/context"
],
"@graph": [
{
"@id": "https://catalog.paradisec.org.au/repository/RL1/011",
"@type": [
"Data",
"Object",
"RepositoryObject"
],
"conformsTo": {
"@id": "https://w3id.org/ldac/profile#Object"
},
"additionalType": "item",
"identifier": [
{
"@id": "#identifier_domain"
},
{
"@id": "#identifier_id"
},
{
"@id": "#identifier_itemIdentifier"
},
{
"@id": "#identifier_doi"
},
{
"@id": "#identifier_collectionIdentifier"
}
],
"name": "Singing and Enchanting, Conversations on Traditional tribal battles",
"description": "Singing and enchanting by women and general conversations with group of men mainly on traditional feuds and tribal battles by warring groups.\r\n(Steven Gagau, May 2018)",
"contentLocation": [
{
"@id": "#place-Kiriwina"
}
],
"dateCreated": "2018-05-09",
"dateModified": "2019-11-20",
"datePublished": "2019-11-20",
"publisher": {
"@id": "http://nla.gov.au/nla.party-811611"
},
"private": false,
"inLanguage": [
{
"@id": "#language-kij"
}
],
"subjectLanguages": [
{
"@id": "#language-kij"
}
],
"countries": [
{
"@id": "#country-Papua New Guinea"
}
],
"license": {
"@id": "#license-3"
},
"memberOf": {
"@id": "https://catalog.paradisec.org.au/repository/RL1"
},
"bornDigital": false,
"digitisedOn": "2018-04-27",
"external": false,
"languageAsGiven": "Kiriwina",
"metadataExportable": true,
"tapesReturned": false,
"hasPart": [
{
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.mp3"
},
{
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.wav"
}
],
"translator": [
{
"@id": "#person-4871"
}
]
},
{
"@id": "ro-crate-metadata.json",
"@type": "CreativeWork",
"conformsTo": {
"@id": "https://w3id.org/ro/crate/1.2-DRAFT"
},
"about": {
"@id": "https://catalog.paradisec.org.au/repository/RL1/011"
}
},
{
"@id": "http://nla.gov.au/nla.party-595489",
"@type": "Organisation",
"name": "Australian National University"
},
{
"@id": "#identifier_domain",
"@type": "PropertyValue",
"name": "domain",
"value": "paradisec.org.au"
},
{
"@id": "#identifier_id",
"@type": "PropertyValue",
"name": "id",
"value": "RL1-011"
},
{
"@id": "#identifier_itemIdentifier",
"@type": "PropertyValue",
"name": "itemIdentifier",
"value": "011"
},
{
"@id": "#identifier_doi",
"@type": "PropertyValue",
"name": "doi",
"value": "10.4225/72/5afda71574a1c"
},
{
"@id": "#identifier_collectionIdentifier",
"@type": "PropertyValue",
"name": "collectionIdentifier",
"value": "RL1"
},
{
"@id": "#country-Papua New Guinea",
"@type": "Country",
"code": "PG",
"name": "Papua New Guinea"
},
{
"@id": "#license-3",
"@type": "CreativeWork",
"name": "Closed (subject to the access condition details)"
},
{
"@id": "http://nla.gov.au/nla.party-811611",
"@type": "Person",
"familyName": "Lawton",
"givenName": "Ralph",
"name": "Ralph Lawton",
"email": "ralph.lawton@anu.edu.au"
},
{
"@id": "#person-4871",
"@type": "Person",
"familyName": "Hull",
"givenName": "Grace",
"name": "Grace Hull"
},
{
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.mp3",
"@type": "File",
"contentSize": 17315968,
"dateCreated": "2018-05-11",
"dateModified": "2018-05-18",
"encodingFormat": "audio/mpeg",
"name": "RL1-011-A.mp3",
"doi": "10.4225/72/5afda751b05d1",
"bitrate": 127718,
"duration": 1084.63,
"channels": 2,
"sampleRate": 44100
},
{
"@id": "https://catalog.paradisec.org.au/repository/RL1/011/RL1-011-A.wav",
"@type": "File",
"contentSize": 625002484,
"dateCreated": "2018-05-11",
"dateModified": "2018-05-18",
"encodingFormat": "audio/vnd.wav",
"name": "RL1-011-A.wav",
"doi": "10.4225/72/5afda74dd11d2",
"bitrate": 4609917,
"duration": 1084.62,
"channels": 2,
"sampleRate": 96000
},
{
"@id": "#geo-151.0,-8.6-151.2,-8.4",
"@type": "Geometry",
"asWKT": "POLYGON((151.0 -8.4, 151.2 -8.4, 151.2 -8.6, 151.0 -8.6, 151.0 -8.4))"
},
{
"@id": "#geo-150.294,-8.879-151.2,-8.32",
"@type": "Geometry",
"asWKT": "POLYGON((150.294 -8.32, 151.2 -8.32, 151.2 -8.879, 150.294 -8.879, 150.294 -8.32))"
},
{
"@id": "#place-Kiriwina",
"@type": "Place",
"geo": {
"@id": "#geo-151.0,-8.6-151.2,-8.4"
},
"name": "Kiriwina"
},
{
"@id": "#language-kij",
"@type": "Language",
"code": "kij",
"location": {
"@id": "#geo-150.294,-8.879-151.2,-8.32"
},
"name": "Kilivila"
}
]
}
```
26 changes: 26 additions & 0 deletions docs/pages/use_cases/pilars.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
---
page_id: pilars
redirect_from:
- /in-use/arkisto
title: "PILARS"
description: PILARS is a set of Protocols for Implementing Long-term Archival Repository Services
url: https://w3id.org/ldac/pilars/
image:
domains: []
tasks: [data_handling]
roles: [data_steward, information_architect, repository_manager]
---

# PILARS

The Protocols for Implementing Long-term Archival Repository Services (PILARS, <http://w3id.org/ldac/pilars>) are inspired by the continuing success of the technical approach taken over two decades by [PARADISEC](paradisec). PARADISEC houses cultural heritage material from more than 1360 languages with standard metadata, with data stored in commodity services (initially files on disk, now objects in a cloud storage service), and metadata adjacent to the data. PILARS was developed through work with the Language Data Commons of Australia to generalise the PARADISEC approach to other disciplines.

PILARS is aimed at IT practitioners, archivists, librarians, researchers and infrastructure managers involved in long-term data management. The protocols are intended to be complementary to the existing practices and principles of those disciplines.

## Rationale

In a research context it is important to be able to support the FAIR principles, ensuring that data is well described by metadata, is identified with persistent identifiers, and that shared services with good governance are in place to store interoperable data, to make it findable and provide appropriate access controls.

These protocols could form the basis for design, evaluation or procurement of archival repository-services, but also allow for data custodians to begin organising data in a format ready for archiving and digital preservation as long as they have access to some kind of commodity storage, by using a range of tools.

The PILARS Protocols call for the implementation of a linked-data metadata standard for describing data; RO-Crate represents one option for this.
23 changes: 23 additions & 0 deletions docs/pages/use_cases/rrkive.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
---
page_id: rrkive
redirect_from:
- /in-use/arkisto
title: RRkive
description: RRKive is a website aimed to outline a principled approach to research data management with guidance on data storage and metadata
url: https://rrkive.org/
image: rrkive.png
domains: []
tasks: [data_handling]
roles: [data_steward, information_architect, repository_manager]
---

# RRKive


The RRKive site, managed by the Language Data Commons of Australia collects resources related to implementing Archival Repository ([RRKive](https://rkive.org/)) solutions based on a set of Protocols know as the PILARS: Principles for Implementing Long-term Archival Repository Services. https://w3id.org/ldac/pilars. RRkive is built on RO-Crate as a way to describe data objects for long term archiving.

RRKive is a replacement for [Arkisto](arkisto).



![RRKive logo is the word RRKive with a parrot sitting on it](assets/img/rrkive.png)
Loading

0 comments on commit daa558e

Please sign in to comment.