Skip to content

BOX Retrieval Service

Eric Lopatin edited this page Jul 25, 2023 · 2 revisions

The Merritt team has established a common workflow for facilitating the retrieval of files from BOX. Once obtained from BOX, files are staged for ingest into Merritt.

When is this workflow recommended?

Many Merritt depositors provide an accessible URL that allows the Merritt System to download content and ingest that content into the Merritt repository.

When a depositor has content that resides in BOX, supplying these URLs for individual files can be very labor intensive when they are obtained through the BOX user interface. Furthermore, the files must also reside in a folder that is made public. This workflow might be a good choice for users with numerous files in BOX (public or private) that they would like to ingest into Merritt.

Overview of the BOX Retrieval Deposit Process

The temporary storage utilized by this workflow creates incremental costs for the Merritt system. The steps within this workflow should be implemented expeditiously to reduce costs and to ensure that digital content has been properly preserved.

The retrieval service is suited for deposits of 1GB-500GB, as its connection to BOX appears to be throttled.

  • The Merritt Team configures its BOX retrieval service with a BOX user account that will be identified to the depositor via an email address.
  • The depositor adds this account as a Viewer of the shared BOX folder in which the files reside.
  • The Merritt Team runs the retrieval service and downloads all images to a local Merritt staging area.
  • Batch manifests are created whose entries include file URLs that point to the staging area (where the files from BOX were download to).
  • Manifests are submitted to Merritt for ingest.