archivesspace-duke-scripts

Various scripts to process ArchivesSpace EAD exports, interact with the ArchivesSpace API, or query backend ASpace SQL database for reports

accessions_acquired_FYreport_final.sql - SQL script that queries ASpace SQL database for information about accessions acquired in a given time period (typically a fiscal year).

accessions_without_acquisition_type.sql - SQL script that queries ASpace SQL database for list of accessions missing an acquisition type value. Used for periodic data QC.

accessions_without_extent.sql - SQL script that queries ASpace SQL database for list of accessions missing an extent statement of any kind. Used for periodic data QC.

accessions_without_extent_in_linear_feet.sql - SQL script that queries ASpace SQL database for list of accessions missing an extent statement in linear feet. Used for periodic data QC.

accessions_without_extent_number.sql - SQL script that queries ASpace SQL database for list of accessions missing an extent number. Used for periodic data QC.

extent_accessioned_by_acquisition_type.sql - SQL script that queries ASpace SQL database and generates a report showing extent (number and linear_feet) of accessions by acqusition type (gift, purchase, etc.).

extent_accessioned_by_research_center.sql - SQL script that queries ASpace SQL database and generates a report showing extent (number and linear_feet) of accessions by Rubenstein Library research center (Hartman, Bingham, Franklin, etc.)

EAD_ASpace_export_post-processor.xsl - Fixes various validation issues with Duke's ArchivesSpace EAD exports, most related to namespace conflicts carried over from Archivist Toolkit (ns2: vs. xlink:)

EADrenamer.py - Python script to batch rename EAD exports based on value of element.

EADrenamer_published_only.py - Python script to batch rename EAD exports based on value of element. The script will only process EADs where eadheader/@findaidstatus="published". Files are renamed and moved to the specified source directory

asEADexport_public.py - Python script to batch export all EADs from a specified repository where finding_aid_status=published and save those EADs to a specified location using the value as the filename. Can configure export parameters (include DAOs, unpublished, etc.). Usernames, passwords, and repository URLs have been removed. Adapted from: https://gist.github.com/helrond/1ef5b5bd47b47bd52f02

asEADexport_eadid_input.py - Python script to batch export EADs based on eadid input. Prompts for a list of eadid values separated with commas (e.g. eadid1,eadid2,eadid3). Will export EADs for all eadids provided if marked "published" in ASpace. Script relies on a config file (local_settings.cfg) that contains repository URL, username, password.

For example:

[ArchivesSpace]
baseURL:http://aspace-backend-url.lib.duke.edu
repository:1
user:admin
password:adminpassword

aspace_dig_guide_creator.xsl - XSLT that extracts metadata and refIDs from ASpace EAD exports and creates a digitization guide spreadsheet (TSV) suitable for use by Duke's Digital Production Center (DPC). Digital Object IDs and URIs can be added to the spreadsheet by DPC during digitization. This script should be modified based on the level of description in a given collection (file vs. item) and the metadata available for any given component in a collection (dates, subjects, etc.). The completed digitization guide can serve as the basis for batch loading digital object records in ArchivesSpace and linking those digital objects as instances to existing archival object (component) records using duke_update_archival_object.py

duke_update_archival_object.py - Python script that reads TSV file produced from aspace_dig_guide_creator.xsl script (including Digital Object IDs and URIs) and batch loads digital object records in ArchivesSpace and links them as instances to existing archival object component records based on the archival object's refID value. This script should be modified to specify input TSV file location and filename/location for output CSV file. It should also be modified if the column position of the Digital Object IDs or URIs changes in the input TSV file. This script is adapted from: https://github.com/djpillen/bentley_scripts/blob/master/update_archival_object.py

duke_update_archival_object_args.py - Python script that reads TSV file produces from aspace_dig_guide_creator.xsl script and batch loads digital object records in ArchivesSpace and links them as instances to existing archival object component records based on the archival object's refID value. In other words, it functions similarly to duke_update_archival_object.py. It differs in that it accepts arguments for the input TSV file and output CSV file, as well as arguments for the Aspace User ID and password. It also will read file version use statements from the input TSV. It still must be modified to map the correct column position of the Digital Object IDs, URIs, and File Version Use Statements. This script was adapated from duke_update_archival_object.py which was itself adapted from: https://github.com/djpillen/bentley_scripts/blob/master/update_archival_object.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

archivesspace-duke-scripts

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
EAD_ASpace_export_post-processor.xsl		EAD_ASpace_export_post-processor.xsl
EADrenamer.py		EADrenamer.py
EADrenamer_published_only.py		EADrenamer_published_only.py
README.md		README.md
accessions_acquired_FYreport_final.sql		accessions_acquired_FYreport_final.sql
accessions_without_acquisition_type.sql		accessions_without_acquisition_type.sql
accessions_without_extent.sql		accessions_without_extent.sql
accessions_without_extent_in_linear_feet.sql		accessions_without_extent_in_linear_feet.sql
accessions_without_extent_number.sql		accessions_without_extent_number.sql
asEADexport_eadid_input.py		asEADexport_eadid_input.py
asEADexport_public.py		asEADexport_public.py
aspace_dig_guide_creator-with-daos.xsl		aspace_dig_guide_creator-with-daos.xsl
aspace_dig_guide_creator.xsl		aspace_dig_guide_creator.xsl
duke_archival_object_metadata_adder.py		duke_archival_object_metadata_adder.py
duke_update_archival_object.py		duke_update_archival_object.py
duke_update_archival_object_args.py		duke_update_archival_object_args.py
duke_update_archival_object_args_multiplefileversions.py		duke_update_archival_object_args_multiplefileversions.py
duke_update_archival_object_args_pword.py		duke_update_archival_object_args_pword.py
extent_accessioned_by_acquisition_type.sql		extent_accessioned_by_acquisition_type.sql
extent_accessioned_by_research_center.sql		extent_accessioned_by_research_center.sql

NYULibraries/archivesspace-duke-scripts

Folders and files

Latest commit

History

Repository files navigation

archivesspace-duke-scripts

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages