Skip to content

Processing attachments and corrections

Matt Post edited this page Dec 19, 2019 · 3 revisions

This page describes the process of handling attachments, corrections, and errata. This is a time-consuming task that is ideally done at least quarterly.

Attachments

  1. Create an ingestion folder in the shared Dropbox folder, e.g., Dropbox/Anthology/ingests/2019/2019-12-18-attachments.

  2. Download the Microsoft form data used to gather the attachments. Use Microsoft Excel or a compatible program to export it to a UTF-8-encoded CSV file. Save this file as attachments.csv

  3. Process the attachments using the script

    acl-anthology/bin/add_attachments.py attachments.csv
    

    This will go through each of the attachments, download them, do some minimal verification, and log everything to add_attachments.log. For successful attachments, it will edit the XML in the Anthology repo and put the file in a local mirror of the Anthology attachments, under ~/anthology-files/attachments/. For failed attachments, it will create a file add_attachments.log.$ANTH_ID.txt. This file contains an email you can manually send to the person (first line is email, second is subject, rest is body).

  4. Commit the repo changes after manually checking them and create a PR.

  5. Sync the locally mirrored files to the Anthology:

    cd ~/anthology-files
    rsync -azve ssh attachments/ aclweb:anthology-files/attachments/
    

    Where aclweb is an ssh alias to the Anthology host.

Corrections

[Coming soon]