Inspect diffs of generated YAML files between old and new libraries for any functional changes
Check papers/*.yaml for functional equivalence
Check volumes.yaml for functional equivalence
Check people/*.yaml for functional equivalence
Check venues.yaml for functional equivalence
Check events.yaml for functional equivalence
The sorting algorithm for associated volumes behaves slightly differently, but usually more correctly (previously, Findings wasn't always moved above workshops for joint events, and the main volumes of EMNLP 2018 were not moved to the top).
Check sigs.yaml for functional equivalence
Years in keys are not being quoted; url: null is omitted.
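The equivalence checks above could be sketched roughly as follows. This is a minimal illustration, not the check actually used on the branch: it assumes both versions of a file have already been loaded into plain Python objects (for example with a YAML loader), and the names `normalize`, `functionally_equivalent`, and `IGNORABLE_NULL_KEYS` are hypothetical. It treats the two differences noted above (quoted vs. unquoted year keys, and an omitted `url: null`) as non-functional.

```python
# Keys whose explicit null value is considered equivalent to the key being
# absent. Only "url" is mentioned in the notes above; this set is an assumption.
IGNORABLE_NULL_KEYS = {"url"}


def normalize(value):
    """Recursively normalize loaded data so cosmetic differences disappear."""
    if isinstance(value, dict):
        return {
            # Unquoted years load as int, quoted years as str: compare as str.
            str(key): normalize(item)
            for key, item in value.items()
            # Drop "url: null" so it compares equal to a missing "url" key.
            if not (item is None and str(key) in IGNORABLE_NULL_KEYS)
        }
    if isinstance(value, list):
        return [normalize(item) for item in value]
    return value


def functionally_equivalent(old, new):
    """Return True if two loaded documents differ only cosmetically."""
    return normalize(old) == normalize(new)
```

Any remaining difference reported by such a check (for instance, a changed volume order) would still need manual inspection of the diff.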
Performance optimizations
Bibliography strings were so far generated with citeproc-py. Replacing this with a custom Python function cuts the generation time from four minutes to a few seconds in my local testing.
YAML serialization (even with CDumper) is significantly slower than JSON serialization with msgspec in my testing (by a factor of at least 20). Since Hugo also supports JSON for data files, we should probably switch the build pipeline to write JSON files instead.
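To reproduce a serialization comparison like the one above, a small timing harness along these lines may help. It is only a sketch: `time_serializer`, `make_sample`, and `compare` are hypothetical helpers, the sample data is made up and much smaller than the real Anthology data, and only the standard-library `json` module is used here so that the block runs without extra dependencies.

```python
import json
import timeit


def time_serializer(serialize, data, repeat=5):
    """Return the best wall-clock time in seconds over `repeat` single calls."""
    return min(timeit.repeat(lambda: serialize(data), number=1, repeat=repeat))


def make_sample(n=1000):
    """Build made-up records; the real data files are larger and richer."""
    return {
        f"paper-{i}": {
            "title": f"Title {i}",
            "year": 2000 + i % 25,
            "authors": ["A", "B"],
        }
        for i in range(n)
    }


def compare(serializers, data):
    """Time each named serializer on `data`; return timings sorted fastest first."""
    timings = {name: time_serializer(fn, data) for name, fn in serializers.items()}
    return dict(sorted(timings.items(), key=lambda pair: pair[1]))


if __name__ == "__main__":
    for name, seconds in compare({"json": json.dumps}, make_sample()).items():
        print(f"{name}: {seconds:.4f}s")
```

If PyYAML (built with libyaml) and msgspec are installed, entries such as `lambda d: yaml.dump(d, Dumper=yaml.CDumper)` and `msgspec.json.encode` can be added to the `serializers` dict to compare the two approaches discussed above.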
A while ago, we merged the new Python library into this repo, but the build pipeline still uses the legacy code.
Porting the build pipeline to the new library now happens on the https://github.com/acl-org/acl-anthology/tree/build-pipeline-with-new-library branch.
Roadmap
data/yaml/papers/* files with new library
data/yaml/volumes.yaml with new library
data/yaml/people/* files with new library
data/yaml/venues.yaml with new library
data/yaml/events.yaml with new library
data/yaml/sigs.yaml with new library