Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

xml driver needs paging #428

Open
jacobthill opened this issue Aug 29, 2023 · 1 comment
Open

xml driver needs paging #428

jacobthill opened this issue Aug 29, 2023 · 1 comment
Assignees

Comments

@jacobthill
Copy link
Contributor

jacobthill commented Aug 29, 2023

The xml driver currently only harvests the first page of metadata. Shahre Farang is a collection with multiple pages e.g. https://shahrefarang.com/en/feed/?paged=2

Newcastle and Harvard SCW need xml paging

@jacobthill jacobthill added the bug Something isn't working label Aug 29, 2023
@edsu
Copy link
Contributor

edsu commented Aug 29, 2023

Since they are using Atom for linking, it would be great if they could use paging by adding a <link rel=next ..> like:

<atom:link href="https://shahrefarang.com/en/feed/?paged=2" rel="next" type="application/rss+xml" />

Then our XML driver could follow it's nose to the next URL if paging is on? Otherwise I guess we could add some URL pattern to the intake configuration.

Of course there would be work to do in the driver to follow the link, whichever way we choose to go.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants