-
Notifications
You must be signed in to change notification settings - Fork 287
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switch to aclanthology.org #1278
Comments
I strongly endorse this. I think the team has made a few change to canonical URLs without much (any?) negative repercussions, so this makes sense. I'll be very happy if the team can make this as the canonical version because this was the original intent of registering this domain under ACL auspices over a decade ago. |
Here are some TODOs:
After merging:
Here is the rule we can use at https://aclweb.org/anthology for general redirects:
|
Wasn't sure whether to bring this up here or open a separate issue about this. Google has started showing results from aclanthology.org in it's search results. For papers on aclweb.org, it shows the last modified/updated date correctly. For papers from the new site, it shows something like 4 days ago (when it last crawled it I guess). I've attached screenshots below. Being able to look at the year is useful in getting to know when a paper is from. Anything that can be done to address this in the future? |
We deliver all metadata we can (though there is some more to come in #1407, which might fix the extracted text) . Maybe google stores the date it first found the site? |
Yes, I'm guessing this is what is happening. Though it might be that for new files, Google uses metadata from the file's timestamp itself. In that case we could manually set those dates to, say, the ingestion date. However, I'm not sure we'll have bandwidth to look into this. |
Sometime this year, I am hoping we can switch to aclanthology.org as our primary hosting site. There are technical reasons—ACL IT has been pushing for this, since we often come close to our hosting and bandwidth limits, and the Anthology is a key piece of that. But I also like it better aesthetically; aclanthology.org (versus aclweb.org/anthology) is more parsimonious (16 versus 20 characters); and using a top-level domain reflects its status.
The main question is whether we change the canonical URL as well. My thinking is that yes, we do, with permanent 301 redirects for papers existing at the time of the switch.
I welcome discussion.
The text was updated successfully, but these errors were encountered: