Arcadia Year 2 Work Plan

Links

Arcadia project general workplan

Liberation of data from scholarly publications

Specifically, we will:

Create a list of journals to be targeted (see possiblities)
- We will process daily the selected journals, with the goal to automate detection of new articles, import, processing, preparing QC reports, upload and dissemination
- We will process the back issues of the selected journals using among other these criteria:
  - The ranking of how often a journal is cited in bibliographic references and treatment citations will be used to select additional journals for conversion. This approach will help build a comprehensive, linked corpus of data.
  - Journals that contribute significantly to reach the 50% goal.
  - Journals that contribute to build communities for users of the data.
Develop web scraping capacity and implement it for a select number of journals
Improve online data quality control and correction tool
Set up ingestion of taxpub based articles
Extend template based extraction of 70K additional treatments including those from the most relevant taxonomy journals
Implement the import of JATS based articles (e.g. Order out of chaos, Flora der Schweiz)
Extract author affiliation from publication in GGI workflow

Infrastructure

All public-facing, production data will be migrated to Zenodo. This will include articles, their metadata, bibliographic references, the extracted images and treatments with links back to the source articles and other data or sources (eg GBIF). The existing treatments will be migrated to Zenodo as well. Specifically, we will:

Migrate existing TreatmenBank data to Zenodo
- resolve the issue with correct markup of treatmentCitations
Create a data import policy to Zenodo from TreatmentBank
Implement service in Zenodo/BLR to start automated processing of articles in TreatmentBank and update deposit accordingly
Adjust the extraction process so new output goes to Zenodo

Interfaces, Discovery tools and APIs

Since the extracted data and the APIs to access and use them will be new to the community-at-large, it will be helpful to have documentation, tutorials, and even applications that utilize these resources to demonstrate their potential. Specifically, we will:

Create sample applications to demonstrate discovery and analytical capabilities of the API
- applications in other languages such as R
- applications for mobile platforms
Make documentation main topic at Arcadia Spring Sprint
Activate the user side editing tool

Outreach

We will continue to actively publicize and promote BLR in the scientific community with the aim of it becoming the preferred stop for taxonomic discovery, education and research. Specifically, we will:

Enhance the website with rich information discovery tools
Brand and design the website
Refine and test the UI
Provide a daily summary of new data liberated on Twitter and Facebook
- instead of only providing the number of new treatments added daily, include a representative image from one of the treatments. see example
- consider tweeting more than just once a day
- broaden the audience of the existing Twitter account @plazi_treat
Develop a roster of training resources and workshops
- create screencasts
- create training plan
- conduct workshops colocated with other events
  - Feb, Warszawa, Poland
  - Mar, São Paulo, Brazil
  - Sep, Washington DC, USA
  - Oct, San Sebastian, España
    - send proposal to Force11
Attend at least three major conferences, two in Europe and one in the United States or elsewhere
- see list of upcoming meetings 2020
Publish a scholarly publication describing our work
- a journal article introducing and describing a taxonomic treatment

Provide feedback

Saved searches