Skip to content

Arcadia Year 2 Work Plan

punkish edited this page Aug 22, 2022 · 27 revisions

Links

Liberation of data from scholarly publications

Specifically, we will:

  • Create a list of journals to be targeted (see possiblities)
    • We will process daily the selected journals, with the goal to automate detection of new articles, import, processing, preparing QC reports, upload and dissemination
    • We will process the back issues of the selected journals using among other these criteria:
      • The ranking of how often a journal is cited in bibliographic references and treatment citations will be used to select additional journals for conversion. This approach will help build a comprehensive, linked corpus of data.
      • Journals that contribute significantly to reach the 50% goal.
      • Journals that contribute to build communities for users of the data.
  • Develop web scraping capacity and implement it for a select number of journals
  • Improve online data quality control and correction tool
  • Set up ingestion of taxpub based articles
  • Extend template based extraction of 70K additional treatments including those from the most relevant taxonomy journals
  • Implement the import of JATS based articles (e.g. Order out of chaos, Flora der Schweiz)
  • Extract author affiliation from publication in GGI workflow

Infrastructure

All public-facing, production data will be migrated to Zenodo. This will include articles, their metadata, bibliographic references, the extracted images and treatments with links back to the source articles and other data or sources (eg GBIF). The existing treatments will be migrated to Zenodo as well. Specifically, we will:

Interfaces, Discovery tools and APIs

Since the extracted data and the APIs to access and use them will be new to the community-at-large, it will be helpful to have documentation, tutorials, and even applications that utilize these resources to demonstrate their potential. Specifically, we will:

  • Create sample applications to demonstrate discovery and analytical capabilities of the API
    • applications in other languages such as R
    • applications for mobile platforms
  • Make documentation main topic at Arcadia Spring Sprint
  • Activate the user side editing tool

Outreach

We will continue to actively publicize and promote BLR in the scientific community with the aim of it becoming the preferred stop for taxonomic discovery, education and research. Specifically, we will:

  • Enhance the website with rich information discovery tools

  • Brand and design the website

  • Refine and test the UI

  • Provide a daily summary of new data liberated on Twitter and Facebook

    • instead of only providing the number of new treatments added daily, include a representative image from one of the treatments. see example
    • consider tweeting more than just once a day
    • broaden the audience of the existing Twitter account @plazi_treat
  • Develop a roster of training resources and workshops

    • create screencasts
    • create training plan
    • conduct workshops colocated with other events
      • Feb, Warszawa, Poland
      • Mar, São Paulo, Brazil
      • Sep, Washington DC, USA
      • Oct, San Sebastian, España
  • Attend at least three major conferences, two in Europe and one in the United States or elsewhere

  • Publish a scholarly publication describing our work

    • a journal article introducing and describing a taxonomic treatment