-
Notifications
You must be signed in to change notification settings - Fork 0
/
README.Rmd
62 lines (43 loc) · 2.19 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
---
output: github_document
bibliography: man/bibliography.bib
---
<!-- README.md is generated from README.Rmd. Please edit that file -->
```{r, include = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "man/figures/README-",
out.width = "100%"
)
```
# muddy_db <img src="man/figures/README-hex.png" align="right" alt="" width="120" />
<!-- badges: start -->
[![Lifecycle: experimental](https://img.shields.io/badge/lifecycle-experimental-orange.svg)](https://www.tidyverse.org/lifecycle/#experimental)
![R-CMD-check](https://github.com/TracyRage/muddy_db/workflows/R-CMD-check/badge.svg)
[![DOI](https://zenodo.org/badge/330902563.svg)](https://zenodo.org/badge/latestdoi/330902563)
<!-- badges: end -->
### Description
**muddy_db** represents a biologically-oriented mud volcano database.
It aggregates mud volcano specific terminology and taxonomy, which were mined
from open-access articles, available in the [S2ORC](https://github.com/allenai/s2orc/)
database, [CC BY-NC 2.0, unmodified](https://creativecommons.org/licenses/by-nc/2.0/) by [@lo-wang-2020-s2orc]. We used [ScispaCy](https://github.com/allenai/scispacy)
[@neumann_2019_scispacy] models and [ETE3](https://github.com/etetoolkit/ete) [@cepas_2016]
library to check taxonomy-flavored tokens against [NCBI Taxonomy](ftp://ftp.ncbi.nlm.nih.gov/pub/taxonomy/) database.
Check muddy_db web app: [muddy_db](https://muddy-db.shinyapps.io/muddy_db/)
Check our mining pipeline: [muddy_mine](https://github.com/TracyRage/muddy_mine)
Check the published [PeerJ article](https://doi.org/10.7717/peerj.12463).
Cite this work `Remizovschi A, Carpa R. 2021. Biologically-oriented mud volcano database: muddy_db. PeerJ 9:e12463 https://doi.org/10.7717/peerj.12463`.
### Data
**muddy_db** includes the following data:
* Bacterial and archaeal taxonomy (phylum, class, order, family, genus)
* Chemistry (inorganic ions, hydrocarbons)
* Geology (geological periods, minerals)
* Mud volcano specific data (microbial consortia, metabolic pathways etc.)
* Methods (amplified genes, analytics)
### Install locally
1. Install package
`remotes::install_github('TracyRage/muddy_db')`
2. Run **muddy_db**
`muddy::run_app()`
### References