Bi, Y., Walia, H., Obata, T., & Morota, G. (2024). Genomic prediction of metabolic content in rice grain in response to warmer night conditions. Crop Science, e21435. https://doi.org/10.1002/csc2.21435
It has been argued that metabolic content can be used as a selection marker to accelerate crop improvement because metabolic profiles in crops are often under genetic control. Evaluating the role of genetics in metabolic variation is a long-standing challenge. Rice, one of the world's most important staple crops, is known to be sensitive to recent increases in nighttime temperatures. Quantification of metabolic levels can help measure rice responses to high night temperature (HNT) stress. However, the extent of metabolic variation that can be explained by regression on whole-genome molecular markers remains to be evaluated. In the current study, we generated metabolic profiles for mature grains from a subset of rice diversity panel accessions grown under optimal and HNT conditions. Metabolite accumulation was low to moderately heritable, and genomic prediction accuracies of metabolite accumulation were within the expected upper limit set by their genomic heritability estimates. Genomic heritability estimates were slightly higher in the control group than in the HNT group. Genomic correlation estimates for the same metabolite accumulation between the control and HNT conditions indicated the presence of genotype-by-environment interactions. Reproducing kernel Hilbert spaces regression and image-based deep learning improved prediction accuracy, suggesting that some metabolite levels are under non-additive genetic control. Joint analysis of multiple metabolite accumulation simultaneously was effective in improving prediction accuracy by exploiting correlations among metabolites. The current study serves as an important first step in evaluating the cumulative effect of markers in influencing metabolic variation under control and HNT conditions.
- .Rmd file Including metabolite and genotype data cleaning
- .R file Using sommer package to calculate heritability for metabolites.
- .Rmd file Drawing heritability plots.
- .R file Running Single trait GBLUP in cluster.
- .Rmd file Drawing Single trait GBLUP plots.
- .Rmd file Selecting suitable bandwidth for RKHS.
- .R file Runing Single trait RKHS in cluster.
- .R file Running multi-trait genomic correlation.
- .Rmd file Drawing multi-trait genomic correlation plots.
- .Rmd file Factorial analysis to identify underlying latent factors controlling metabolites.
- .R file Running MegaLMM for genomic prediction.
- .R file Running MegaLMM for RKHS.
- .Rmd file Drawing barplot, density plots for MegaLMM genomic prediction model.
- .Rmd file Drawing genomic correlation density plot.
Figure 8: Percentage difference of gain in prediction accuracy for multi-trait genomic best linear unbiased prediction (MegaLMM-G) and multi-trait reproducing kernel Hilbert spaces regression (MegaLMM-GK) relative to single-trait genomic best linear unbiased prediction (A). Density plots of percentage difference are shown for MegaLMM-G (B) and MegaLMM-GK (C).
- .ipynb Shows examples about how to convert SNP tabular data into SNP images.
- .py file Loop converting for SNPs in all chromosomes.
- .py file Convolutional neural network with multiple branches.
- .Rmd file Drawing barplot to compare performance of all deep learning models and RKHS.