ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Hits per page

hits 1 - 10 | 966 hits

Sorting

Online Resource

Multivariate Statistical Machine Learning Methods for Genomic Prediction (2022)

Montesinos López, Osval Antonio. ; Montesinos López, Abelardo. ; Crossa, José.

Cham :Springer International Publishing :

add to mindlist on the mindlist

Details

Keywords: Agriculture. ; Bioinformatics. ; Plant genetics. ; Agricultural genome mapping. ; Biometry. ; Agriculture. ; Bioinformatics. ; Plant Genetics. ; Agricultural Genetics. ; Biostatistics.

Description / Table of Contents: Preface -- Chapter 1 -- General elements of genomic selection and statistical learning -- Chapter. 2 -- Preprocessing tools for data preparation -- Chapter. 3 -- Elements for building supervised statistical machine learning models -- Chapter. 4 -- Overfitting, model tuning and evaluation of prediction performance -- Chapter. 5 -- Linear Mixed Models -- Chapter. 6 -- Bayesian Genomic Linear Regression -- Chapter. 7 -- Bayesian and classical prediction models for categorical and count data -- Chapter. 8 -- Reproducing Kernel Hilbert Spaces Regression and Classification Methods -- Chapter. 9 -- Support vector machines and support vector regression -- Chapter. 10 -- Fundamentals of artificial neural networks and deep learning -- Chapter. 11 -- Artificial neural networks and deep learning for genomic prediction of continuous outcomes -- Chapter. 12 -- Artificial neural networks and deep learning for genomic prediction of binary, ordinal and mixed outcomes -- Chapter. 13 -- Convolutional neural networks -- Chapter. 14 -- Functional regression -- Chapter. 15 -- Random forest for genomic prediction.

Abstract: This book is open access under a CC BY 4.0 license This open access book brings together the latest genome base prediction models currently being used by statisticians, breeders and data scientists. It provides an accessible way to understand the theory behind each statistical learning tool, the required pre-processing, the basics of model building, how to train statistical learning methods, the basic R scripts needed to implement each statistical learning tool, and the output of each tool. To do so, for each tool the book provides background theory, some elements of the R statistical software for its implementation, the conceptual underpinnings, and at least two illustrative examples with data from real-world genomic selection experiments. Lastly, worked-out examples help readers check their own comprehension. The book will greatly appeal to readers in plant (and animal) breeding, geneticists and statisticians, as it provides in a very accessible way the necessary theory, the appropriate R code, and illustrative examples for a complete understanding of each statistical learning tool. In addition, it weighs the advantages and disadvantages of each tool.

Type of Medium: Online Resource

Pages: XXIV, 691 p. 113 illus., 61 illus. in color. , online resource.

Edition: 1st ed. 2022.

ISBN: 9783030890100

URL: https://doi.org/10.1007/978-3-030-89010-0

DOI: 10.1007/978-3-030-89010-0

DDC: 630

Language: English

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

E-BOOKS (AWI ONLY)

Fulltext

Unknown

A Variational Bayes Genomic-Enabled Prediction Model with Genotype x Environment Interaction (2017)

Montesinos-Lopez, O. A., Montesinos-Lopez, A., Crossa, J., Montesinos-Lopez, J. C., Luna-Vazquez, F. J., Salinas-Ruiz, J., Herrera-Morales, J. R., Buenrostro-Mariscal, R.

Genetics Society of America (GSA)

In: G3: Genes, Genomes, Genetics

add to mindlist on the mindlist

Details

Publication Date: 2017-06-08

Description: There are Bayesian and non-Bayesian genomic models that take into account G x E interactions. However, the computational cost of implementing Bayesian models is high, and becomes almost impossible when the number of genotypes, environments, and traits is very large, while, in non-Bayesian models, there are often important and unsolved convergence problems. The variational Bayes method is popular in machine learning, and, by approximating the probability distributions through optimization, it tends to be faster than Markov Chain Monte Carlo methods. For this reason, in this paper, we propose a new genomic variational Bayes version of the Bayesian genomic model with G x E using half-t priors on each standard deviation (SD) term to guarantee highly noninformative and posterior inferences that are not sensitive to the choice of hyper-parameters. We show the complete theoretical derivation of the full conditional and the variational posterior distributions, and their implementations. We used eight experimental genomic maize and wheat data sets to illustrate the new proposed variational Bayes approximation, and compared its predictions and implementation time with a standard Bayesian genomic model with G x E. Results indicated that prediction accuracies are slightly higher in the standard Bayesian model with G x E than in its variational counterpart, but, in terms of computation time, the variational Bayes genomic model with G x E is, in general, 10 times faster than the conventional Bayesian genomic model with G x E. For this reason, the proposed model may be a useful tool for researchers who need to predict and select genotypes in several environments.

Electronic ISSN: 2160-1836

Topics: Biology

Published by Genetics Society of America (GSA)

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

A Bayesian Poisson-lognormal Model for Count Data for Multiple-Trait Multiple-Environment Genomic-Enabled Prediction (2017)

Montesinos-Lopez, O. A., Montesinos-Lopez, A., Crossa, J., Toledo, F. H., Montesinos-Lopez, J. C., Singh, P., Juliana, P., Salinas-Ruiz, J.

Genetics Society of America (GSA)

In: G3: Genes, Genomes, Genetics

add to mindlist on the mindlist

Details

Publication Date: 2017-05-06

Description: When a plant scientist wishes to make genomic-enabled predictions of multiple traits measured in multiple individuals in multiple environments, the most common strategy for performing the analysis is to use a single trait at a time taking into account genotype x environment interaction (G x E), because there is a lack of comprehensive models that simultaneously take into account the correlated counting traits and G x E. For this reason, in this study we propose a multiple-trait and multiple-environment model for count data. The proposed model was developed under the Bayesian paradigm for which we developed a Markov Chain Monte Carlo (MCMC) with noninformative priors. This allows obtaining all required full conditional distributions of the parameters leading to an exact Gibbs sampler for the posterior distribution. Our model was tested with simulated data and a real data set. Results show that the proposed multi-trait, multi-environment model is an attractive alternative for modeling multiple count traits measured in multiple environments.

Electronic ISSN: 2160-1836

Topics: Biology

Published by Genetics Society of America (GSA)

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

Prediction of Multiple-Trait and Multiple-Environment Genomic Data Using Recommender Systems (2018)

Montesinos-Lopez, O. A., Montesinos-Lopez, A., Crossa, J., Montesinos-Lopez, J. C., Mota-Sanchez, D., Estrada-Gonzalez, F., Gillberg, J., Singh, R., Mondal, S., Juliana, P.

Genetics Society of America (GSA)

In: G3: Genes, Genomes, Genetics

add to mindlist on the mindlist

Details

Publication Date: 2018-01-05

Description: In genomic-enabled prediction, the task of improving the accuracy of the prediction of lines in environments is difficult because the available information is generally sparse and usually has low correlations between traits. In current genomic selection, although researchers have a large amount of information and appropriate statistical models to process it, there is still limited computing efficiency to do so. Although some statistical models are usually mathematically elegant, many of them are also computationally inefficient, and they are impractical for many traits, lines, environments, and years because they need to sample from huge normal multivariate distributions. For these reasons, this study explores two recommender systems: item-based collaborative filtering (IBCF) and the matrix factorization algorithm (MF) in the context of multiple traits and multiple environments. The IBCF and MF methods were compared with two conventional methods on simulated and real data. Results of the simulated and real data sets show that the IBCF technique was slightly better in terms of prediction accuracy than the two conventional methods and the MF method when the correlation was moderately high. The IBCF technique is very attractive because it produces good predictions when there is high correlation between items (environment–trait combinations) and its implementation is computationally feasible, which can be useful for plant breeders who deal with very large data sets.

Electronic ISSN: 2160-1836

Topics: Biology

Published by Genetics Society of America (GSA)

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

A guide for kernel generalized regression methods for genomic-enabled prediction (2021)

Montesinos-López, Abelardo ; Montesinos-López, Osval Antonio ; Montesinos-López, José Cricelio ; [et al.]

Springer Nature

In: Heredity. 2021; 126(4): 577-596. Published 2021 Mar 01. doi: 10.1038/s41437-021-00412-1.

add to mindlist on the mindlist

Details

Publication Date: 2021-03-01

Description: The primary objective of this paper is to provide a guide on implementing Bayesian generalized kernel regression methods for genomic prediction in the statistical software R. Such methods are quite efficient for capturing complex non-linear patterns that conventional linear regression models cannot. Furthermore, these methods are also powerful for leveraging environmental covariates, such as genotype × environment (G×E) prediction, among others. In this study we provide the building process of seven kernel methods: linear, polynomial, sigmoid, Gaussian, Exponential, Arc-cosine 1 and Arc-cosine L. Additionally, we highlight illustrative examples for implementing exact kernel methods for genomic prediction under a single-environment, a multi-environment and multi-trait framework, as well as for the implementation of sparse kernel methods under a multi-environment framework. These examples are followed by a discussion on the strengths and limitations of kernel methods and, subsequently by conclusions about the main contributions of this paper.

Print ISSN: 0018-067X

Electronic ISSN: 1365-2540

Topics: Biology

Published by Springer Nature

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

Multivariate Bayesian Analysis of On‐Farm Trials with Multiple‐Trait and Multiple‐Environment Data (2019)

Montesinos‐López, Osval A. ; Montesinos‐López, Abelardo ; Hernández, Mateo Vargas ; [et al.]

Wiley

In: Agronomy Journal. 2019; 111(6): 2658-2669. Published 2019 Nov 01. doi: 10.2134/agronj2018.06.0362.

add to mindlist on the mindlist

Details

Publication Date: 2019-11-01

Print ISSN: 0002-1962

Electronic ISSN: 1435-0645

Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition

Published by Wiley

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

Modeling Genotype × Environment Interaction Using a Factor Analytic Model of On‐Farm Wheat Trials in the Yaqui Valley of Mexico (2019)

Hernández, Mateo Vargas ; Ortiz‐Monasterio, Iván ; Pérez‐Rodríguez, Paulino ; [et al.]

Wiley

In: Agronomy Journal. 2019; 111(6): 2647-2657. Published 2019 Nov 01. doi: 10.2134/agronj2018.06.0361.

add to mindlist on the mindlist

Details

Publication Date: 2019-11-01

Print ISSN: 0002-1962

Electronic ISSN: 1435-0645

Topics: Agriculture, Forestry, Horticulture, Fishery, Domestic Science, Nutrition

Published by Wiley

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

A Genomic Bayesian Multi-trait and Multi-environment Model (2016)

Montesinos-Lopez, O. A., Montesinos-Lopez, A., Crossa, J., Toledo, F. H., Perez-Hernandez, O., Eskridge, K. M., Rutkoski, J.

Genetics Society of America (GSA)

In: G3: Genes, Genomes, Genetics

add to mindlist on the mindlist

Details

Publication Date: 2016-09-09

Description: When information on multiple genotypes evaluated in multiple environments is recorded, a multi-environment single trait model for assessing genotype x environment interaction (G x E) is usually employed. Comprehensive models that simultaneously take into account the correlated traits and trait x genotype x environment interaction (T x G x E) are lacking. In this research, we propose a Bayesian model for analyzing multiple traits and multiple environments for whole-genome prediction (WGP) model. For this model, we used Half- $$\mathit{t}$$ priors on each standard deviation term and uniform priors on each correlation of the covariance matrix. These priors were not informative and led to posterior inferences that were insensitive to the choice of hyper-parameters. We also developed a computationally efficient Markov Chain Monte Carlo (MCMC) under the above priors, which allowed us to obtain all required full conditional distributions of the parameters leading to an exact Gibbs sampling for the posterior distribution. We used two real data sets to implement and evaluate the proposed Bayesian method and found that when the correlation between traits was high (〉0.5), the proposed model (with unstructured variance–covariance) improved prediction accuracy compared to the model with diagonal and standard variance–covariance structures. The R-software package Bayesian Multi-Trait and Multi-Environment (BMTME) offers optimized C++ routines to efficiently perform the analyses.

Electronic ISSN: 2160-1836

Topics: Biology

Published by Genetics Society of America (GSA)

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

Genomic-Enabled Prediction of Ordinal Data with Bayesian Logistic Ordinal Regression (2015)

Montesinos-Lopez, O. A., Montesinos-Lopez, A., Crossa, J., Burgueno, J., Eskridge, K.

Genetics Society of America (GSA)

In: G3: Genes, Genomes, Genetics

add to mindlist on the mindlist

Details

Publication Date: 2015-10-06

Description: Most genomic-enabled prediction models developed so far assume that the response variable is continuous and normally distributed. The exception is the probit model, developed for ordered categorical phenotypes. In statistical applications, because of the easy implementation of the Bayesian probit ordinal regression (BPOR) model, Bayesian logistic ordinal regression (BLOR) is implemented rarely in the context of genomic-enabled prediction [sample size ( n ) is much smaller than the number of parameters ( p )]. For this reason, in this paper we propose a BLOR model using the Pólya-Gamma data augmentation approach that produces a Gibbs sampler with similar full conditional distributions of the BPOR model and with the advantage that the BPOR model is a particular case of the BLOR model. We evaluated the proposed model by using simulation and two real data sets. Results indicate that our BLOR model is a good alternative for analyzing ordinal data in the context of genomic-enabled prediction with the probit or logit link.

Electronic ISSN: 2160-1836

Topics: Biology

Published by Genetics Society of America (GSA)

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

Unknown

Constitutive expression of transgenes encoding derivatives of the synthetic antimicrobial peptide BP100: impact on rice host plant fitness (2012)

Anna Nadal; Maria Montero; Nuri Company; Esther Badosa; Joaquima Messeguer; Laura Montesinos; Emilio Montesinos; Maria Pla

BioMed Central

In: BMC Plant Biology

add to mindlist on the mindlist

Details

Publication Date: 2012-09-05

Description: Background: The Biocontrol Peptide BP100 is a synthetic and strongly cationic a-helical undecapeptide with high, specific antibacterial activity against economically important plant-pathogenic bacteria, and very low toxicity. It was selected from a library of synthetic peptides, along with other peptides with activities against relevant bacterial and fungal species. Expression of the BP100 series of peptides in plants is of major interest to establish disease-resistant plants and facilitate molecular farming. Specific challenges were the small length, peptide degradation by plant proteases and toxicity to the host plant. Here we approached the expression of the BP100 peptide series in plants using BP100 as a proof-of-concept. Results: Our design considered up to three tandemly arranged BP100 units and peptide accumulation in the endoplasmic reticulum (ER), analyzing five BP100 derivatives. The ER retention sequence did not reduce the antimicrobial activity of chemically synthesized BP100 derivatives, making this strategy possible. Transformation with sequences encoding BP100 derivatives (bp100der) was over ten-fold less efficient than that of the hygromycin phosphotransferase (hptII) transgene. The BP100 direct tandems did not show higher antimicrobial activity than BP100, and genetically modified (GM) plants constitutively expressing them were not viable. In contrast, inverted repeats of BP100, whether or not elongated with a portion of a natural antimicrobial peptide (AMP), had higher antimicrobial activity, and fertile GM rice lines constitutively expressing bp100der were produced. These GM lines had increased resistance to the pathogens Dickeya chrysanthemi and Fusarium verticillioides, and tolerance to oxidative stress, with agronomic performance comparable to untransformed lines. Conclusions: Constitutive expression of transgenes encoding short cationic a-helical synthetic peptides can have a strong negative impact on rice fitness. However, GM plants expressing, for example, BP100 based on inverted repeats, have adequate agronomic performance and resistant phenotypes as a result of a complex equilibrium between bp100der toxicity to plant cells, antimicrobial activity and transgene-derived plant stress response. It is likely that these results can be extended to other peptides with similar characteristics.

Electronic ISSN: 1471-2229

Topics: Biology

Published by BioMed Central

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

PAPER CURRENT

S·F·X

Fulltext

hits 1 - 10 | 966 hits