Download PDF

Genome Biology

Publication date: 2025-04-04
Volume: 26
Publisher: BioMed Central Ltd.

Author:

Raimondi, Daniele
Verplaetse, Nora ; Passemiers, Antoine ; Jans, Deborah Sarah ; Cleynen, Isabelle ; Moreau, Yves

Keywords:

Science & Technology, Life Sciences & Biomedicine, Biotechnology & Applied Microbiology, Genetics & Heredity, HILBERT-SPACES REGRESSION, GENETIC ARCHITECTURE, ASSOCIATION, SELECTION, SUPPORT, PATHWAY, HUMANS, LOCUS, Phenotype, Genomics, Animals, Cattle, Models, Genetic, Polymorphism, Single Nucleotide, Multifactorial Inheritance, Humans, Epistasis, Genetic, Support Vector Machine, Machine Learning, Quantitative Trait, Heritable, STADIUS-25-106, 05 Environmental Sciences, 06 Biological Sciences, 08 Information and Computing Sciences, Bioinformatics

Abstract:

BACKGROUND: Genomic prediction encompasses the techniques used in agricultural technology to predict the genetic merit of individuals towards valuable phenotypic traits. It is related to Genome Interpretation in humans, which models the individual risk of developing disease traits. Genomic prediction is dominated by linear mixed models, such as the Genomic Best Linear Unbiased Prediction (GBLUP), which computes kinship matrices from SNP array data, while Genome Interpretation applications to clinical genetics rely mainly on Polygenic Risk Scores. RESULTS: In this article, we exploit the positive semidefinite characteristics of the kinship matrices that are conventionally used in GBLUP to propose a novel Genomic Multiple Kernel Learning method (GMKL), in which the multiple kinship matrices corresponding to Additive, Dominant, and Epistatic Inheritance Mechanisms are used as kernels in support vector machines, and we apply it to both worlds. We benchmark GMKL on simulated cattle phenotypes, showing that it outperforms the classical GBLUP predictors for genomic prediction. Moreover, we show that GMKL ranks the kinship kernels representing different inheritance mechanisms according to their compatibility with the observed data, allowing it to produce hypotheses on the normally unknown inheritance mechanisms generating the target phenotypes. We then apply GMKL to the prediction of two inflammatory bowel disease cohorts with more than 6500 samples in total, consistently obtaining results suggesting that epistasis might have a relevant, although underestimated role in inflammatory bowel disease (IBD). CONCLUSIONS: We show that GMKL performs similarly to GBLUP, but it can formulate biological hypotheses about inheritance mechanisms, such as suggesting that epistasis influences IBD.