Download PDF

Genome Biology

Publication date: 2025-03-07
Volume: 26
Publisher: BioMed Central Ltd.

Author:

Passemiers, Antoine
Tuveri, Stefania ; Jatsenko, Tatjana ; Vanderstichele, Adriaan ; Busschaert, Pieter ; Coosemans, An ; Timmerman, Dirk ; Tejpar, Sabine ; Vandenberghe, Peter ; Lambrechts, Diether ; Raimondi, Daniele ; Vermeesch, Joris Robert ; Moreau, Yves

Keywords:

Science & Technology, Life Sciences & Biomedicine, Biotechnology & Applied Microbiology, Genetics & Heredity, COPY NUMBER, LYMPHOMA, SERUM, Humans, Cell-Free Nucleic Acids, Sequence Analysis, DNA, DNA Copy Number Variations, Neoplasms, High-Throughput Nucleotide Sequencing, Deep Learning, Bias, STADIUS-25-98, VOEWIAI02#58101618, 1SB2721N|1SB2723N#55499964, 1S74420N|1S74422N#55289583, G080217N#54021349, S003422N#56473652, 12Y5623N#57061784, C14/22/125#57006238, 05 Environmental Sciences, 06 Biological Sciences, 08 Information and Computing Sciences, Bioinformatics

Abstract:

Cell-free DNA (cfDNA) is a rich source of biomarkers for various pathophysiological conditions. Preanalytical variables, such as the library preparation protocol or sequencing platform, are major confounders of cfDNA analysis. We present DAGIP, a novel data correction method that builds on optimal transport theory and deep learning, which explicitly corrects for the effect of such preanalytical variables and can infer technical biases. Our method improves cancer detection and copy number alteration analysis by alleviating the sources of variation that are not of biological origin. It also enhances fragmentomic analysis of cfDNA. DAGIP allows the integration of cohorts from different studies.