Loading...
Thumbnail Image
Item

Statistical framework for calling allelic imbalance in high-throughput sequencing data

Buyan, A.
Meshcheryakov, G.
Safronov, V.
Abramov, S.
Boytsov, A.
Nozdrin, V.
Baulin, E. F.
Kolmykov, S.
Vierstra, J.
Kolpakov, F.
... show 2 more
Citations
Google Scholar:
Altmetric:
Abstract
High-throughput sequencing facilitates large-scale studies of gene regulation and allows tracing the associations of individual genomic variants with changes in gene regulation and expression. Compared to classic association studies, the assessment of an allelic imbalance at heterozygous variants captures functional variant effects with smaller sample sizes, higher sensitivity, and better resolution. Yet, identification of allele-specific variants from allelic read counts remains challenging due to data-dependent biases and overdispersion arising from technical and biological variability. We present MIXALIME, a novel computational framework for calling allele-specific variants in diverse omics data with a repertoire of statistical models accounting for read mapping bias and copy number variation. We benchmark MIXALIME with DNase-Seq, ATAC-Seq, and CAGE-Seq data, and we demonstrate that the allelic imbalance highlights causal variants in GWAS results. Finally, as a showcase of the large-scale practical application of MIXALIME, we present an atlas of variants exhibiting allele-specific chromatin accessibility, built from thousands of available datasets obtained from diverse cell types.
Affiliation
Institute of Protein Research, Russian Academy of Sciences, Pushchino, Russia. Life Improvement by Future Technologies (LIFT) Center, Moscow, Russia. Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia. Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia. Altius Institute for Biomedical Sciences, Seattle, WA, USA. Moscow Center for Advanced Studies, Moscow, Russia. International Institute of Molecular and Cell Biology in Warsaw, Warsaw, Poland. Department of Computational Biology, Sirius University of Science and Technology, Sirius, Krasnodar region, Russia. Bioinformatics Laboratory, Federal Research Center for Information and Computational Technologies, Novosibirsk, Russia. Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia. vsevolod.makeev@gmail.com. Moscow Center for Advanced Studies, Moscow, Russia. vsevolod.makeev@gmail.com. Institute of Biochemistry and Genetics, Ufa Federal Research Centre of the Russian Academy of Sciences, Ufa, Russia. vsevolod.makeev@gmail.com. Cancer Research UK National Biomarker Centre, University of Manchester, Manchester, UK. vsevolod.makeev@gmail.com. Institute of Protein Research, Russian Academy of Sciences, Pushchino, Russia. ivan.kulakovskiy@gmail.com. Life Improvement by Future Technologies (LIFT) Center, Moscow, Russia. ivan.kulakovskiy@gmail.com. Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow, Russia. ivan.kulakovskiy@gmail.com.
Description
Date
2025
Publisher
Keywords
Type
Article
Citation
Buyan A, Meshcheryakov G, Safronov V, Abramov S, Boytsov A, Nozdrin V, et al. Statistical framework for calling allelic imbalance in high-throughput sequencing data. Nature communications. 2025 Feb 18;16(1):1739. PubMed PMID: 39966391. Pubmed Central PMCID: PMC11836314. Epub 2025/02/19. eng.
Journal Title
Journal ISSN
Volume Title
Embedded videos