    Enhancing SNV identification in whole-genome sequencing data through the incorporation of known genetic variants into the minimap2 index

    Name:
    s12859-024-05862-y.pdf
    Size:
    2.488Mb
    Format:
    PDF
    Description:
    Found with Open Access Button
    Authors
    Egor, G.
    Artem, K.
    Maksim, B.
    Gaukhar, Z.
    Ekaterina, K.
    Vsevolod, Makeev
    Evgeny, K.
    Affiliation
    Cancer Research UK National Biomarker Centre, University of Manchester, Manchester, M20 4BX, UK.
    Issue Date
    2024
    
    Abstract
    Motivation: Alignment of reads to a reference genome sequence is one of the key steps in the analysis of human whole-genome sequencing data obtained through Next-generation sequencing (NGS) technologies. The quality of the subsequent steps of the analysis, such as the results of clinical interpretation of genetic variants or the results of a genome-wide association study, depends on the correct identification of the position of the read as a result of its alignment. The amount of human NGS whole-genome sequencing data is constantly growing. There are a number of human genome sequencing projects worldwide that have resulted in the creation of large-scale databases of genetic variants of sequenced human genomes. Such information about known genetic variants can be used to improve the quality of alignment at the read alignment stage when analysing sequencing data obtained for a new individual, for example, by creating a genomic graph. While existing methods for aligning reads to a linear reference genome have high alignment speed, methods for aligning reads to a genomic graph have greater accuracy in variable regions of the genome. The development of a read alignment method that takes into account known genetic variants in the linear reference sequence index allows combining the advantages of both sets of methods.

    Results: In this paper, we present the minimap2_index_modifier tool, which enables the construction of a modified index of a reference genome using known single nucleotide variants and insertions/deletions (indels) specific to a given human population. The use of the modified minimap2 index improves variant calling quality without modifying the bioinformatics pipeline and without significant additional computational overhead. Using the PrecisionFDA Truth Challenge V2 benchmark data (for HG002 short-read data aligned to the GRCh38 linear reference (GCA_000001405.15) with parameters k = 27 and w = 14) it was demonstrated that the number of false negative genetic variants decreased by more than 9500, and the number of false positives decreased by more than 7000 when modifying the index with genetic variants from the Human Pangenome Reference Consortium.
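    The idea of adding known variants to a linear minimizer index can be illustrated with a toy sketch. This is not the minimap2_index_modifier implementation: the function names (`minimizers`, `build_index`, `add_snv`, `seed_hits`) are hypothetical, k-mers are ordered lexicographically rather than by a hash of canonical k-mers as minimap2 does, only a single SNV is handled (no indels), and the small k and w are chosen for readability rather than the k = 27, w = 14 quoted above.

```python
from collections import defaultdict


def minimizers(seq, k, w, offset=0):
    """Return the set of (k-mer, position) minimizers of seq.

    For each window of w consecutive k-mers, keep the lexicographically
    smallest k-mer (leftmost on ties). Simplification: minimap2 orders
    hashed canonical k-mers instead.
    """
    kmers = [(seq[i:i + k], i + offset) for i in range(len(seq) - k + 1)]
    picked = set()
    for start in range(len(kmers) - w + 1):
        picked.add(min(kmers[start:start + w]))
    return picked


def build_index(ref, k, w):
    """Map each minimizer k-mer of the reference to its positions."""
    index = defaultdict(set)
    for kmer, pos in minimizers(ref, k, w):
        index[kmer].add(pos)
    return index


def add_snv(index, ref, pos, alt, k, w):
    """Add minimizers of the alternate allele around a known SNV.

    Only the region whose windows can contain a k-mer overlapping the
    variant is re-scanned. An SNV does not shift coordinates, so the
    new entries keep reference positions.
    """
    lo = max(0, pos - k - w + 2)
    hi = min(len(ref), pos + k + w - 1)
    local = ref[lo:pos] + alt + ref[pos + 1:hi]
    for kmer, p in minimizers(local, k, w, offset=lo):
        index[kmer].add(p)


def seed_hits(index, read, k, w):
    """Count read minimizers whose k-mer is present in the index."""
    return sum(1 for kmer, _ in minimizers(read, k, w) if kmer in index)


# Toy data (assumed for illustration only).
ref = "ACGTTGCATGACCGTAAGCT"
k, w = 5, 3

plain = build_index(ref, k, w)
modified = build_index(ref, k, w)
add_snv(modified, ref, pos=10, alt="T", k=k, w=w)  # known A>T variant

# A read sampled from a haplotype that carries the alternate allele.
read = ref[4:10] + "T" + ref[11:17]
print("seed hits, plain index:   ", seed_hits(plain, read, k, w))
print("seed hits, modified index:", seed_hits(modified, read, k, w))
```

    In this sketch the reference coordinates are unchanged, so a downstream pipeline that consumes alignment positions would not need modification; the only difference is that reads carrying the known allele can find seeds in the variant region.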
    Citation
    Egor G, Artem K, Maksim B, Gaukhar Z, Ekaterina K, Vsevolod M, et al. Enhancing SNV identification in whole-genome sequencing data through the incorporation of known genetic variants into the minimap2 index. BMC bioinformatics. 2024 JUL 13;25(1). PubMed PMID: WOS:001267092300001. English.
    Journal
    BMC Bioinformatics
    URI
    http://hdl.handle.net/10541/627135
    PubMed ID
    39003441
    Language
    en
    Collections
    All Paterson Institute for Cancer Research


    Related articles

    • Calling known variants and identifying new variants while rapidly aligning sequence data.
    • Authors: VanRaden PM, Bickhart DM, O'Connell JR
    • Issue date: 2019 Apr
    • Fast and SNP-aware short read alignment with SALT.
    • Authors: Quan W, Liu B, Wang Y
    • Issue date: 2021 Aug 25
    • Fast and memory efficient approach for mapping NGS reads to a reference genome.
    • Authors: Kumar S, Agarwal S, Ranvijay
    • Issue date: 2019 Apr
    • Fast read alignment with incorporation of known genomic variants.
    • Authors: Guo H, Liu B, Guan D, Fu Y, Wang Y
    • Issue date: 2019 Dec 19
    • Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.
    • Authors: Prodanov T, Bansal V
    • Issue date: 2020 Nov 4