• Login
    View Item 
    •   Home
    • The Christie Research Publications Repository
    • All Christie Publications
    • View Item
    •   Home
    • The Christie Research Publications Repository
    • All Christie Publications
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of ChristieCommunitiesTitleAuthorsIssue DateSubmit DateSubjectsThis CollectionTitleAuthorsIssue DateSubmit DateSubjects

    My Account

    LoginRegister

    Local Links

    The Christie WebsiteChristie Library and Knowledge Service

    Statistics

    Display statistics

    EPICURE :Ensemble pretrained models for extracting cancer mutations from literature

    • CSV
    • RefMan
    • EndNote
    • BibTex
    • RefWorks
    Authors
    Cao, J. R.
    van Veen, E. M.
    Peek, N.
    Renehan, Andrew G
    Ananiadou, S.
    Affiliation
    National Centre for Text Mining Dept. of Computer Science The University of Manchester Manchester,
    Issue Date
    2021
    
    Metadata
    Show full item record
    Abstract
    Abstract—To interpret the genetic profile present in a patient sample, it is necessary to know which mutations have important roles in the development of the corresponding cancer type. Named entity recognition (NER) is a core step in the text mining pipeline which facilitates mining valuable cancer information from the scientific literature. However, due to the scarcity of related datasets, previous NER attempts in this domain either suffer from low performance when deep learning based models are deployed, or they apply feature-based machine learning models or rule based models to tackle this problem, which requires intensive efforts from domain experts, and limit the model generalization capability. In this paper, we propose EPICURE, an ensemble pre trained model equipped with a conditional random field pattern (CRF) layer and a span prediction pattern (Span) layer to extract cancer mutations from text. We also adopt a data augmentation strategy to expand our training set from multiple datasets. Ex perimental results on three benchmark datasets show competitive results compared to the baseline models, validating our model’s effectiveness and advances in generalization capability.
    Citation
    Cao JR, van Veen EM, Peek N, Renehan AG, Ananiadou S. EPICURE :Ensemble Pretrained Models for Extracting Cancer Mutations from Literature. 2021 Ieee 34th International Symposium on Computer-Based Medical Systems (Cbms). 2021:461-7. PubMed PMID: WOS:000847341000079.
    Journal
    2021 Ieee 34th International Symposium on Computer-Based Medical Systems
    URI
    http://hdl.handle.net/10541/625670
    Type
    Article
    Language
    en
    Collections
    All Christie Publications

    entitlement

     
    DSpace software (copyright © 2002 - 2023)  DuraSpace
    Quick Guide | Contact Us
    Open Repository is a service operated by 
    Atmire NV
     

    Export search results

    The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

    By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

    To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

    After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.