• Login
    View Item 
    •   Home
    • The Christie Research Publications Repository
    • All Christie Publications
    • View Item
    •   Home
    • The Christie Research Publications Repository
    • All Christie Publications
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of ChristieCommunitiesTitleAuthorsIssue DateSubmit DateSubjectsThis CollectionTitleAuthorsIssue DateSubmit DateSubjectsProfilesView

    My Account

    LoginRegister

    Local Links

    The Christie WebsiteChristie Library and Knowledge Service

    Statistics

    Display statistics

    Establishing a colorectal cancer research database from routinely collected health data: the process and potential from a pilot study

    • CSV
    • RefMan
    • EndNote
    • BibTex
    • RefWorks
    Authors
    Tamm, A.
    Jones, H. J.
    Perry, W.
    Campbell, D.
    Carten, R.
    Davies, J.
    Galdikas, A.
    English, L.
    Garbett, Alexander
    Glampson, B.
    Harris, S.
    Khan, K.
    Little, S.
    Malcomson, Lee
    Matharu, S.
    Mayer, E.
    Mercuri, L.
    Morris, E. J.
    Muirhead, R.
    Norris, R.
    O'Hara, Catherine
    Papadimitriou, D.
    Peek, N.
    Renehan, Andrew G
    Roadknight, G.
    Starling, N.
    Teare, M.
    Turner, R.
    Várnai, K. A.
    Wasan, H.
    Woods, K.
    Cunningham, C.
    Show allShow less
    Affiliation
    NIHR Oxford Biomedical Research Centre, Oxford, UK
    Issue Date
    2022
    
    Metadata
    Show full item record
    Abstract
    Objective: Colorectal cancer is a common cause of death and morbidity. A significant amount of data are routinely collected during patient treatment, but they are not generally available for research. The National Institute for Health Research Health Informatics Collaborative in the UK is developing infrastructure to enable routinely collected data to be used for collaborative, cross-centre research. This paper presents an overview of the process for collating colorectal cancer data and explores the potential of using this data source. Methods: Clinical data were collected from three pilot Trusts, standardised and collated. Not all data were collected in a readily extractable format for research. Natural language processing (NLP) was used to extract relevant information from pseudonymised imaging and histopathology reports. Combining data from many sources allowed reconstruction of longitudinal histories for each patient that could be presented graphically. Results: Three pilot Trusts submitted data, covering 12 903 patients with a diagnosis of colorectal cancer since 2012, with NLP implemented for 4150 patients. Timelines showing individual patient longitudinal history can be grouped into common treatment patterns, visually presenting clusters and outliers for analysis. Difficulties and gaps in data sources have been identified and addressed. Discussion: Algorithms for analysing routinely collected data from a wide range of sites and sources have been developed and refined to provide a rich data set that will be used to better understand the natural history, treatment variation and optimal management of colorectal cancer. Conclusion: The data set has great potential to facilitate research into colorectal cancer.
    Journal
    BMJ Health Care Inform
    URI
    http://hdl.handle.net/10541/625389
    DOI
    10.1136/bmjhci-2021-100535
    PubMed ID
    35738723
    Additional Links
    https://dx.doi.org/10.1136/bmjhci-2021-100535
    Type
    Article
    Language
    en
    ae974a485f413a2113503eed53cd6c53
    10.1136/bmjhci-2021-100535
    Scopus Count
    Collections
    All Christie Publications

    entitlement

    Related articles

    • A method for cohort selection of cardiovascular disease records from an electronic health record system.
    • Authors: Abrahão MTF, Nobre MRC, Gutierrez MA
    • Issue date: 2017 Jun
    • Designing an openEHR-Based Pipeline for Extracting and Standardizing Unstructured Clinical Data Using Natural Language Processing.
    • Authors: Wulff A, Mast M, Hassler M, Montag S, Marschollek M, Jack T
    • Issue date: 2020 Dec
    • Extracting data from electronic medical records: validation of a natural language processing program to assess prostate biopsy results.
    • Authors: Thomas AA, Zheng C, Jung H, Chang A, Kim B, Gelfond J, Slezak J, Porter K, Jacobsen SJ, Chien GW
    • Issue date: 2014 Feb
    • Assessing Information Congruence of Documented Cardiovascular Disease between Electronic Dental and Medical Records.
    • Authors: Patel J, Mowery D, Krishnan A, Thyvalikakath T
    • Issue date: 2018
    • A frame semantic overview of NLP-based information extraction for cancer-related EHR notes.
    • Authors: Datta S, Bernstam EV, Roberts K
    • Issue date: 2019 Dec
    DSpace software (copyright © 2002 - 2025)  DuraSpace
    Quick Guide | Contact Us
    Open Repository is a service operated by 
    Atmire NV
     

    Export search results

    The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

    By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

    To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

    After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.