
Timeline and episode-structured clinical data: pre-processing for Data Mining and analytics

  • Jing Lu
  • Alan Hales
  • David Rew
  • Malcolm Keech

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

    7 Citations (Scopus)

    Abstract

    Data mining has been used in the healthcare domain for diagnosis and treatment analysis, resource management and fraud detection. It brings a set of tools and techniques that can be applied to large-scale patient data to discover underlying patterns and give healthcare professionals an additional source of knowledge for making decisions. The Southampton Breast Cancer Data System (SBCDS), containing some 16,000 timeline-structured records, is a visually rich and highly intuitive system for the manual and automated transfer of demographic, pathology and treatment data into an episode-based structure. While expanding the data mining capability of SBCDS is one of the objectives of our research, real-world patient data is generally incomplete and inconsistent, and contains errors. This case study focuses on the data pre-processing stage, which cleans the raw data and prepares the final dataset for use in data mining and analytics. Some initial results are given for sequential pattern mining and classification, which highlight the advantages of the approach.
    Original language: English
    Publisher: Institute of Electrical and Electronics Engineers Inc.
    Pages: 64-67
    ISBN (Print): 9781509021086
    Publication status: Published - 23 Jun 2016
    Event: IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) - Helsinki
    Duration: 16 May 2016 to 20 May 2016

    Conference

    Conference: IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)
    City: Helsinki
    Period: 16/05/16 to 20/05/16
    Other: IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) (16/05/2016 to 20/05/2016, Helsinki)

    UN SDGs

    This output contributes to the following UN Sustainable Development Goals (SDGs)

    1. SDG 3 - Good Health and Well-being

    Keywords

    • Health informatics
    • Pre-Processing
    • breast cancer data
    • data mining
    • electronic patient records
