Skip to main navigation Skip to search Skip to main content

Applications of concurrent sequential patterns in protein data mining

  • Cuiqing Wang
  • , Jing Lu
  • , Malcolm Keech

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    3 Citations (Scopus)

    Abstract

    Protein sequences of the same family typically share common patterns which imply their structural function and biological relationship. Traditional sequential patterns mining has its focus on mining frequently occurring sub-sequences. However, a number of applications motivate the search for more structured patterns, such as protein motif mining. This paper builds on the original idea of structural relation patterns and applies the Concurrent Sequential Patterns (ConSP) mining approach in bioinformatics. Specifically, a new method and algorithms are presented using support vectors as the data structure for the extraction of novel patterns in protein sequences. Experiments with real-world protein datasets highlight the applicability of the ConSP methodology in protein data mining. The results show the potential for knowledge discovery in the field of protein structure identification.
    Original languageEnglish
    Title of host publicationMachine Learning and Data Mining in Pattern Recognition
    PublisherSpringer
    ISBN (Electronic)9783319089782
    ISBN (Print)9783319089782
    DOIs
    Publication statusPublished - 1 Jan 2014
    EventMLDM: International Workshop on Machine Learning and Data Mining in Pattern Recognition - St Petersburg
    Duration: 21 Jul 201424 Jul 2014

    Conference

    ConferenceMLDM: International Workshop on Machine Learning and Data Mining in Pattern Recognition
    CitySt Petersburg
    Period21/07/1424/07/14
    OtherMLDM: International Workshop on Machine Learning and Data Mining in Pattern Recognition (21/07/2014-24/07/2014, St Petersburg)

    Keywords

    • protein

    Fingerprint

    Dive into the research topics of 'Applications of concurrent sequential patterns in protein data mining'. Together they form a unique fingerprint.

    Cite this