Skip to main navigation Skip to search Skip to main content

Protein data modelling for concurrent sequential patterns

  • Jing Lu
  • , Malcolm Keech
  • , Cuiqing Wang

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    3 Citations (Scopus)

    Abstract

    Protein sequences from the same family typically share common patterns which imply their structural function and biological relationship. The challenge of identifying protein motifs is often addressed through mining frequent itemsets and sequential patterns, where post-processing is a useful technique. Earlier work has shown that Concurrent Sequential Patterns mining can be applied in bioinformatics, e.g. to detect frequently occurring concurrent protein sub-sequences. This paper presents a companion approach to data modelling and visualisation, applying it to real-world protein datasets from the PROSITE and NCBI databases. The results show the potential for graph-based modelling in representing the integration of higher level patterns common to all or nearly all of the protein sequences.
    Original languageEnglish
    Title of host publication2014 25th International Workshop on Database and Expert Systems Applications
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    ISBN (Electronic)9781479957224
    DOIs
    Publication statusPublished - 4 Dec 2014
    Event2014 25th International Workshop on Database and Expert Systems Applications - Munich
    Duration: 1 Sept 20145 Sept 2014

    Conference

    Conference2014 25th International Workshop on Database and Expert Systems Applications
    CityMunich
    Period1/09/145/09/14
    Other2014 25th International Workshop on Database and Expert Systems Applications (01/09/2014-05/09/2014, Munich)

    Keywords

    • protein

    Fingerprint

    Dive into the research topics of 'Protein data modelling for concurrent sequential patterns'. Together they form a unique fingerprint.

    Cite this