Skip to main navigation Skip to search Skip to main content

Deep-confidentiality: an IoT-enabled privacy-preserving framework for unstructured big biomedical data

  • COMSATS University Islamabad
  • Aberystwyth University
  • University of Milan
  • Incheon National University

Research output: Contribution to journalArticlepeer-review

15 Citations (Scopus)

Abstract

Due to the Internet of Things evolution, the clinical data is exponentially growing and using smart technologies. The generated big biomedical data is confidential, as it contains a patient's personal information and findings. Usually, big biomedical data is stored over the cloud, making it convenient to be accessed and shared. In this view, the data shared for research purposes helps to reveal useful and unexposed aspects. Unfortunately, sharing of such sensitive data also leads to certain privacy threats. Generally, the clinical data is available in textual format (e.g., perception reports). Under the domain of natural language processing, many research studies have been published to mitigate the privacy breaches in textual clinical data. However, there are still limitations and shortcomings in the current studies that are inevitable to be addressed. In this article, a novel framework for textual medical data privacy has been proposed as Deep-Confidentiality. The proposed framework improves Medical Entity Recognition (MER) using deep neural networks and sanitization compared to the current state-of-the-art techniques. Moreover, the new and generic utility metric is also proposed, which overcomes the shortcomings of the existing utility metric. It provides the true representation of sanitized documents as compared to the original documents. To check our proposed framework's effectiveness, it is evaluated on the i2b2-2010 NLP challenge dataset, which is considered one of the complex medical data for MER. The proposed framework improves the MER with 7.8% recall, 7% precision, and 3.8% F1-score compared to the existing deep learning models. It also improved the data utility of sanitized documents up to 13.79%, where the value of the k is 3.

Original languageEnglish
Article number42
JournalACM Transactions on Internet Technology
Volume22
Issue number2
DOIs
Publication statusPublished - 10 Nov 2021
Externally publishedYes

Keywords

  • big biomedical data
  • CNN
  • Deep neural network
  • LSTM
  • textual data privacy

ASJC Scopus subject areas

  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Deep-confidentiality: an IoT-enabled privacy-preserving framework for unstructured big biomedical data'. Together they form a unique fingerprint.

Cite this