Slicing-based enhanced method for privacy-preserving in publishing big data
Abstract
Publishing big data and making it accessible to researchers is important for knowledge building as it helps in applying highly efficient methods to plan, conduct, and assess scientific research. However, publishing and processing big data poses a privacy concern related to protecting individuals’ sensitive information while maintaining the usability of the published data. Several anonymization methods, such as slicing and merging, have been designed as solutions to the privacy concerns for publishing big data. However, the major drawback of merging and slicing is the random permutation procedure, which does not always guarantee complete protection against attribute or membership disclosure. Moreover, merging procedures may generate many fake tuples, leading to a loss of data utility and subsequent erroneous knowledge extraction. This study therefore proposes a slicing-based enhanced method for privacy-preserving big data publishing while maintaining the data utility. In particular, the proposed method distributes the data into horizontal and vertical partitions. The lower and upper protection levels are then used to identify the unique and identical attributes’ values. The unique and identical attributes are swapped to ensure the published big data is protected from disclosure risks. The outcome of the experiments demonstrates that the proposed method could maintain data utility and provide stronger privacy preservation.Citation
BinJubier, M., Ismail, M.A., Ahmed, A.A. and Sadiq, A.S. (2022) Slicing-based enhanced method for privacy-preserving in publishing big data. Computers, Materials and Continua, 72(2), pp. 3665-3686Publisher
Tech Science PressJournal
Computers, Materials and ContinuaAdditional Links
https://www.techscience.com/cmc/v72n2/47175Type
Journal articleLanguage
enDescription
© 2022 The Authors. Published by Tech Science Press. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: https://www.techscience.com/cmc/v72n2/47175ISSN
1546-2218Sponsors
This work was supported by Postgraduate Research Grants Scheme (PGRS) with grant No. PGRS190360.ae974a485f413a2113503eed53cd6c53
10.32604/cmc.2022.024663
Scopus Count
Collections
Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by/4.0/