Adversarial representation learning for synthetic replacement of private attributes
Authors: John Martinsson, Edvin Listo Zec, Daniel Gillblad, Olof Mogren
Published in: Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021
Year: 2021
Location:
Abstract
Data privacy is an increasingly important aspect of many real-world analytics tasks. Data sources that contain sensitive information may have immense potential which could be unlocked using the right privacy enhancing transformations, but current methods often fail to produce convincing output. Furthermore, finding the right balance between privacy and utility is often a tricky trade-off. In this work, we propose a novel approach for data privatization, which involves two steps: in the first step, it removes the sensitive information, and in the second step, it replaces this information with an independent random sample. Our method builds on adversarial representation learning which ensures strong privacy by training the model to fool an increasingly strong adversary. While previous methods only aim at obfuscating the sensitive information, we find that adding new random information in its place strengthens the provided privacy and provides better utility at any given level of privacy. The result is an approach that can provide stronger privatization on image data, and yet be preserving both the domain and the utility of the inputs, entirely independent of the downstream task.
BibTeX
@article{Martinsson2021, author = {Martinsson, John and Zec, Edvin Listo and Gillblad, Daniel and Mogren, Olof}, title = {Adversarial representation learning for synthetic replacement of private attributes}, journal = {Proceedings - 2021 IEEE International Conference on Big Data, Big Data 2021}, year = {2021}, pages = {1291--1299}, doi = {10.1109/BigData52589.2021.9671802}, url = {https://ieeexplore.ieee.org/document/9671802} }
License/copyright for PDF: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.