Open Source Tools for Statistical Disclosure Control

Authors

  • Veena Gadad
  • Sowmyarani C N

Keywords:

Data anonymization, GDPR, open source tools, statistical disclosure control, utility

Abstract

There is a large requirement of the quality data that gets collected from various sources because the effective development, planning and research depend on the same. Huge amount of data gets collected and is stored in cloud or data centers. However, this data consists of sensitive information such as salary, disease, political affinity etc. that an individual does not want others to know. If the collected data is published as it is, then there are high chances that there is disclosure of the sensitive data and to prevent this data anonymization is used. Data anonymization must also be carried out to make the data compliant with General Data Protection Regulation (GDPR). Statistical disclosure control (SDC) is a suite of techniques to carry out data anonymization and at the same time preserving the utility of the data. This paper focuses on usage of open source tools that are available for statistical disclosure control.

Published

2020-01-17

Issue

Section

Articles