Information-theoretic disclosure risk measures in statistical disclosure control of tabular data

J. Domingo-Feffer1, A. Oganian2, V. Torra3
1Dept. of Comp. Eng. & Maths, Univ. Rovira i Virgili, Tarragona, Spain
2Dept. of Comp. Eng. and Maths, Universitat Rovira i Virgili, Tarragona, Spain
3IIIA - CSIC, Campus UAB, Bellaterra, Spain

Tóm tắt

Statistical database protection is a part of information security which tries to prevent published statistical information (tables, individual records) from disclosing the contribution of specific respondents. This paper shows how to use information-theoretic concepts to measure disclosure risk for tabular data. The proposed disclosure risk measure is compatible with a broad class of disclosure protection methods and can be extended for computing disclosure risk for a set of linked tables.

Từ khóa

#Protection #Databases #Information security #Information theory #Statistics #Aggregates #Sampling methods #Statistical distributions #Conference management

Tài liệu tham khảo

felsö, 2001, Disclosure limitation methods in use: Results of a survey, Confidentiality Disclosure and Data Access, 17 duncan, 2001, Disclosure limitation methods and information loss for tabular data, Confidentiality Disclosure and Data Access, 135 willenborg, 2001, Statistical Disclosure Control in Practice, 10.1007/978-1-4613-0121-9 gießing, 2001, Nonperturbative disclosure control methods for tabular data, Confidentiality Disclosure and Data Access, 185 fienberg, 1998, Disclosure limitation using perturbation and related methods for categorical data, Journal of Official Statistics, 14, 485 luige, 1999, Confidentiality practices in the transition countries, Proceedings of the Joint Eurostat/UNECE Work Session on Statistical Data Confidentiality, 287 holvast, 1999, Statistical dissemination, confidentiality and disclosure, Proceedings of the Joint Eurostat/UNECE Work Session on Statistical Data Confidentiality, 191 denning, 1982, Cryptography and Data Security 10.1007/3-540-47804-3_2 cox, 2001, Disclosure risk for tabular economic data, Confidentiality Disclosure and Data Access, 167