1887

OECD Statistics Working Papers

The OECD Statistics Working Paper Series - managed by the OECD Statistics and Data Directorate – is designed to make available in a timely fashion and to a wider readership selected studies prepared by staff in the Secretariat or by outside consultants working on OECD projects. The papers included are of a technical, methodological or statistical policy nature and relate to statistical work relevant to the organisation. The Working Papers are generally available only in their original language - English or French - with a summary in the other.

Joint Working Papers:

Testing the evidence, how good are public sector responsiveness measures and how to improve them? (with OECD Public Governance Directorate)

Measuring Well-being and Progress in Countries at Different Stages of Development: Towards a More Universal Conceptual Framework (with OECD Development Centre)

Measuring and Assessing Job Quality: The OECD Job Quality Framework (with OECD Directorate for Employment, Labour and Social Affairs)

Forecasting GDP during and after the Great Recession: A contest between small-scale bridge and large-scale dynamic factor models (with OECD Economics Directorate)

Decoupling of wages from productivity: Macro-level facts (with OECD Economics Directorate)

Which policies increase value for money in health care? (with OECD Directorate for Employment, Labour and Social Affairs)

Compiling mineral and energy resource accounts according to the System of Environmental-Economic Accounting (SEEA) 2012 (with OECD Environment Directorate)

English

A Multiplicative Masking Method for Preserving the Skewness of the Original Micro-records

Masking methods for the safe dissemination of microdata consist of distorting the original data while preserving a pre-defined set of statistical properties in the microdata. For continuous variables, available methodologies rely essentially on matrix masking and in particular on adding noise to the original values, using more or less refined procedures depending on the extent of information that one seeks to preserve. Almost all of these methods make use of the critical assumption that the original datasets follow a normal distribution and/or that the noise has such a distribution. This assumption is, however, restrictive in the sense that few variables follow empirically a Gaussian pattern: the distribution of household income, for example, is positively skewed, and this skewness is essential information that has to be considered and preserved. This paper addresses these issues by presenting a simple multiplicative masking method that preserves skewness of the original data while offering a sufficient level of disclosure risk control. Numerical examples are provided, leading to the suggestion that this method could be well-suited for the dissemination of a broad range of microdata, including those based on administrative and business records.

English

This is a required field
Please enter a valid email address
Approval was a Success
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error