A Computational Bayesian Approach for Estimating Density Functions Based on Noise-Multiplied Data
In the big data era, third parties can easily collect enormous amounts of personal and company information. Sharing such data with the public and allowing data users to mine it often brings many benefits to the public. However, releasing microdata usually raises data privacy concerns. Protecting privacy by releasing noise-multiplied data is one of the approaches studied in the literature. This paper introduces the B-M L2014 Approach for estimating the density function of the original data from noise-multiplied microdata. Applications of the B-M L2014 Approach demonstrate that statistical information about the original data can be recovered reasonably well from the noise-multiplied data while the disclosure risk is kept under control. The B-M L2014 Approach thus provides a new data mining technique for big data when data privacy is a concern.
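To make the idea of noise-multiplied data concrete, the following minimal Python sketch masks a synthetic sample with multiplicative noise and then recovers the mean and variance of the original values from the masked values alone. It is not the B-M L2014 Approach and does not estimate a density; it only illustrates the simpler moment identity E[Y^k] = E[X^k] E[C^k], which holds when the noise C is independent of the original value X and Y = X * C. The Uniform(0.5, 1.5) noise distribution, the lognormal test data, and the function names `mask` and `recover_moments` are all assumptions chosen for illustration.

```python
import random
import statistics


def mask(values, rng, low=0.5, high=1.5):
    """Multiply each value by independent Uniform(low, high) noise.

    The uniform noise distribution is an illustrative assumption.
    """
    return [x * rng.uniform(low, high) for x in values]


def recover_moments(masked, low=0.5, high=1.5):
    """Estimate the mean and variance of the original data from masked data.

    Uses E[Y] = E[X] E[C] and E[Y^2] = E[X^2] E[C^2], which hold when the
    noise C is independent of the original value X.
    """
    noise_m1 = (low + high) / 2                       # E[C] for Uniform(low, high)
    noise_m2 = (low**2 + low * high + high**2) / 3    # E[C^2] for Uniform(low, high)
    m1 = statistics.fmean(masked) / noise_m1
    m2 = statistics.fmean(y * y for y in masked) / noise_m2
    return m1, m2 - m1**2


rng = random.Random(0)
# Synthetic stand-in for confidential microdata (hypothetical, not real data).
original = [rng.lognormvariate(3.0, 0.5) for _ in range(50_000)]
masked = mask(original, rng)

est_mean, est_var = recover_moments(masked)
true_mean = statistics.fmean(original)
true_var = statistics.pvariance(original)

print(f"mean: true={true_mean:.3f} estimated={est_mean:.3f}")
print(f"variance: true={true_var:.3f} estimated={est_var:.3f}")
```

In this sketch the data user needs only the masked values and the noise distribution, never the original records. Moment recovery of this kind is a far simpler task than the density estimation the paper addresses.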