Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
In this information age, data and knowledge extracted by data mining techniques represent a key asset driving research, innovation, and policy-making activities. Many agencies and organizations have recognized the need of accelerating such trends and are therefore willing to release the data they collected to other parties, for purposes such as research and the formulation of public policies. However the data publication processes are today still very difficult. Data often contains personally identifiable information and therefore releasing such data may result in privacy breaches; this is the case for the examples of microdata, e.g., census data and medical data. Several anonymization techniques, such as generalization and bucketization, have been designed for privacy preserving microdata publishing. Recent work has shown that generalization loses considerable amount of information, especially for high dimensional data. Bucketization, on the other hand, does not prevent membership disclosure and does not apply for data that do not have a clear separation between quasi-identifying attributes and sensitive attributes. This paper presents a novel technique called slicing, which publish and share microdata in a privacy-preserving manner. It partitions the data both horizontally and vertically. Slicing preserves better data utility than generalization and can be used for membership disclosure protection. Slicing can also handle high-dimensional data. It shows how slicing can be used for attribute disclosure protection and develop an efficient algorithm for computing the sliced data that obey the -diversity requirement.
TABLE OF CONTENTS
CHAPTER NO ABSTRACT
TITLE
PAGE NO
i iii iv 1
LIST OF FIGURES LIST OF ABBREVATIONS 1 INTRODUCTION 1.1 MICRODATA PUBLISHING 1.1.1 INFORMATION RISKS DISCLOSURE
REFERENCES
ii
iii
iv