Privacy violations must be taken into consideration when datasets are released for public use. Various anonymization models have been proposed to address them, e.g., k-anonymity, l-diversity, and t-closeness. However, these models generally assume that all attributes of the dataset are complete. They could therefore be insufficient for datasets that are allowed to contain missing values, e.g., rating datasets and trajectory datasets. This work therefore proposes a new privacy preservation model for missing-value datasets. Aside from preserving privacy, the proposed model maintains data utility as much as possible. A suitable data utility metric for missing-value datasets is also presented. Furthermore, the proposed model is shown to be an NP-complete problem by reduction from the X3C problem.