2021
DOI: 10.21203/rs.3.rs-533949/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Re-identification risk prediction paradigm using incomplete statistical information and recursive hypergeometric distribution

Abstract: The dataset anonymization has not eliminated the re-identification risk, the evaluation of which remains a huge challenge, especially given incomplete statistical information. The re-identification risk of individuals depends on their tuple frequency. The paper proposes the recursive hypergeometric (RH) distribution to accurately calculate the tuple frequency and leverages the binomial distribution to approximate the RH distribution and to efficiently predict the re-identification risk of individuals in both g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 18 publications
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?