2006
DOI: 10.1007/s00778-006-0010-5
|View full text |Cite
|
Sign up to set email alerts
|

A privacy-preserving technique for Euclidean distance-based mining algorithms using Fourier-related transforms

Abstract: Privacy preserving data mining has become increasingly popular because it allows sharing of privacy-sensitive data for analysis purposes. However, existing techniques such as random perturbation do not fare well for simple yet widely used and efficient Euclidean distance-based mining algorithms. Although original data distributions can be pretty accurately reconstructed from the perturbed data, distances between individual data points are not preserved, leading to poor accuracy for the distance-based mining me… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
69
0

Year Published

2008
2008
2016
2016

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 87 publications
(70 citation statements)
references
References 34 publications
(63 reference statements)
1
69
0
Order By: Relevance
“…SMC-based solutions [3] provide secure mining algorithms for the distributed environment, and they are orthogonal to our PPC problem. On the other hand, distortion-based solutions [1], [6]- [8] are generally used for the centralized environment that consists of multiple data providers and one or more third parties. A simple distortion-based solution is using random data perturbation [1], [8], but it may incur bad clustering accuracy [6].…”
Section: Related Workmentioning
confidence: 99%
See 4 more Smart Citations
“…SMC-based solutions [3] provide secure mining algorithms for the distributed environment, and they are orthogonal to our PPC problem. On the other hand, distortion-based solutions [1], [6]- [8] are generally used for the centralized environment that consists of multiple data providers and one or more third parties. A simple distortion-based solution is using random data perturbation [1], [8], but it may incur bad clustering accuracy [6].…”
Section: Related Workmentioning
confidence: 99%
“…In this paper we address the problem of privacy-preserving clustering (PPC in short) on sensitive time-series data [5], [6]. Typical examples are as follows: (1) drivers do not wish to disclose their exact speed recorded in the vehicle monitoring system, but they still allow clustering of driving patterns [8]; (2) patients with heart disease do not want to disclose their private electrocardiogram (ECG) data, but they still allow clustering of patient ECG data.…”
Section: Introductionmentioning
confidence: 99%
See 3 more Smart Citations