2014 IEEE High Performance Extreme Computing Conference (HPEC) 2014
DOI: 10.1109/hpec.2014.7040946

Computing on masked data: a high performance method for improving big data veracity

Abstract: The growing gap between data and users calls for innovative tools that address the challenges faced by big data volume, velocity and variety. Along with these standard three V's of big data, an emerging fourth "V" is veracity, which addresses the confidentiality, integrity, and availability of the data. Traditional cryptographic techniques that ensure the veracity of data can have overheads that are too large to apply to big data. This work introduces a new technique called Computing on Masked Data (CMD), which…

Cited by 62 publications (42 citation statements)
References 10 publications
“…The SuperCloud is a fusion of the four large computing ecosystems: supercomputing, enterprise computing, big data and traditional databases into a coherent, unified platform. The MIT SuperCloud has spurred the development of a number of cross-ecosystem innovations in high performance databases [3], [13]; database management [19]; data protection [14]; database federation [11], [6]; data analytics [12]; dynamic virtual machines [23], [8] and system monitoring [7].…”
Section: Introduction (mentioning)
confidence: 99%
“…Data veracity is becoming a research hotspot of big data and there have been many related studies in the literature [2,4,10,16,20,15]. For example, Kepner et al [10] introduced a new technique called Computing on Masked Data (CMD) to improve data veracity while allowing a wide range of computations and queries to be performed with low overhead by combining efficient cryptographic encryption methods with an associative array representation of big data.…”
Section: Related Work (mentioning)
confidence: 99%
“…For example, Kepner et al [10] introduced a new technique called Computing on Masked Data (CMD) to improve data veracity while allowing a wide range of computations and queries to be performed with low overhead by combining efficient cryptographic encryption methods with an associative array representation of big data. Bodnar et al [4] proposed a veracity assessment model for information dissemination on social media networks that combines natural language processing and machine learning algorithms to mine textual content generated by each user.…”
Section: Related Work (mentioning)
confidence: 99%
“…The costs of these approaches are a key aspect of their adoption and there is active research to develop technologies that minimize their limitations [Kepner et al 2014].…”
Section: Introduction (mentioning)
confidence: 99%