2013 IEEE High Performance Extreme Computing Conference (HPEC) 2013
DOI: 10.1109/hpec.2013.6670318

D4M 2.0 schema: A general purpose high performance schema for the Accumulo database

Abstract: Non-traditional, relaxed consistency, triple store databases are the backbone of many web companies (e.g., Google Big Table, Amazon Dynamo, and Facebook Cassandra). The Apache Accumulo database is a high performance open source relaxed consistency database that is widely used for government applications. Obtaining the full benefits of Accumulo requires using novel schemas. The Dynamic Distributed Dimensional Data Model (D4M) [http://www.mit.edu/~kepner/D4M] provides a uniform mathematical framework bas…

Cited by 37 publications (32 citation statements)
References 15 publications
“…The code snippet below describes the D4M syntax for loading the incidence matrix file, inserting into a table called Tedge, generating the degree table and inserting it into TedgeDeg. Details about the general schema and table design can be found in [33]. mat.E.mat'],'E'); put(Tedge,putVal(E,'1,')); Edeg = putCol(sum(E.',2),'degree,'); put(TedgeDeg,num2str(Edeg));…”
Section: F Step 6: Ingest (mentioning)
confidence: 99%
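
The ingest step quoted above (set every entry of the incidence matrix to '1', sum its columns, store the counts under a single 'degree' column as strings) can be sketched in plain Python. This is only an illustration under stated assumptions: it does not use D4M or Accumulo, the dictionaries stand in for the Tedge and TedgeDeg tables, and the sample keys and the helper names `put_val` and `degree_table` are hypothetical.

```python
from collections import defaultdict

# Hypothetical incidence matrix E as a sparse dict: (row_key, col_key) -> 1.
# The keys are made-up sample data, not taken from the cited papers.
E = {("e1", "v1"): 1, ("e1", "v2"): 1, ("e2", "v2"): 1}


def put_val(assoc, value):
    """Stand-in for putVal(E,'1,'): give every stored entry the same value."""
    return {key: value for key in assoc}


def degree_table(assoc):
    """Stand-in for putCol(sum(E.',2),'degree,') followed by num2str.

    Sums the values in each column of the matrix, then stores each
    sum as a string under a single 'degree' column.
    """
    counts = defaultdict(int)
    for (_row, col), val in assoc.items():
        counts[col] += int(val)
    return {(col, "degree"): str(total) for col, total in counts.items()}


# Plain dicts play the role of the two database tables.
tedge = put_val(E, "1")
tedge_deg = degree_table(E)
```

Keeping the column sums in a separate degree table means a query can look up how many entries a column has before scanning the edge table itself.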
“…of the four largest computing ecosystems: supercomputing, enterprise computing, big data, and traditional databases. The MIT SuperCloud has spurred the development of a number of cross-ecosystem innovations in high performance databases [31], [32], database management [33], data protection [34], database federation [35], [36], data analytics [37] and system monitoring [38].…”
Section: Experimental Environment (mentioning)
confidence: 99%
“…The SuperCloud is a fusion of the four large computing ecosystems: supercomputing, enterprise computing, big data and traditional databases into a coherent, unified platform. The MIT SuperCloud has spurred the development of a number of cross-ecosystem innovations in high performance databases [3], [13]; database management [19]; data protection [14]; database federation [11], [6]; data analytics [12]; dynamic virtual machines [23], [8] and system monitoring [7].…”
Section: Introduction (mentioning)
confidence: 99%