An Alternative Compressed Storage Format for Sparse Matrices

Ekambaram, Anand; Montagne, Eurípides

doi:10.1007/978-3-540-39737-3_25

Cited by 14 publications

(10 citation statements)

References 3 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…However, as is the case for text documents, the matrix is likely to be sparse. Therefore, we can eliminate the words that never occur in the data, and/or store the matrix in a compressed format such as the Compressed Column Storage (CCS) format [5]. In our experiments, we find that only about 10% of all words have some subsequence mapped to it.…”

Section: Bag-of-words Representation For Time Seriesmentioning

confidence: 90%

Finding structurally different medical data

Lin

Yuan

2009

2009 22nd IEEE International Symposium on Computer-Based Medical Systems

View full text Add to dashboard Cite

show abstract

Section: Bag-of-words Representation For Time Seriesmentioning

confidence: 90%

Finding structurally different medical data

Lin

Yuan

2009

2009 22nd IEEE International Symposium on Computer-Based Medical Systems

View full text Add to dashboard Cite

show abstract

“…Another array row_ind() is needed to store the row indices of the non-zero elements in the original matrix. Finally, a third array is also needed, tjd_ptr(), which stores the starting position of the transposed jagged diagonals in the array val() [17,18]. Although TJDS suffers the drawback of indirect addressing, it does not need the permutation step.…”

Section: Transposed Jagged Diagonal Storage (Tjds)mentioning

confidence: 99%

“…The Transpose Jagged Diagonal Storage (TJDS) is inspired from the Jagged Diagonal Storage format and makes no assumptions about the sparsity pattern of the matrix [17]. In TJDS all the non-zero elements are shifted upward instead of leftward as in JDS.…”

Section: Transposed Jagged Diagonal Storage (Tjds)mentioning

confidence: 99%

“…Instead of storing n 2 elements, it requires only (2 * N nze + N + 1) storage locations [1]. Similar to CRS is the Compressed Column Storage (CCS) (also called the Harwell-Boeing Sparse Matrix Storage Format) [17,12], which is constructed in exactly the same way as CRS but with the roles of rows and columns interchanged. One can also say that CCS is the transpose of CRS [12].…”

Section: Point Based Storage Formatsmentioning

confidence: 99%

“…Although TJDS suffers the drawback of indirect addressing, it does not need the permutation step. This format is suitable for parallel and distributed processing [17,19]. [12].…”

Section: Transposed Jagged Diagonal Storage (Tjds)mentioning

confidence: 99%

See 2 more Smart Citations

Review of Storage Techniques for Sparse Matrices

Shahnaz

Usman

Chughtai

2005

2005 Pakistan Section Multitopic Conference

View full text Add to dashboard Cite

This paper reviews the current state of knowledge of the storage formats for sparse linear systems. Here we consider the ways developed so far for storing a sparse matrix and their quoted effects on computational speed. The main idea behind these formats involves keeping both the indices and the non-zero elements in the sparse matrix in a single data structure. These specialized schemes not only save storage but also yield computational savings. Since the locations of the non-zero elements in the matrix are known explicitly, unnecessary computations involving zeros can be avoided [1,2]. Thus the use of these formats reduces additional memory required in the usual indexing based storage schemes and gives promising performance improvements [3,4].

show abstract

Finding Structural Similarity in Time Series Data Using Bag-of-Patterns Representation

Lin

2009

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. For more than one decade, time series similarity search has been given a great deal of attention by data mining researchers. As a result, many time series representations and distance measures have been proposed. However, most existing work on time series similarity search focuses on finding shape-based similarity. While some of the existing approaches work well for short time series data, they typically fail to produce satisfactory results when the sequence is long. For long sequences, it is more appropriate to consider the similarity based on the higher-level structures. In this work, we present a histogram-based representation for time series data, similar to the "bag of words" approach that is widely accepted by the text mining and information retrieval communities. We show that our approach outperforms the existing methods in clustering, classification, and anomaly detection on several real datasets.

show abstract

An Alternative Compressed Storage Format for Sparse Matrices

Cited by 14 publications

References 3 publications

Finding structurally different medical data

Finding structurally different medical data

Review of Storage Techniques for Sparse Matrices

Finding Structural Similarity in Time Series Data Using Bag-of-Patterns Representation

Contact Info

Product

Resources

About