iDiary

Feldman, Dan; Sung, Cynthia; Sugaya, Andrew; Rus, Daniela

doi:10.1145/2814569

Cited by 6 publications

(3 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The algorithms in the previous sections are optimal but take polynomial time in the input (length of string). However, their running time can be easily reduced to be linear in the input, by running them on core-sets for segmentation [29,30]. Roughly speaking, core-set is a problem-dependent reduction of the input, such that running the existing algorithm for solving the problem on the core-set, would yield a provable approximation compared to the result of running the algorithm on the complete data.…”

Section: Linear-time Streaming and Parallel Computationmentioning

confidence: 99%

“…The sum of distances from the original set to every signal that consists of a constant number of k linear segments, is approximated by C, up to (1 + ε) multiplicative factor, where ε ∈ (0, 1) is constant. More generally, the core-set time has roughly quadratic dependency on k and 1/ ; see [29,30] for details. Unlike many solutions in machine or PAC-learning, in this and most core-sets there are no special assumptions on the size of input or its distribution (i.e., worse case input is assumed).…”

Section: Linear-time Streaming and Parallel Computationmentioning

confidence: 99%

See 1 more Smart Citation

Finding Patterns in Signals Using Lossy Text Compression

2019

Self Cite

View full text Add to dashboard Cite

Whether the source is autonomous car, robotic vacuum cleaner, or a quadcopter, signals from sensors tend to have some hidden patterns that repeat themselves. For example, typical GPS traces from a smartphone contain periodic trajectories such as "home, work, home, work, · · · ". Our goal in this study was to automatically reverse engineer such signals, identify their periodicity, and then use it to compress and de-noise these signals. To do so, we present a novel method of using algorithms from the field of pattern matching and text compression to represent the "language" in such signals. Common text compression algorithms are less tailored to handle such strings. Moreover, they are lossless, and cannot be used to recover noisy signals. To this end, we define the recursive run-length encoding (RRLE) method, which is a generalization of the well known run-length encoding (RLE) method. Then, we suggest lossy and lossless algorithms to compress and de-noise such signals. Unlike previous results, running time and optimality guarantees are proved for each algorithm. Experimental results on synthetic and real data sets are provided. We demonstrate our system by showing how it can be used to turn commercial micro air-vehicles into autonomous robots. This is by reverse engineering their unpublished communication protocols and using a laptop or on-board micro-computer to control them. Our open source code may be useful for both the community of millions of toy robots users, as well as for researchers that may extend it for further protocols.

show abstract

Section: Linear-time Streaming and Parallel Computationmentioning

confidence: 99%

Section: Linear-time Streaming and Parallel Computationmentioning

confidence: 99%

Finding Patterns in Signals Using Lossy Text Compression

2019

Self Cite

View full text Add to dashboard Cite

show abstract

“…However, their work only supports categorical attributes. Other effective algorithms are proposed [10][11][12][13][14][15][16][17][18][19][20][21], but all have the shortcoming that can not handle a large scale data effectively.…”

Section: Related Workmentioning

confidence: 99%

An Optimized Iterative Semantic Compression Algorithm And Parallel Processing for Large Scale Data

Jin¹,

Chen²,

Tung³

et al. 2018

KSII TIIS

View full text Add to dashboard Cite

With the continuous growth of data size and the use of compression technology, data reduction has great research value and practical significance. Aiming at the shortcomings of the existing semantic compression algorithm, this paper is based on the analysis of ItCompress algorithm, and designs a method of bidirectional order selection based on interval partitioning, which named An Optimized Iterative Semantic Compression Algorithm (Optimized ItCompress Algorithm). In order to further improve the speed of the algorithm, we propose a parallel optimization iterative semantic compression algorithm using GPU (POICAG) and an optimized iterative semantic compression algorithm using Spark (DOICAS). A lot of valid experiments are carried out on four kinds of datasets, which fully verified the efficiency of the proposed algorithm.

show abstract