2015
DOI: 10.1145/2814569
|View full text |Cite
|
Sign up to set email alerts
|

iDiary

Abstract: This article describes iDiary, a system that takes as input GPS data streams generated by users’ phones and turns them into textual descriptions of the trajectories. The system features a user interface similar to Google Search that allows users to type text queries on their activities (e.g., “Where did I buy books?”) and receive textual answers based on their GPS signals. iDiary uses novel algorithms for semantic compression and trajectory clustering of massive GPS signals in parallel to compute the critical … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2017
2017
2019
2019

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 37 publications
0
3
0
Order By: Relevance
“…The algorithms in the previous sections are optimal but take polynomial time in the input (length of string). However, their running time can be easily reduced to be linear in the input, by running them on core-sets for segmentation [29,30]. Roughly speaking, core-set is a problem-dependent reduction of the input, such that running the existing algorithm for solving the problem on the core-set, would yield a provable approximation compared to the result of running the algorithm on the complete data.…”
Section: Linear-time Streaming and Parallel Computationmentioning
confidence: 99%
See 1 more Smart Citation
“…The algorithms in the previous sections are optimal but take polynomial time in the input (length of string). However, their running time can be easily reduced to be linear in the input, by running them on core-sets for segmentation [29,30]. Roughly speaking, core-set is a problem-dependent reduction of the input, such that running the existing algorithm for solving the problem on the core-set, would yield a provable approximation compared to the result of running the algorithm on the complete data.…”
Section: Linear-time Streaming and Parallel Computationmentioning
confidence: 99%
“…The sum of distances from the original set to every signal that consists of a constant number of k linear segments, is approximated by C, up to (1 + ε) multiplicative factor, where ε ∈ (0, 1) is constant. More generally, the core-set time has roughly quadratic dependency on k and 1/ ; see [29,30] for details. Unlike many solutions in machine or PAC-learning, in this and most core-sets there are no special assumptions on the size of input or its distribution (i.e., worse case input is assumed).…”
Section: Linear-time Streaming and Parallel Computationmentioning
confidence: 99%
“…However, their work only supports categorical attributes. Other effective algorithms are proposed [10][11][12][13][14][15][16][17][18][19][20][21], but all have the shortcoming that can not handle a large scale data effectively.…”
Section: Related Workmentioning
confidence: 99%