2018
DOI: 10.1109/tcsvt.2017.2716819
|View full text |Cite
|
Sign up to set email alerts
|

First-Person Daily Activity Recognition With Manipulated Object Proposals and Non-Linear Feature Fusion

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
14
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(14 citation statements)
references
References 43 publications
0
14
0
Order By: Relevance
“…In recommender system research, two lines of contributions are most significant to date: 1) pure Collaborative Filtering (CF) techniques such as matrix factorization [27] and its variants [11], and 2) content-or context-aware methods that rely on more complex models such as featurebased embeddings [2] and deep learning [36], [37]. While multimedia recommendation falls into the second category of content-based recommendations, it is more challenging yet popular, due to massive and abundant multimedia (e.g., visual, acoustic and semantic) features in real-world information systems [19], [38], [39].…”
Section: Multimedia Recommendationmentioning
confidence: 99%
“…In recommender system research, two lines of contributions are most significant to date: 1) pure Collaborative Filtering (CF) techniques such as matrix factorization [27] and its variants [11], and 2) content-or context-aware methods that rely on more complex models such as featurebased embeddings [2] and deep learning [36], [37]. While multimedia recommendation falls into the second category of content-based recommendations, it is more challenging yet popular, due to massive and abundant multimedia (e.g., visual, acoustic and semantic) features in real-world information systems [19], [38], [39].…”
Section: Multimedia Recommendationmentioning
confidence: 99%
“…The two-stream approach for action recognition is first proposed in [10], in which spatial network acquires single RGB frame feature and temporal network captures the motion pattern between frames with the input of 10 stacked optical flow images. Then a great number of efforts [16], [17], [32] have been made to enhance the twostream network from different perspectives.…”
Section: B Action Recognition With Deep Neural Networkmentioning
confidence: 99%
“…This segmentation process is a 1D version of the image-based 2D region-growing technique, which is a class of bottom-up image segmentation algorithms used to segment an image into homogeneous regions based on similarity of neighboring gray-level values. Image segmentation methods such as the top-down quad-tree and bottomup region-growing techniques are useful tools that have been utilized in a variety of image processing applications such as image steganography [17], image compression [27], and object detection [28].…”
Section: A Pixogram Segment-growingmentioning
confidence: 99%