2019
DOI: 10.1109/access.2019.2952432
|View full text |Cite
|
Sign up to set email alerts
|

Motion Recognition Algorithm Based on Deep Edge-Aware Pyramid Pooling Network in Human–Computer Interaction

Abstract: Action recognition is an important research direction in computer vision, which has worldwide applications, such as video surveillance, human-robot interaction and so on. Due to the influence of complex background and multi-angle changes, accurate recognition and analysis of human motion in real-life scenarios is still a challenging problem. In order to improve the accuracy of pedestrian detection and motion recognition, this paper proposes a novel edge-aware end-to-end deep network method, which uses the edge… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(6 citation statements)
references
References 24 publications
0
4
0
Order By: Relevance
“…Data‐driven methods : Recently, many deep learning‐based methods have been proposed and achieved remarkable performance. Xu et al [XRY*15] propose a deep edge‐aware filter to approximate different operators based on a gradient domain training procedure. Shen et al [SCTJ17] present a pyramid structure with an extended receptive field.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Data‐driven methods : Recently, many deep learning‐based methods have been proposed and achieved remarkable performance. Xu et al [XRY*15] propose a deep edge‐aware filter to approximate different operators based on a gradient domain training procedure. Shen et al [SCTJ17] present a pyramid structure with an extended receptive field.…”
Section: Related Workmentioning
confidence: 99%
“…With the development of deep learning, data‐driven approaches [XRY*15, ZLJ*19] have also been explored on image smoothing. The deep hierarchical network structure has a natural advantage in capturing semantic cues for extracting meaningful image structures.…”
Section: Introductionmentioning
confidence: 99%
“…SPP is a manageable answer for managing aspect ratios, diverse scales, dimensions, and sizes (Wang et al 2018;Fu et al 2020). These results are significant in computer visual recognition but have gained little attention in deep network research (Xu et al 2019a). Figure 1 presents a network configuration with an SPP layer.…”
Section: Spatial Pyramid Pooling (Spp) Networkmentioning
confidence: 99%
“…However, the long and short-term memory network in the recurrent neural network is typically used to solve the correlation problem between consecutive frames in the video, as only taking into account the CNN will ignore the continuity problem of image frames. Accordingly, [9] proposes a sort of end-to-end fullyconnected long and short-term memory (LSTM) network for the human behaviour recognition technology based on human behaviour; For example, [10] applies video recognition technology for the first time by combining a CNN with a long-and short-term memory network; [11][12][13] combines the two networks and uses the dual-stream network to extract features in the time and space dimensions, respectively, to achieve good recognition results. blended and used in the field of video recognition technology; [11][12][13] successfully recognised human behaviour by combining the two networks and using a dual-stream network to extract features in the temporal and spatial dimensions, respectively.…”
Section: Introductionmentioning
confidence: 99%