2021
DOI: 10.3233/ica-210652
|View full text |Cite
|
Sign up to set email alerts
|

Machine learning for video event recognition

Abstract: In recent years, the spread of video sensor networks both in public and private areas has grown considerably. Smart algorithms for video semantic content understanding are increasingly developed to support human operators in monitoring different activities, by recognizing events that occur in the observed scene. With the term event, we refer to one or more actions performed by one or more subjects (e.g., people or vehicles) acting within the same observed area. When these actions are performed by subjects that… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(4 citation statements)
references
References 159 publications
0
4
0
Order By: Relevance
“…The large amount of data available has encouraged active research on analysis techniques that extract knowledge in different settings [15,16]. These techniques are able to perform different tasks in diverse fields such as the estimation of variables like the strain of a structural member in buildings [17] or the evaporation in cooling towers [18].…”
Section: Related Workmentioning
confidence: 99%
“…The large amount of data available has encouraged active research on analysis techniques that extract knowledge in different settings [15,16]. These techniques are able to perform different tasks in diverse fields such as the estimation of variables like the strain of a structural member in buildings [17] or the evaporation in cooling towers [18].…”
Section: Related Workmentioning
confidence: 99%
“…In this way, environment background and personal information are removed from the input, enabling the model to focus exclusively on the subject and its dynamics, 84 i.e., the person moving in the scene, like in most real camera-based surveillance scenarios. 85 Instead, the sanitized amplitudes are extracted from the CSI measurements of sequential Wi-Fi data packets as signal-based features describing human poses in the radio domain. 86 This paired input enables the cross-modality supervision to learn a mapping from one domain to another during the network training phase.…”
Section: Related Workmentioning
confidence: 99%
“…More recently, there have been several attempts to connect AI mechanisms with the learning algorithms in neural networks, which are raising a research hotspot in a wide range of possible applications, including network intrusion detection (Martina & Foresti, 2021), person re‐identification (Gómez‐Silva et al., 2021), and video event recognition (Avola et al., 2021). Deep‐learning‐based methods have outperformed traditional models in many machine learning tasks (Lara‐Benitez et al., 2021).…”
Section: Introductionmentioning
confidence: 99%