ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
DOI: 10.1109/icassp40776.2020.9054407
|View full text |Cite
|
Sign up to set email alerts
|

Multimodal Learning for Classroom Activity Detection

Abstract: Classroom activity detection (CAD) focuses on accurately classifying whether the teacher or student is speaking and recording both the length of individual utterances during a class. A CAD solution helps teachers get instant feedback on their pedagogical instructions. This greatly improves educators' teaching skills and hence leads to students' achievement. However, CAD is very challenging because (1) the CAD model needs to be generalized well enough for different teachers and students; (2) data from both voca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

1
13
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
5
3
1

Relationship

4
5

Authors

Journals

citations
Cited by 36 publications
(19 citation statements)
references
References 19 publications
1
13
0
Order By: Relevance
“…In educational research regarding teaching observation, the procedure is usually accompanied with audio or video recordings. However, a large portion of the work in this matter focuses on using linguistic features extracted from Automatic Speech Recognition (ASR) systems and Natural Language Processing (NLP) approaches for modelling teaching practices [19]. This causes these analyses to disregard important information besides the linguistic sources, such as paralinguistic or contextual ones, which are inferable from audio recordings [33], [10].…”
Section: Introductionmentioning
confidence: 99%
“…In educational research regarding teaching observation, the procedure is usually accompanied with audio or video recordings. However, a large portion of the work in this matter focuses on using linguistic features extracted from Automatic Speech Recognition (ASR) systems and Natural Language Processing (NLP) approaches for modelling teaching practices [19]. This causes these analyses to disregard important information besides the linguistic sources, such as paralinguistic or contextual ones, which are inferable from audio recordings [33], [10].…”
Section: Introductionmentioning
confidence: 99%
“…One of the main problems is the slowness of the analysis. To encode using any of the established protocols, it is estimated that 4 h of coding are required for each hour of recording ( Li et al, 2020 ). Another difficulty is the dependence on the encoder.…”
Section: Introductionmentioning
confidence: 99%
“…Using technology in classroom has shown to contribute to the development of students' creativity, motivation, and critical thought, encouraging also their capability of solving problems in a more collaborative way [30]. The increasing digital research in today's classrooms has encouraged a recent development of specific computer-based approaches for their application in E-learning environments, such as classroom activity detection [21], hand-rising gesture recognition [23], and classroom motion tracking [15]. Particularly, the proliferation of sensors in classrooms has created an environment in which students' behaviours are continuously monitored and recorded [2,28].…”
Section: Introductionmentioning
confidence: 99%