Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Confer 2021
DOI: 10.18653/v1/2021.acl-short.70
|View full text |Cite
|
Sign up to set email alerts
|

More than Text: Multi-modal Chinese Word Segmentation

Abstract: Chinese word segmentation (CWS) is undoubtedly an important basic task in natural language processing. Previous works only focus on the textual modality, but there are often audio and video utterances (such as news broadcast and face-to-face dialogues), where textual, acoustic and visual modalities normally exist. To this end, we attempt to combine the multimodality (mainly the converted text and actual voice information) to perform CWS. In this paper, we annotate a new dataset for CWS containing text and audi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 35 publications
0
3
0
Order By: Relevance
“…To collect data for the newswire domain, we follow Zhang et al (2021) [9] and adopt the Xuexi platform, 1 on which official news released by our government. We obtain 513 speech-text articles that are read out by human announcers.…”
Section: Collecting Speech/text Datamentioning
confidence: 99%
See 2 more Smart Citations
“…To collect data for the newswire domain, we follow Zhang et al (2021) [9] and adopt the Xuexi platform, 1 on which official news released by our government. We obtain 513 speech-text articles that are read out by human announcers.…”
Section: Collecting Speech/text Datamentioning
confidence: 99%
“…Recently, with the rapid development in speech [7,8] and multi-modal fields [9,10,11], researchers have attempted to introduce speech modality as extra auxiliary information for NLP tasks and shown stable improvements. One mainstream way is to explicitly or implicitly integrate speech modality features into NLP models.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation