Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2020
DOI: 10.18653/v1/2020.emnlp-main.443
|View full text |Cite
|
Sign up to set email alerts
|

Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements

Abstract: Natural language descriptions of user interface (UI) elements such as alternative text are crucial for accessibility and language-based interaction in general. Yet, these descriptions are constantly missing in mobile UIs. We propose widget captioning, a novel task for automatically generating language descriptions for UI elements from multimodal input including both the image and the structural representations of user interfaces. We collected a largescale dataset for widget captioning with crowdsourcing. Our d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
25
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(25 citation statements)
references
References 27 publications
0
25
0
Order By: Relevance
“…However, we had to manually add meaningful textual descriptions to the view hierarchy to do so. Fortunately, adding such descriptions can be done automatically by using a widget captioning technique [25]. This is a promising direction for future work.…”
Section: Discussionmentioning
confidence: 99%
“…However, we had to manually add meaningful textual descriptions to the view hierarchy to do so. Fortunately, adding such descriptions can be done automatically by using a widget captioning technique [25]. This is a promising direction for future work.…”
Section: Discussionmentioning
confidence: 99%
“…For example, [Li et al, 2020b] leveraged Transformer to map natural language commands to executable actions in a UI. [Li et al, 2020c; used Transformer to generate textual descriptions for UI elements. There were also attempts using convolutional neural networks to retrieve similar UIs for design mining [Deka et al, 2017;Liu et al, 2018;Huang et al, 2019].…”
Section: Related Workmentioning
confidence: 99%
“…[Li et al, 2020c; used Transformer to generate textual descriptions for UI elements. There were also attempts using convolutional neural networks to retrieve similar UIs for design mining [Deka et al, 2017;Liu et al, 2018;Huang et al, 2019]. Past work generally built task-specific models and required substantial labeled data.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations