Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Softw 2020
DOI: 10.1145/3368089.3409691
|View full text |Cite
|
Sign up to set email alerts
|

Object detection for graphical user interface: old fashioned or deep learning or a combination?

Abstract: Detecting Graphical User Interface (GUI) elements in GUI images is a domain-specific object detection task. It supports many software engineering tasks, such as GUI animation and testing, GUI search and code generation. Existing studies for GUI element detection directly borrow the mature methods from computer vision (CV) domain, including old fashioned ones that rely on traditional image processing features (e.g., canny edge, contours), and deep learning models that learn to detect from large-scale GUI data. … Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
45
0
2

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
3
1

Relationship

2
6

Authors

Journals

citations
Cited by 115 publications
(47 citation statements)
references
References 51 publications
0
45
0
2
Order By: Relevance
“…For instance, the OutSpoken screen reader for Windows 3.1 allowed users to label icons on the screen, which it then recognizes from their pixels alone [69]. Inferring information from pixels of interfaces has been applied in diverse applications such as interface augmentation and remapping [11,17,30,79], GUI testing [78], data-driven design for GUI search [20,22,45] or prototyping [70], generating UI code from existing apps to support app development [12,21,24,53,57], and GUI security [25]. Some work also employed pixel-based methods to improve accessibility, such as Prefab, which augments existing app interface with targetaware pointing techniques that enhance interaction for people with motor impairments [31].…”
Section: Ui Detection From Pixelsmentioning
confidence: 99%
See 2 more Smart Citations
“…For instance, the OutSpoken screen reader for Windows 3.1 allowed users to label icons on the screen, which it then recognizes from their pixels alone [69]. Inferring information from pixels of interfaces has been applied in diverse applications such as interface augmentation and remapping [11,17,30,79], GUI testing [78], data-driven design for GUI search [20,22,45] or prototyping [70], generating UI code from existing apps to support app development [12,21,24,53,57], and GUI security [25]. Some work also employed pixel-based methods to improve accessibility, such as Prefab, which augments existing app interface with targetaware pointing techniques that enhance interaction for people with motor impairments [31].…”
Section: Ui Detection From Pixelsmentioning
confidence: 99%
“…There are multiple approaches to pixel-based interpretation of interfaces. Recent work by Chen et al [24] categorizes and evaluates two major GUI detection approaches: using traditional image processing methods (e.g., edge/contour detection [57], template matching [30,61,78]) and using deep learning models trained on large-scale GUI data [21,24].…”
Section: Ui Detection From Pixelsmentioning
confidence: 99%
See 1 more Smart Citation
“…Fischer et al [53] transfer the style from fine art to GUI. Chen et al [54] study different GUI element detection methods on large-scale GUI data and develop UIED [55] to handle diverse and complicated GUI images. Other supporting works such as GUI tag prediction [56] and GUI component gallery construction [57] can enhance designers' searching efficiency.…”
Section: A Gui Designmentioning
confidence: 99%
“…We use computer-vision techniques to achieve these two goals. In par ticular, we use the UI widget detection tool [23] to detect non text UI widget regions (e.g., icon, button), and use EAST [20] to detect text regions. The detected icons and images help to validate the guidelines regarding icon usage, such as " don't use same icons to represent different destinations in navigation drawer" .…”
Section: Parsing Input Ui Design Imagementioning
confidence: 99%