Wireframe-based UI Design Search through Image Autoencoder

Chen, Jieshan; Chen, Chunyang; Xing, Zhenchang; Xia, Xin; Zhu, Liming; Grundy, John; Wang, Jinshui

doi:10.1145/3391613

Cited by 64 publications

(26 citation statements)

References 60 publications

“…For instance, the OutSpoken screen reader for Windows 3.1 allowed users to label icons on the screen, which it then recognizes from their pixels alone [69]. Inferring information from pixels of interfaces has been applied in diverse applications such as interface augmentation and remapping [11,17,30,79], GUI testing [78], data-driven design for GUI search [20,22,45] or prototyping [70], generating UI code from existing apps to support app development [12,21,24,53,57], and GUI security [25]. Some work also employed pixel-based methods to improve accessibility, such as Prefab, which augments existing app interface with target-aware pointing techniques that enhance interaction for people with motor impairments [31].…”

Section: UI Detection From Pixels (mentioning)

confidence: 99%

Screen Recognition: Creating Accessibility Metadata for Mobile Applications from Pixels

Zhang, Greef, Swearngin, et al. 2021

Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems

Many accessibility features available on mobile platforms require applications (apps) to provide complete and accurate metadata describing user interface (UI) components. Unfortunately, many apps do not provide sufficient metadata for accessibility features to work as expected. In this paper, we explore inferring accessibility metadata for mobile apps from their pixels, as the visual interfaces often best reflect an app's full functionality. We trained a robust, fast, memory-efficient, on-device model to detect UI elements using a dataset of 77,637 screens (from 4,068 iPhone apps) that we collected and annotated. To further improve UI detections and add semantic information, we introduced heuristics (e.g., UI grouping and ordering) and additional models (e.g., recognize UI content, state, interactivity). We built Screen Recognition to generate accessibility metadata to augment iOS VoiceOver. In a study with 9 screen reader users, we validated that our approach improves the accessibility of existing mobile apps, enabling even previously inaccessible apps to be used. CCS Concepts: Human-centered computing → Accessibility technologies.

“…Therefore, SEQUER can effectively help users obtain better search results via high-quality query reformulation. Information Retrieval (IR) has been widely used in software engineering (SE) tasks, such as traceability recovery [56], [57], feature location [58], [59], library migration [2], [60], [61], API search [62], [63] and GUI design seeking [64]-[66]. In this section, we summarize the related works about query reformulation in general IR and its application in SE domain.…”

Section: E Discussion (mentioning)

confidence: 99%

Automated Query Reformulation for Efficient Search Based on Query Logs From Stack Overflow

Cao, Chen, Baltes, et al. 2021

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)

Self Cite

“…We hope that when generating new subtree combination sequences, the generator can also follow the composition conditions of GUI to a certain extent so that these synthetic sequences not only stay diverse but also meet the structural characteristics of the real GUIs. For this purpose, we use the structure strings of the subtrees from their metadata instead of the GUI wireframe images [25] [26] to represent their structures as there is explicit order among different GUI components. The minimum edit distance (MED) is introduced to quantify the structural similarity between two GUIs.…”

Section: Modeling Subtree Structure (mentioning)

confidence: 99%

GUIGAN: Learning to Generate GUI Designs Using Generative Adversarial Networks

Zhao, Chen, Liu, et al. 2021

2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE)

Self Cite

Graphical User Interface (GUI) is ubiquitous in almost all modern desktop software, mobile applications, and online websites. A good GUI design is crucial to the success of the software in the market, but designing a good GUI which requires much innovation and creativity is difficult even to well-trained designers. Besides, the requirement of the rapid development of GUI design also aggravates designers' working load. So, the availability of various automated generated GUIs can help enhance the design personalization and specialization as they can cater to the taste of different designers. To assist designers, we develop a model GUIGAN to automatically generate GUI designs. Different from conventional image generation models based on image pixels, our GUIGAN is to reuse GUI components collected from existing mobile app GUIs for composing a new design that is similar to natural-language generation. Our GUIGAN is based on SeqGAN by modeling the GUI component style compatibility and GUI structure. The evaluation demonstrates that our model significantly outperforms the best of the baseline methods by 30.77% in Fréchet Inception Distance (FID) and 12.35% in 1-Nearest Neighbor Accuracy (1-NNA). Through a pilot user study, we provide initial evidence of the usefulness of our approach for generating acceptable brand new GUI designs.
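
The GUIGAN citation statement above mentions using the minimum edit distance (MED) between structure strings to quantify how similar two GUIs are. A minimal sketch of that idea follows, assuming each GUI is represented as an ordered list of component-type tokens. The token names, the unit edit costs, and the length-normalized similarity score are illustrative assumptions, not details taken from the GUIGAN paper.

```python
from typing import Sequence


def min_edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two token sequences (unit costs assumed)."""
    # prev[j] holds the distance between the first i-1 tokens of a
    # and the first j tokens of b.
    prev = list(range(len(b) + 1))
    for i, tok_a in enumerate(a, start=1):
        curr = [i]  # turning i tokens into an empty sequence takes i deletions
        for j, tok_b in enumerate(b, start=1):
            cost = 0 if tok_a == tok_b else 1
            curr.append(min(
                prev[j] + 1,         # delete tok_a
                curr[j - 1] + 1,     # insert tok_b
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def structural_similarity(a: Sequence[str], b: Sequence[str]) -> float:
    """Map the distance to [0, 1]; this normalization is an assumption."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - min_edit_distance(a, b) / longest


# Hypothetical structure strings for two screens.
login = ["Toolbar", "ImageView", "EditText", "EditText", "Button"]
signup = ["Toolbar", "EditText", "EditText", "EditText", "Button"]

print(min_edit_distance(login, signup))
print(structural_similarity(login, signup))
```

Working on token sequences rather than wireframe images keeps the explicit ordering of components, which is the property the citing authors give as their reason for preferring structure strings.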
