Topic modeling of user interfaces (UIs), also known as layout design categorization, contributes to a better understanding of the UI functionality. Starting from Rico, a large dataset of mobile UIs, we revised a random sample of 10k UIs and concluded to Enrico (shorthand of Enhanced Rico), a human-supervised high-quality dataset comprising 1460 UIs and 20 design topics. As a validation example, we train a deep learning model for three different UI representations (screenshots, wireframes, and embeddings). The screenshot representation provides the highest discriminative power (95% AUC) and a competitive accuracy of 75% (a random classifier achieves 5% accuracy in the same task). We discuss several applications that can be developed with this new public resource, including e.g. semantic UI captioning and tagging, explainable UI designs, smart tutorials, and improved design search capabilities.
PDF, 1.4 MB
Enrico: A High-quality Dataset for Topic Modeling of Mobile UI Designs.
In Proceedings of the 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct (MobileHCI'20)
@inproceedings{leiva20_enrico,
author = {Luis A. Leiva and Asutosh Hota and Antti Oulasvirta},
title = {Enrico: A High-quality Dataset for Topic Modeling of Mobile {UI} Designs},
booktitle = {Proceedings of the 22nd International Conference on Human-Computer Interaction with Mobile Devices and Services Adjunct},
series = {MobileHCI'20},
year = {2020},
doi = {10.1145/3406324.3410710},
}
For questions and further information, please contact:
Luis A. Leiva
Email:
luis.leiva (at) aalto.fi
Acknowledgements: We acknowledge the computational resources provided by the Aalto Science-IT project. We thanks Crista Kaukinen for helping us with the UI labeling tasks. This work has been supported by the Academy of Finland (grant no. 318559).