Mobile GUI Research Articles

The ubiquity of mobile phones makes mobile GUI understanding an important task. Most previous works in this domain require human-created metadata of screens (e.g. View Hierarchy) during inference, which unfortunately is often not available or reliable enough for GUI understanding. Inspired by the impressive success of Transformers in NLP tasks, targeting for purely vision-based GUI understanding, we extend the concepts of Words/Sentence to Pixel-Words/Screen-Sentence, and propose a mobile GUI understanding architecture: Pixel-Words to Screen-Sentence (PW2SS). In analogy to the individual Words, we define the Pixel-Words as atomic visual components (text and graphic components), which are visually consistent and semantically clear across screenshots of a large variety of design styles. The Pixel-Words extracted from a screenshot are aggregated into Screen-Sentence with a Screen Transformer proposed to model their relations. Since the Pixel-Words are defined as atomic visual components, the ambiguity between their visual appearance and semantics is dramatically reduced. We are able to make use of metadata available in training data to auto-generate high-quality annotations for Pixel-Words. A dataset, RICO-PW, of screenshots with Pixel-Words annotations is built based on the public RICO dataset, which will be released to help to address the lack of high-quality training data in this area. We train a detector to extract Pixel-Words from screenshots on this dataset and achieve metadata-free GUI understanding during inference. We conduct experiments and show that Pixel-Words can be well extracted on RICO-PW and well generalized to a new dataset, P2S-UI, collected by ourselves. The effectiveness of PW2SS is further verified in the GUI understanding tasks including relation prediction, clickability prediction, screen retrieval, and app type classification.

Read full abstract

Mobile GUI tests can be classified as layout-based – i.e. using GUI properties as locators – or Visual – i.e. using widgets’ screen captures as locators –. Visual test scripts require significant maintenance efforts to be kept aligned with the tested application as it evolves or it is ported to different devices.This work aims to conceptualize a translation-based approach to automatically derive Visual tests from existing layout-based counterparts or repair them when graphical changes occur, and to develop a tool that implements and validates the approach.We present TOGGLE, a tool that translates Espresso layout-based tests for Android apps to Visual tests that conform to either SikuliX, EyeAutomate, or a combination of the two tools’ syntax. An experiment is conducted to measure the precision of the translation approach, which is evaluated on maintenance tasks triggered by graphical changes due to device diversity.Our results demonstrate the feasibility of a translation-based approach, show that script portability to different devices is improved (from 32% to 93%), and indicate that translation can repair up to 90% of Visual locators in failing tests.GUI test translation mitigates challenges with Visual tests like maintenance effort and portability, enabling their wider use in industrial practice.

Read full abstract

Mobile GUI Research Articles

Articles published on Mobile GUI

Understanding mobile GUI: From pixel-words to screen-sentences

델파이 조사를 통한 모바일 GUI 요소의 운영체제별 컴포넌트 분류

"니팅머신의 사용 편의성을 위한 모바일 GUI 디자인 사용성 평가원칙 개발 연구: 커스터마이징 앱을 중심으로"

Translation from layout-based to visual android test scripts: An empirical evaluation

섬진강 하구역 Mobile MGIS 구축 연구

Declarative GUI descriptions for device-independent applications

Application of Mobile GUI Design Theory to the Development of an Open Source Touchscreen Smartphone GUI

MobiGUITAR: Automated Model-Based Testing of Mobile Apps

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Mobile GUI Research Articles

Articles published on Mobile GUI

Understanding mobile GUI: From pixel-words to screen-sentences

델파이 조사를 통한 모바일 GUI 요소의 운영체제별 컴포넌트 분류

"니팅머신의 사용 편의성을 위한 모바일 GUI 디자인 사용성 평가원칙 개발 연구: 커스터마이징 앱을 중심으로"

Translation from layout-based to visual android test scripts: An empirical evaluation

섬진강 하구역 Mobile MGIS 구축 연구

Declarative GUI descriptions for device-independent applications

Application of Mobile GUI Design Theory to the Development of an Open Source Touchscreen Smartphone GUI

MobiGUITAR: Automated Model-Based Testing of Mobile Apps