Contemporary software products and services necessitate extensive accompanying information, including user manuals and documentation. The task becomes more complex in the case of graphical applications, as it requires capturing screenshots and highlighting the described elements. Attaining high-quality visual information using conventional means demands considerable effort and time, since, unlike text, raster images do not provide an interface for retrieving metadata about individual elements, necessitating programmatic extraction of such information. This study presents a software complex designed to automate and expedite this process. The primary focus is on a topological decomposition method that segments images based on topological features such as pixel brightness and proximity, enabling the segmentation of screenshots into control elements. This approach demonstrates high accuracy (96-99 %) in identifying graphical components across most tested images. The software product encompasses functions for the automatic capture of application windows, an algorithm for identifying control elements, and the capability to export the entire image or a specified region. Additionally, an algorithm for generating output templates for image insertion has been implemented. The decomposition method has exhibited effectiveness across the majority of test data. In instances with shadow effects, noise levels were recorded at a maximum of 34 %; however, in most cases, this figure did not exceed 3 %. Further research may contribute to enhancing these results. The findings of this study could prove valuable for the continued exploration of graphical user interface decomposition and the assessment of the method's effectiveness on diverse images. The developed software complex significantly simplifies and accelerates the documentation process, thereby opening new avenues in the automation of software development.
Read full abstract