Authors propose an efficient architecture of the hierarchical machine vision system and the multimedia transfer algorithm for a distributed client-server system which can adapt to mobile network speed variations. This algorithm is based on the wavelet transform of an input multimedia source and allows for data exchange using all the available bandwidth of the unstable mobile network. The main principle of the hierarchical recognition system is the distributed processing network utilizing the “coarse-fine” paradigm in computer vision. Each source of a video stream is processed in-place by the tiny SoC computer which acts as an Edge Computing Unit and detects the presence of an object fragment in a video frame and crops the bounds of the ROI. The resulting stream containing ROI is directed to the main server if the object is detected, reducing traffic and resource consumption.