We propose the novel visual analysis system Video of Things (VoT) using machine learning and a distributed Expectation-Maximization (EM) method. The VoT-enabled approach is proposed to perform a visual analysis of action in the context of wireless sensor systems. For the technical impact analysis of VoT, we used the EM algorithm. Furthermore, we investigate the notion of merging the clustered grid with the uniform filter in the target tracking of the distributed EM of particle filtering. We grid the network monitoring region with the technical impact analysis of VoT, then clusters the whole sensor nodes, build the system structure using a real scenario of visual actions in the VoT, and thoroughly analyze the system implementation process. Finally, we present investigations to validate the performance of the proposed research approach with a technical impact analysis of VoT. According to the study findings, the proposed VoT-based system can play an essential role in the recognition and technical impact analysis of table tennis activity.