Abstract

Video data is the biggest IoT data which is challenging for effective analysis with good performance. Object mis-detection is usually inevitable in edge-based distributed cross-scene video analysis. Traditional centralized model training can potentially result in edge data leakage. Even though joint model can be trained with federated learning while maintaining data privacy, the size of gradient data transmitted is large for computer vision models used. To address these problems, this paper proposed an Accurate and efficient federated learning-based edge intelligence for effective video analysis method called EIEVA-AEFL. In EIEVA-AEFL, a Federation Mis-detection Reinforcement Network (FMRN) is designed to alleviate the mis-detection problem. FMRN contains a vanilla object detection network and a mis-detection reinforcement branch, which finetunes object detection via feature re-extraction to reduce object mis-detection. To reduce the communication cost in training, an efficient federated learning strategy is designed. In this strategy, an oscillation suppression loss function is proposed to suppress the loss fluctuation resulting from data on edge clients. Average accuracy and recall increase 0.5 and 0.7 with FMRN on the MS COCO (Microsoft Common Objects in Context) data set respectively, and with improvements of 4.5 and 5.5 with FMRN on our self-made mis-detection dataset respectively. EIEVA-AEFL can reduce the training speed on the premise of ensuring the accuracy of the model. The model parameters, data amount, transmission delay and convergence epochs on EIEVA-AEFL model training are reduced by 78%, 89%, 84% and 36% respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call