Ultra high-speed target recognition in complex electromagnetic environments is a critical and fundamental machine perception issue. It is difficult to ensure privacy protection and intelligent confrontation with centralized training in many practical situations. In this paper, an efficient ultra high-speed target recognition approach, named InVision, for robot systems is proposed using deep trustworthy federated learning (DTFL). InVision is accurate, safe, fast, and robust. Particularly, geometric component transformer (GCT) is presented to significantly promote neural element's complex representation description ability. And an ambiguity-aware cooperative learning (AACL) scheme is developed to relieve the noisy label problem. Moreover, decentralized federated training (DFT) is designed to mitigate the intractable privacy protection problem, to efficiently search for similarities and reduce the representation redundancy. Furthermore, to promote the running speed of the system in real-world environments, a lightweight deep architecture, called Mobile-XB, is developed. Extensive quantitative and qualitative experiments are carried out, and the results demonstrate that InVision greatly outperforms the outstanding comparison methods, establishing efficient connections and extraction, and providing security guarantees.