First-person shooter (FPS) focuses on first-person perspective action gameplay, with gunfights usually giving the player a choice of weapons, significantly impacting how the player approaches or strategies. General military-themed FPS games have realistic models with actual weapons’ shapes and characteristics. This type of game requires high aiming accuracy while using a mouse on a PC. However, not all players have a fast response time in knowing the surrounding situation. New players may need aid when targeting enemies in the FPS world. One popular yet underhanded method is injecting a program code using a dynamic-link library (DLL) to manipulate memory and asset data from the game. Instead of DLL, we promote a novel approach using the player’s real-time game screen, detecting the person without injecting program code into the game. The you only look once (YOLO) algorithm is used as an object detector model since it can process images in real time for up to 45 frames per second. The proposed object detection has an outstanding performance with 65% accuracy, 98% precision, and 61% recall of 51 tests for each game. YOLO’s fastest detection speed produces an average of 35 FPS on the YOLO tiny variant using a mixed precision (half) graphics processing unit (GPU).