Abstract

Detecting people in urban environments from satellite imagery can support a variety of applications, such as urban planning, business management, crisis management, military operations, and security. In this study, a WorldView-3 satellite image of Prague was processed. Several variants of feature-extracting networks, referred to as backbone networks, were tested with the Faster R-CNN model. This model combines region proposal networks with object detection, offering a balance between speed and accuracy that is well suited to dense and varied urban environments. Data augmentation was used to increase the robustness of the models, which contributed to the improvement of classification results. Achieving a high level of accuracy remains an ongoing challenge due to the low spatial resolution of available imagery. An F1 score of 54% was achieved using data augmentation, a 15 cm buffer, and a maximum distance limit of 60 cm.
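
To illustrate how an F1 score can depend on a maximum distance limit, the following is a minimal sketch, not the procedure used in the study. It assumes that detections and reference annotations are reduced to point coordinates in metres, and that each detection may be matched to at most one reference point within the limit, nearest pairs first. The function name `f1_with_distance_limit` and the greedy matching rule are hypothetical choices made for this example. The 15 cm buffer is not modelled, because the abstract does not say how it is applied.

```python
import math


def f1_with_distance_limit(detections, references, max_dist_m=0.60):
    """Return the F1 score for point detections matched to reference points.

    Assumption: a detection counts as a true positive when it is paired
    one-to-one with a reference point no farther away than max_dist_m.
    """
    # Collect every detection/reference pair within the limit, nearest first.
    candidates = sorted(
        (math.dist(d, r), i, j)
        for i, d in enumerate(detections)
        for j, r in enumerate(references)
        if math.dist(d, r) <= max_dist_m
    )

    # Greedy one-to-one assignment: each point is used at most once.
    matched_det, matched_ref = set(), set()
    for _, i, j in candidates:
        if i not in matched_det and j not in matched_ref:
            matched_det.add(i)
            matched_ref.add(j)

    tp = len(matched_det)            # matched detections
    fp = len(detections) - tp        # detections with no reference nearby
    fn = len(references) - tp        # reference points that were missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical coordinates in metres, for illustration only.
    dets = [(0.0, 0.0), (1.0, 1.0), (5.0, 5.0)]
    refs = [(0.1, 0.0), (1.0, 1.5), (9.0, 9.0)]
    print(f1_with_distance_limit(dets, refs))
```

In this toy input, two of the three detections lie within 60 cm of a reference point, so there is one false positive and one missed person. Tightening `max_dist_m` would turn the second pair into a miss as well, which shows why the reported score has to be read together with the distance limit.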
