In this study, Hölder Exponent (HE), Variance (var) and densely populated range are measured to extract meaningful features from the high resolution (HR) images. These extracted features are considered here as the reduced representation of high resolution (HR) images. Five-layer simplified Artificial Neural Network (ANN) architecture configured and trained using the reduced representation for building detection and counting in HR images. The scale and orientation of the movable window are changed to find an optimum bounding box of the buildings. The method is validated by applying on World View-2 pan-sharpened multispectral images having spatial resolution 0.46 m. In comparison with CNN and ResNet-18, the accuracy assessment result is quite promising (92%) with proposed method for detecting buildings in HR images. The proposed method, distinctly identified scale variant buildings and detected individual buildings even if they are in close proximity.