Abstract

Architectural style description is important for the renewal and reuse of historical monuments. However, the identification of Neoclassical buildings is more difficult than that of other European architectural styles owing to the revival of classicism and its numerous variants. This paper proposes the image-to-text model NeoDescriber for the automatic identification and description of Neoclassical buildings based on the combination of dedicated visual classification and detection modules organized by expert knowledge in architecture. NeoDescriber takes building façade images as input and follows a coarse-to-fine description logic to generate structured annotations covering the whole building and key building elements. We built several building image datasets composed of more than 300 historical monuments to train and validate different visual modules. The results showed the acceptance performance of both classification and detection modules, with average precisions ranging from 73.6% to 95.2%. The generated text descriptions were then compared with results from BLIP-2 by qualitative and quantitative analyses. With human descriptions as a reference, the results showed that NeoDescriber’s generated descriptions were more precise and detailed, having a higher BERTScore for Neoclassical monuments. Thus, NeoDescriber could successfully recognize and describe Neoclassical buildings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.