Abstract
During a construction project lifecycle, an extensive corpus of unstructured or semistructured text documents is generated. The nature of unstructured sources impedes users’ acquisition, analysis, and reuse of relevant information in an integral form, leading to a possible reduction in project performance because of untimely or inadequate decisions. This paper explores the representation of information from unstructured documents in the form of a key-phrase network, intended to provide users with the possibility to visualize and analyze valuable project facts with less effort. A network of key phrases automatically extracted from various types of unstructured documents, with relations based on contextual similarity, was implemented as a graph database, enabling project participants to extract and visualize various patterns in data. With the objective of constructing a domain-independent key-phrase network with minimal expert involvement, an approach to detect key phrases in a multilingual environment was examined by using measures of association between words while avoiding text content from less informative contexts. A possible application is demonstrated using key-phrase networks generated from two complex international construction projects.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.