Abstract

This paper introduces a new approach to implementing semantic analysis in modern enterprise content management (ECM) systems. The semantic analysis system uses machine learning to analyze official and technical enterprise documents, extracting specified attributes from them for further use. We propose implementing semantic search through an extracted-data configurator, which is responsible for creating and managing ontologies. Given the name of a document type, the configurator generates a graph containing the attributes to be extracted (official terms and sections, dates, etc.), regular expressions for finding sentences that probably contain the desired attribute, and Yargy rules and regular-expression rules for extracting the attributes from the resulting arrays of sentences. The proposed solution was successfully validated and tested on a dataset of contract agreements and protocols from an engineering enterprise.
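The two-stage extraction described above (first select candidate sentences, then apply an extraction rule to them) can be illustrated with a minimal sketch. It is not the authors' implementation: the names `AttributeSpec`, `CONFIG`, `split_sentences`, and `extract_attributes`, the sample patterns, and the English sample text are all hypothetical. A flat dictionary keyed by document type stands in for the generated graph, and plain regular expressions stand in for the Yargy rules.

```python
import re
from dataclasses import dataclass


@dataclass
class AttributeSpec:
    """One attribute to extract (hypothetical structure, not from the paper)."""
    name: str
    sentence_pattern: re.Pattern  # finds sentences likely to contain the attribute
    value_pattern: re.Pattern     # extracts the value from a candidate sentence


# Assumed stand-in for the configurator output: document type -> attribute specs.
CONFIG = {
    "contract": [
        AttributeSpec(
            name="signing_date",
            sentence_pattern=re.compile(r"\bsigned\b", re.IGNORECASE),
            value_pattern=re.compile(r"\b(\d{2}\.\d{2}\.\d{4})\b"),
        ),
        AttributeSpec(
            name="contract_number",
            sentence_pattern=re.compile(r"\bcontract\s+no\.", re.IGNORECASE),
            value_pattern=re.compile(r"No\.\s*([A-Z0-9/-]+)"),
        ),
    ]
}


def split_sentences(text):
    """Naive splitter: break after ., ! or ? when an uppercase letter follows."""
    parts = re.split(r"(?<=[.!?])\s+(?=[A-Z])", text)
    return [p.strip() for p in parts if p.strip()]


def extract_attributes(doc_type, text):
    """Return {attribute name: first extracted value} for a known document type."""
    result = {}
    sentences = split_sentences(text)
    for spec in CONFIG.get(doc_type, []):
        # Stage 1: keep only sentences that probably contain the attribute.
        candidates = [s for s in sentences if spec.sentence_pattern.search(s)]
        # Stage 2: apply the extraction rule to each candidate in order.
        for sentence in candidates:
            match = spec.value_pattern.search(sentence)
            if match:
                result[spec.name] = match.group(1)
                break
    return result
```

Calling `extract_attributes("contract", text)` on a document of a configured type returns a dictionary of the attributes found, while an unconfigured document type yields an empty dictionary. Filtering sentences before extraction keeps the extraction rules narrow, since each rule only has to handle sentences already judged relevant.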
