Abstract
In the article the problem of increasing the information reliability in electronic document management systems is formulated, and mechanisms for controlling and correcting spelling and errors with semantic values are developed on the basis of a combined multilevel morphological analysis with n-gram models, a typical search, recognition, and classification tools. Mechanisms for verifying the spelling of a word on the basis of a vector representation of variables and comparison with a standard analogue are proposed according to the principles of using statistical, natural, structural, technological, semantic information redundancy. The solutions to the problems of increasing the information reliability based on a set of keywords, phrases, terms by comparing with virtual, frequency dictionaries located in the electronic document database and knowledge base are obtained. A technique has been developed to optimize control mechanisms and correct spelling errors based on the use of logical, semantic and structural - technological links, cross-relationships between individual or groups of words, phrases in the text information. The obtained tools to increase the reliability of the texts of electronic documents are tested in real condition, the results are compared with the conclusions of the system experts.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.