The development of information (intelligent) learning systems, electronic document management systems, web-oriented systems working with text information in natural language has led to an increase in the volume of educational content and/or arrays of processed full-text documents. All this requires new means of organizing access to information, many of which should be classified as intelligent systems for knowledge processing. One of the effective approaches to identifying and processing the meaning of educational content (and/or text documents) is the use of ontologies. The purpose of the article is research, analysis of various approaches to determining the semantic content of texts in natural language, consideration of existing concepts of text analysis and prospects for using the proposed ontological approach to semantic analysis of texts in natural language. Research methods are methods of semantic analysis of the main concepts of the analyzed subject area (semantic analysis of texts in natural language). The article considers an approach to the linguistic analysis of texts based on ontological modeling. The novelty of the research is the application of the proposed ontological approach to the semantic analysis of texts in natural language to determine the meaning (semantics) of text information, which is used in intelligent systems of various classes. The conclusion of the research carried out in the article is as follows: an ontological approach to the semantic analysis of natural language text, its tasks and methods is proposed. The use of the proposed approach to text analysis leads to the understanding of semantic analysis as a single triad: <Ontology – Text – Meaning>. For effective and correct extraction of knowledge, it is suggested to use a multi-level ontology. The result of the interaction of a specific natural language text with an ontology is an ontological meaning – a set of interconnected subgraphs of the ontograph. The ontological content is extracted from the ontograph using a semantic analyzer. The dialogue processor examines the syntactic tree of sentence parsing (a connected element of the natural language text) and, based on the given question, finds a fragment in the text that is the answer to the question. Computer understanding (in information or intellectual systems) of natural language text is achieved, in particular, by: immersion of the text in a single environment of knowledge – ontology; formal presentation of meaning (semantics) in the knowledge base of the corresponding system; the possibility of operations on the ontological content. The proposed approach can be used to create intelligent information repositories that work in a single knowledge environment. The proposed approach to the semantic analysis of texts in natural language is focused on the automatic extraction of metadata from texts of various nature (for example, a text document, Internet content, educational content of relevant online courses, description of computer and board games). With further development of the proposed approach, it can be used in systems of automatic referencing of scientific publications, meaningful interpretation of multimedia content, training and testing (including elements of visual display of information and elements of gamification).
Read full abstract