A METHOD OF SEQUENTIAL SEARCHING OF OCCURANCES IN TEXT WITH THE ACCOUNT OF POSSIBLE COLLISIONS

I N Efremova,N A Emelianova,V V Efremov

doi:10.21869/2223-1560-2017-21-4-68-74

Abstract

One of the fundamental tasks of modern computer information systems is processing of symbol information, the amount of which prevails in the total amount of information. At present, rules-based approach is effectively applied to the tasks of processing symbol information. The paper deals with the peculiarities of text search applying rules-based approach. The main essence of the approach is to find pattern occurrences in the text and possible implementation of substitution (text modification). Meanwhile, when implementing search for occurrences, various kinds of collisions may arise. They should be taken into account to solve the set tasks correctly. For example, algorythms of sequential word matching can run into collisions which involve the possibility of skipping positions of pattern occurrence in a word with some structural peculiarities. The paper presents a method of searching taking into account possible collisions developed by the authors, as well as algorithmic and automatic models of the method. The developed method involves patterm markup and setting a sequence of its viewing in the form of algorithm diagram. Three algorythms (implementation variants) of the method have been developed. Algorithms differ in the possibility to carry out transition to this oк that position of the pattern and the text depending on the result of matching (equality or inequality of the current symbols of the patten and text). An automation model of the method has been developed. The proposed method of sequential matching with the pattern with collisions elimination increases the effectiveness of the computer system when implementing search procedures and symbol information processing. The method can be used in the systems of symbol information processing.

Highlights

Так если образец S2 имеет структуру: S2 P R3 P R4, где P, R3, R4- произвольные слова в алфавите В, то при сопоставлении с некоторыми словами, например, S1 Р R3 P R3 P R4, вхождение образца не будет обнаружено тогда, когда после неудачного сопоставления с первой позицией R3 образца S2 начинать следующую итерацию в соответствии с алгоритмом с текущей позиции слова и начальной буквой образца
Алгоритм преобразования образца в таблицу переходов автомата, реализующего способ сопоставления с устранением коллизий, приведен на рис.[2], где обозначено: N – длина образца S2; I, J- указатели позиции символа образца; X1: S2[1]
Способ сопоставления символьной информации с множеством образцов // Известия Юго-Западного государственного университета. 2012

Summary

СПОСОБ ПОСЛЕДОВАТЕЛЬНОГО ПОИСКА ВХОЖДЕНИЙ В ТЕКСТЕ С УЧЕТОМ ВОЗМОЖНЫХ КОЛЛИЗИЙ

Одной из фундаментальных задач современных компьютерных информационных систем является обработка символьной информации, объем которой превалирует в общем объеме всей информации. В работе описывается разработанный авторами способ поиска с учетом возможных коллизий, а также алгоритмические и автоматные модели способа. Способ последовательного сопоставления с образцом с устранением коллизий повышает эффективность вычислительной системы при реализации поисковых процедур и обработки символьной информации. Способ последовательного поиска вхождений в тексте с учетом возможных коллизий // Известия Юго-Западного государственного университета. Одной из фундаментальных задач современных компьютерных информационных систем является обработка символьной информации, объем которой превалирует в общем объеме всей информации, циркулирующей в системах обработки данных. При реализации поиска вхождений могут возникать различного рода коллизии, которые необходимо учитывать для корректного решения поставленных задач.

Постановка задачи

Способ поиска вхождений в тексте с учетом возможных коллизий

Условия Xi

Список литературы

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A METHOD OF SEQUENTIAL SEARCHING OF OCCURANCES IN TEXT WITH THE ACCOUNT OF POSSIBLE COLLISIONS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the Southwest State University

Lead the way for us

Journal: Proceedings of the Southwest State University	Publication Date: Aug 28, 2017
License type: cc-by

Similar Papers

Selforganization of Information and Value - Discussion of the Relation to Physics
Werner Ebeling
-
Werner EbelingWerner Ebeling
30 Jun 2015
30 Jun 2015

Stereo Dense Image Matching by Adaptive Fusion of Multiple-Window Matching Results
Yilong Han ... Xu Huang
Remote sensing | VOL. 12
Yilong Han, et. al.Yilong Han ... Xu Huang
24 Sep 2020
Remote sensing | VOL. 12

Robust scene matching method based on sparse representation and iterative correction
Sai Yang ... Yang Liu
Image and Vision Computing | VOL. 60
Sai Yang, et. al.Sai Yang ... Yang Liu
24 Nov 2016
Image and Vision Computing | VOL. 60

Property Matching and Weighted Matching
Amihood Amir ... Hui Zhang
-
Amihood Amir, et. al.Amihood Amir ... Hui Zhang
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A METHOD OF SEQUENTIAL SEARCHING OF OCCURANCES IN TEXT WITH THE ACCOUNT OF POSSIBLE COLLISIONS

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the Southwest State University