ALGORITHM OF CROSS LANGUAGE FUZZY SEARCH BASED ON HASH-VECTORS FOR AUTOMATIC COMPARISON OF PERSONAL NAMES

E Yu Vakhromova,V I Goremychkin,A A Gerasimenko,V P Krivoshlyapov,I V Beketova

doi:10.14489/vkit.2020.03.pp.029-036

Abstract

The algorithm of cross language fuzzy search based on hash vectors for automatic matching of personal names is proposed. In the response mode for an input request, names in Latin spelling and a given value for the similarity measure, the algorithm determines the set of output Cyrillic names contained in the database of the information search system. The principal feature of the proposed algorithm is the rejection of the direct translation of personal names. Instead, the hashing mechanism of personal names is used, followed by mapping them into the same hidden vector space where the computational procedures of the decision-making system are built. In the process of research, it was solved a number of actual intermediate tasks. Thus, the decomposition algorithms of the explored database, the generation and clustering of the dictionary of basic morphemes are an instrument that is of independent value in solving the problem of automatically translating names from a foreign language, the translation rules of which are unknown – the socalled generalized transcription. After mapping names into a vector space, the matching operation is reduced to assessing the similarity between vectors. As a measure of similarity, several quantities were considered in the study. The most convenient measure of similarity is the cosine similarity, the critical value of which was obtained by plotting the FMR (False Match Rate) and FNMR (False Non-Match Rate) graphs. The developed algorithm is universal with respect to the languages used, that is, it does not depend on a specific alphabet. In the practical implementation of the developed algorithm, a series of experimental studies was carried out using a database containing more than 2.5 million names.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

ALGORITHM OF CROSS LANGUAGE FUZZY SEARCH BASED ON HASH-VECTORS FOR AUTOMATIC COMPARISON OF PERSONAL NAMES

Abstract

Talk to us

Similar Papers

More From: Vestnik komp'iuternykh i informatsionnykh tekhnologii

Lead the way for us

Similar Papers

Fingerprint Matching and Non-Matching Analysis for Different Tolerance Rotation Degrees in Commercial Matching Algorithms
A J Perez-Diaz ... I C Arronte-Lopez
Journal of Applied Research and Technology | VOL. 8
A J Perez-Diaz, et. al.A J Perez-Diaz ... I C Arronte-Lopez
01 Aug 2010
Journal of Applied Research and Technology | VOL. 8

Towards Reducing the Error Rates in Template Protection for Iris Recognition Using Custom Cuckoo Filters
Kiran B Raja ... Christoph Busch
-
Kiran B Raja, et. al.Kiran B Raja ... Christoph Busch
01 Jan 2019
01 Jan 2019

Design of contactless hand biometric system with relative geometric parameters
A Siswanto ... P Tarigan
-
A Siswanto, et. al.A Siswanto ... P Tarigan
01 Nov 2013
01 Nov 2013

Person Identification Using Footprint Minutiae
Riti Kushwaha ... Neeta Nain
-
Riti Kushwaha, et. al.Riti Kushwaha ... Neeta Nain
20 Sep 2019
20 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ALGORITHM OF CROSS LANGUAGE FUZZY SEARCH BASED ON HASH-VECTORS FOR AUTOMATIC COMPARISON OF PERSONAL NAMES

Abstract

Talk to us

Similar Papers

More From: Vestnik komp'iuternykh i informatsionnykh tekhnologii