The study aims to provide open access to structured and annotated sound data of the dialect corpus of the Buryat language. It was decided to present the corpus on the Web in the form of a geoinformation system with data binding to a digital map, since the territorial principle plays one of the leading roles in the classification of Buryat dialects. First, the programme of the speech corpus was compiled and sound recordings performed by informants - speakers of the dialects were obtained. The recorded material was segmented and annotated in the ELAN programme. The next step was to develop a programme that allows transferring data from ELAN format files to a relational database. To present data on the Internet, a web application was developed in the form of an interactive digital map based on Google Maps Platform. As a result, a web resource was created that provides users with access to audio dialect data presented in an annotated and structured form and displayed according to the geographic principle. Scientific novelty lies in introducing into scientific and public use materials of a fundamentally new type that make it possible to obtain information about the modern sound of Buryat dialects, as well as to conduct research on modern Buryat dialect speech.
Read full abstract