Abstract

This dataset consists of linguistic data (recordings and transcriptions/translations) gathered on the Pnar language by Dr. Hiram Ring. Pnar (ISO 639-3: pbv) is a language spoken in Meghalaya state of northeast India by around 400,000 people. The recordings and transcriptions in this dataset were carried out in and around Jowai between June 2011 and July 2013. There are two files that describe the contents of this dataset in more detail: README_Dataset.txt and README_Toolbox.txt - these should be consulted for instructions on how to access the data, once downloaded. The Dictionary.txt and Texts.txt files contain lexical data and interlinearized transcriptions/translations respectively. The majority of the dataset consists of sound files encoded as *.flc or FLAC (Free Lossless Audio Codec) files. These have the advantage of containing lossless audio but taking up significantly less space than lossless WAV files.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call