Abstract
In this paper, we present state-of-the-art models for Automatic Speech Recognition in Hebrew, obtained through self-supervised training. The motivation for self-supervised learning is that, although it is unlikely to match the accuracy of a fully supervised approach, it can still achieve strong results with a relatively small amount of data. This training scheme allows us to train a model on unlabeled data (or to use a pre-trained model, which is usually more accessible). Its goal in the initial unsupervised phase is to learn good representations from raw audio samples that are useful for speech recognition tasks, without any labeled data. The model can then be fine-tuned on a particular dataset for a specific purpose, meaning that our involvement occurs mainly in the last layers of the model. This kind of training has proved very powerful. We present a complete framework, from model to practice, with simulations and model training, and report impressive results on Hebrew.
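The pretrain-then-fine-tune recipe described above can be illustrated with a minimal sketch. The abstract does not name the architecture, so the following assumes a wav2vec 2.0-style model loaded through the HuggingFace transformers library; the checkpoint name, vocabulary size, and dummy batch are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch of fine-tuning a self-supervised speech model for Hebrew ASR.
# All names and sizes below are assumptions for illustration only.
import torch
from transformers import Wav2Vec2ForCTC

# Load a multilingual self-supervised checkpoint; only the small CTC head
# on top is newly initialized for the target language.
model = Wav2Vec2ForCTC.from_pretrained(
    "facebook/wav2vec2-large-xlsr-53",  # hypothetical pre-trained checkpoint
    vocab_size=32,                      # assumed Hebrew character vocabulary size
    ctc_loss_reduction="mean",
)

# Freeze the convolutional feature encoder learned in the unsupervised phase,
# so fine-tuning mainly updates the later transformer layers and the CTC head,
# mirroring the abstract's point that our involvement occurs in the last layers.
model.freeze_feature_encoder()

# One fine-tuning step on a dummy batch: 1 s of 16 kHz raw audio per example
# and stand-in character labels (index 0 is reserved for the CTC blank).
audio = torch.randn(2, 16_000)
labels = torch.randint(1, 32, (2, 20))
loss = model(input_values=audio, labels=labels).loss
loss.backward()  # in practice, wrap this in an optimizer loop over real data
```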