OVERFITTING IN MACHINE LEARNING: PROBLEMS AND SOLUTIONS

V.A. Parasich, G.I. Volovich, I.V. Parasich, A.V. Parasich, S.G. Nekrasov

doi:10.14529/ctcr240202

Abstract

Overfitting is one of the most important factors affecting the performance of machine learning algorithms, so handling it effectively is essential when solving machine learning problems. Research objective. The purpose of this article is to study the problem of overfitting in machine learning tasks and to discuss effective training methods aimed at preventing it. Materials and methods. The article focuses on non-standard, practically important issues related to overfitting. It considers the causes of overfitting, its consequences, and methods of combating it. The dependence of overfitting and generalization ability on the quality of the features and the properties of the training set is studied. Particular attention is paid to training and to forming the training set in multidimensional feature spaces. The article considers how to form the training set correctly and how to add data to it so as to prevent overfitting, as well as how an incorrect distribution of the target variable affects overfitting. It explains why methods that add incorrect data to the training set, such as MixUp and CutMix, can improve the quality of training. It also considers the algorithm's confidence in its predictions, including overconfidence in incorrect predictions, which is also typical of ChatGPT, and the problem of assessing the quality of an algorithm. It is shown why normalization can help avoid overfitting. Results. An algorithm for training decision trees, Random Samples Mix-Up, is proposed to combat overfitting; it improves the quality of the trained decision trees. A comparative analysis of model quality before and after applying this method is carried out, and experiments on real data confirm its effectiveness. Conclusion. The results of the study can be useful to developers of machine learning algorithms and specialists in artificial intelligence, both for developing new algorithms and for improving the efficiency of existing ones.
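The abstract mentions MixUp as a method that adds "incorrect" data to the training set. As a rough illustration, the sketch below shows the standard MixUp idea on tabular data: each synthetic sample is a convex combination of two randomly paired training samples and of their one-hot labels. This is not the Random Samples Mix-Up algorithm proposed in the article, whose details are not given in the abstract. The function name `mixup_batch`, the parameter `alpha`, and the toy data are assumptions made for illustration only.

```python
import numpy as np

def mixup_batch(X, Y, alpha=0.2, rng=None):
    """Standard MixUp sketch (hypothetical helper, not the article's method).

    X: (n, d) feature matrix; Y: (n, k) one-hot (or soft) label matrix.
    Returns convex combinations of each sample with a randomly chosen partner.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = X.shape[0]
    # One mixing coefficient per sample, drawn from Beta(alpha, alpha).
    lam = rng.beta(alpha, alpha, size=(n, 1))
    # Random partner for every sample.
    perm = rng.permutation(n)
    X_mix = lam * X + (1.0 - lam) * X[perm]
    Y_mix = lam * Y + (1.0 - lam) * Y[perm]
    return X_mix, Y_mix

# Toy data (assumed for illustration): 8 samples, 3 features, 2 classes.
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))
labels = rng.integers(0, 2, size=8)
Y = np.eye(2)[labels]  # one-hot encoding

X_mix, Y_mix = mixup_batch(X, Y, alpha=0.2, rng=rng)
```

The mixed labels stay valid probability vectors (each row sums to 1), so the synthetic samples carry soft targets rather than hard class assignments, which is one common explanation of why such data can reduce overconfident predictions.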


Journal: Bulletin of the South Ural State University. Ser. Computer Technologies, Automatic Control & Radioelectronics
Publication Date: May 1, 2024
License type: cc-by
