Abstract

The performance of any machine learning algorithm heavily depends on the quality and quantity of the training data. Machine learning algorithms, driven by training data can accurately predict and produce the right outcome when trained through enough amount of quality data. In the medical applications, being more critical, the accuracy is of utmost importance. Obtaining medical imaging data, enough to train machine learning algorithm is difficult due to a variety of reasons. An effort has been made to produce an augmented dental radiography dataset to train machine learning algorithms. 116 panoramic dental radiographs have been manually segmented for each tooth producing 32 classes of teeth. Out of 3712 images of individual tooth, 2910 were used for machine learning through general augmentation methods that include rotation, intensity transformation and flipping of the images, creating a massive dataset of 5.12 million unique images. The dataset is labeled and classified into 32 classes. This dataset can be used to train deep convolutional neural networks to perform classification and segmentation of teeth in x-rays, Cone-Beam CT scans and other radiographs. We retrained AlexNet on a subset of 80,000 images of the entire dataset and obtained classification accuracy of 98.88% on 10 classes. The retraining on original dataset yielded 88.31%. The result is evident of nearly a 10% increase in the performance of the classifier trained on the augmented dataset. The training and validation datasets include teeth affected with metal objects. The manually segmented dataset can be used as a benchmark to evaluate the performance of machine learning algorithms for performing tooth segmentation and tooth classification.

Highlights

  • The machine learning, especially deep convolutional neural network has been playing a key role in the advancements in the medical imaging field [1]

  • The reported that augmentation through Generative Adversarial Networks (GAN) improved the classifier performance by nearly 10%

  • We produced a massive dataset of 5.12 million images from dental radiographs

Read more

Summary

INTRODUCTION

The machine learning, especially deep convolutional neural network has been playing a key role in the advancements in the medical imaging field [1]. The large datasets used to train these models are produced manually by teams which consist of many human resources. The absence of wisdom tooth is common, the ratio of subjects with no wisdom tooth is even higher in the youth To solve this issue of lack of training dataset and natural imbalance, data augmentation is proposed to generate synthetic radiography datasets for training deep convolutional neural networks to perform classification and segmentation in CBCT, X-Rays and panoramic radiographs. We retained AlexNet [4] on a subset from this dataset containing 80,000 images and obtained 98.88% classification accuracy on 10 classes. The dataset we produced is a useful resource for training deep neural networks for tooth segmentation, classification and labeling but it can be used as a benchmark for evaluating the performance of deep learning models

RELATED WORK
MATERIALS AND METHODOLOGY
Rotate
Resize
Intensity Transformation
Horizontal Flip
EXPERIMENT AND EVALUATION
LIMITATION
CONCLUSION
Findings
FUTURE WORK
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call