Need For Large Training Sets Research Articles

Segmentation of computed tomography (CT) is important for many clinical procedures including personalized cardiac ablation for the management of cardiac arrhythmias. While segmentation can be automated by machine learning (ML), it is limited by the need for large, labeled training data that may be difficult to obtain. We set out to combine ML of cardiac CT with domain knowledge, which reduces the need for large training datasets by encoding cardiac geometry, which we then tested in independent datasets and in a prospective study of atrial fibrillation (AF) ablation. We mathematically represented atrial anatomy with simple geometric shapes and derived a model to parse cardiac structures in a small set of N = 6 digital hearts. The model, termed "virtual dissection," was used to train ML to segment cardiac CT in N = 20 patients, then tested in independent datasets and in a prospective study. In independent test cohorts (N = 160) from 2 Institutions with different CT scanners, atrial structures were accurately segmented with Dice scores of 96.7% in internal (IQR: 95.3%-97.7%) and 93.5% in external (IQR: 91.9%-94.7%) test data, with good agreement with experts (r = 0.99; p < 0.0001). In a prospective study of 42 patients at ablation, this approach reduced segmentation time by 85% (2.3 ± 0.8 vs. 15.0 ± 6.9 min, p < 0.0001), yet provided similar Dice scores to experts (93.9% (IQR: 93.0%-94.6%) vs. 94.4% (IQR: 92.8%-95.7%), p = NS). Encoding cardiac geometry using mathematical models greatly accelerated training of ML to segment CT, reducing the need for large training sets while retaining accuracy in independent test data. Combining ML with domain knowledge may have broad applications.

Read full abstract

Brain haemorrhages often require urgent treatment with a consequent need for quick and accurate diagnosis. Therefore, in this study, we investigate Support Vector Machine (SVM) classifiers for detecting brain haemorrhages using Electrical Impedance Tomography (EIT) measurement frames. A 2-layer model of the head, along with a series of haemorrhages, is designed as both numerical models and physical phantoms. EIT measurement frames, taken from an electrode array placed on the head surface, are used to train and test linear SVM classifiers. Various scenarios are implemented on both platforms to examine the impact of variables such as noise level, lesion location, lesion size, variation in electrode positioning, and variation in anatomy, on the classifier performance. The classifier performed well in numerical models (sensitivity and specificity of 90%+) with signal-to-noise ratios of 60 dB+, was independent of lesion location, and could detect lesions reliably down to the tested minimum volume of 5 ml. Slight variations in electrode layout did not affect performance. Performance was affected by variations in anatomy however, emphasising the need for large training sets covering different anatomies. The phantom models proved more challenging, with maximal sensitivity and specificity of 75% when used with the linear SVM. Finally, the performance of two more complex classifiers is briefly examined and compared to the linear SVM classifier. These results demonstrate that a radial basis function (RBF) SVM classifier and a neural network classifier can improve detection accuracy. Classifiers applied to EIT measurement frames is a novel approach for lesion detection and may offer an effective diagnostic tool clinically. A challenge is to translate the strong results from numerical models into real world phantoms and ultimately human patients, as well as the selection and development of optimal classifiers for this application.

Read full abstract

Need For Large Training Sets Research Articles

Articles published on Need For Large Training Sets

Transferability of Machine Learning Models for Geogenic Contaminated Groundwaters.

Segmenting computed tomograms for cardiac ablation using machine learning leveraged by domain knowledge encoding.

The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior.

UWB Radar Applied to Lane Occupation and Vehicle Classification

Impact of Training Set Size on the Ability of Deep Neural Networks to Deal with Omission Noise

Predicting the dissolution kinetics of silicate glasses by topology-informed machine learning

Brain haemorrhage detection using a SVM classifier with electrical impedance tomography measurement frames.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Need For Large Training Sets Research Articles

Articles published on Need For Large Training Sets

Transferability of Machine Learning Models for Geogenic Contaminated Groundwaters.

Segmenting computed tomograms for cardiac ablation using machine learning leveraged by domain knowledge encoding.

The Devil is in the Upsampling: Architectural Decisions Made Simpler for Denoising with Deep Image Prior.

UWB Radar Applied to Lane Occupation and Vehicle Classification

Impact of Training Set Size on the Ability of Deep Neural Networks to Deal with Omission Noise

Predicting the dissolution kinetics of silicate glasses by topology-informed machine learning

Brain haemorrhage detection using a SVM classifier with electrical impedance tomography measurement frames.