Enhanced Sequence-to-Sequence Deep Transfer Learning for Day-Ahead Electricity Load Forecasting

  • Abstract
  • Highlights & Summary
  • PDF
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon
Take notes icon Take Notes

Electricity load forecasting is a crucial undertaking within all the deregulated markets globally. Among the research challenges on a global scale, the investigation of deep transfer learning (DTL) in the field of electricity load forecasting represents a fundamental effort that can inform artificial intelligence applications in general. In this paper, a comprehensive study is reported regarding day-ahead electricity load forecasting. For this purpose, three sequence-to-sequence (Seq2seq) deep learning (DL) models are used, namely the multilayer perceptron (MLP), the convolutional neural network (CNN) and the ensemble learning model (ELM), which consists of the weighted combination of the outputs of MLP and CNN models. Also, the study focuses on the development of different forecasting strategies based on DTL, emphasizing the way the datasets are trained and fine-tuned for higher forecasting accuracy. In order to implement the forecasting strategies using deep learning models, load datasets from three Greek islands, Rhodes, Lesvos, and Chios, are used. The main purpose is to apply DTL for day-ahead predictions (1–24 h) for each month of the year for the Chios dataset after training and fine-tuning the models using the datasets of the three islands in various combinations. Four DTL strategies are illustrated. In the first strategy (DTL Case 1), each of the three DL models is trained using only the Lesvos dataset, while fine-tuning is performed on the dataset of Chios island, in order to create day-ahead predictions for the Chios load. In the second strategy (DTL Case 2), data from both Lesvos and Rhodes concurrently are used for the DL model training period, and fine-tuning is performed on the data from Chios. The third DTL strategy (DTL Case 3) involves the training of the DL models using the Lesvos dataset, and the testing period is performed directly on the Chios dataset without fine-tuning. The fourth strategy is a multi-task deep learning (MTDL) approach, which has been extensively studied in recent years. In MTDL, the three DL models are trained simultaneously on all three datasets and the final predictions are made on the unknown part of the dataset of Chios. The results obtained demonstrate that DTL can be applied with high efficiency for day-ahead load forecasting. Specifically, DTL Case 1 and 2 outperformed MTDL in terms of load prediction accuracy. Regarding the DL models, all three exhibit very high prediction accuracy, especially in the two cases with fine-tuning. The ELM excels compared to the single models. More specifically, for conducting day-ahead predictions, it is concluded that the MLP model presents the best monthly forecasts with MAPE values of 6.24% and 6.01% for the first two cases, the CNN model presents the best monthly forecasts with MAPE values of 5.57% and 5.60%, respectively, and the ELM model achieves the best monthly forecasts with MAPE values of 5.29% and 5.31%, respectively, indicating the very high accuracy it can achieve.

Similar Papers
  • PDF Download Icon
  • Research Article
  • Cite Count Icon 13
  • 10.1371/journal.pone.0282608
A hybrid CNN and ensemble model for COVID-19 lung infection detection on chest CT scans.
  • Mar 9, 2023
  • PLOS ONE
  • Ahmed A Akl + 3 more

COVID-19 is highly infectious and causes acute respiratory disease. Machine learning (ML) and deep learning (DL) models are vital in detecting disease from computerized chest tomography (CT) scans. The DL models outperformed the ML models. For COVID-19 detection from CT scan images, DL models are used as end-to-end models. Thus, the performance of the model is evaluated for the quality of the extracted feature and classification accuracy. There are four contributions included in this work. First, this research is motivated by studying the quality of the extracted feature from the DL by feeding these extracted to an ML model. In other words, we proposed comparing the end-to-end DL model performance against the approach of using DL for feature extraction and ML for the classification of COVID-19 CT scan images. Second, we proposed studying the effect of fusing extracted features from image descriptors, e.g., Scale-Invariant Feature Transform (SIFT), with extracted features from DL models. Third, we proposed a new Convolutional Neural Network (CNN) to be trained from scratch and then compared to the deep transfer learning on the same classification problem. Finally, we studied the performance gap between classic ML models against ensemble learning models. The proposed framework is evaluated using a CT dataset, where the obtained results are evaluated using five different metrics The obtained results revealed that using the proposed CNN model is better than using the well-known DL model for the purpose of feature extraction. Moreover, using a DL model for feature extraction and an ML model for the classification task achieved better results in comparison to using an end-to-end DL model for detecting COVID-19 CT scan images. Of note, the accuracy rate of the former method improved by using ensemble learning models instead of the classic ML models. The proposed method achieved the best accuracy rate of 99.39%.

  • Research Article
  • 10.1158/1538-7445.am2021-184
Abstract 184: The utility of deep metric learning for breast cancer identification on mammographic images
  • Jul 1, 2021
  • Cancer Research
  • Justin Du + 8 more

Purpose: Although deep learning (DL) models have shown increasing ability to accurately classify diagnostic images in oncology, significantly large amounts of well-curated data are often needed to match human level performance. Given the relative paucity of imaging datasets for less prevalent cancer types, there is an increasing need for methods which can improve the performance of deep learning models trained using limited diagnostic images. Deep metric learning (DML) is a potential method which can improve accuracy in deep learning models trained on limited datasets. Leveraging a triplet-loss function, DML exponentially increases training data compared to a traditional DL model. In this study, we investigated the utility of DML to improve the accuracy of DL models trained to classify cancerous lesions found on screening mammograms. Methods: Using a dataset of 2620 lesions found on routine screening mammogram, we trained both a traditional DL and DML models to classify suspicious lesions as cancerous or benign. The VGG16 architecture was used as the basis for the DL and DML models. Model performance was compared by calculating model accuracy, sensitivity, and specificity on a blinded test set of 378 lesions. In addition to individual model performance, we also measured agreement accuracy when both the DL and DML models were combined. Sub-analyses were conducted to identify phenotypes which were best suited for each model type. Both models underwent hyperparameters optimization to identify ideal batch size, learning rate, and regularization to prevent overfitting. Results: We found that the combination of the traditional DL model with DML model resulted in the highest overall accuracy (78.7%) representing a 7.1% improvement compared to the traditional DL model (p<.001). Alone, the traditional DL model had an improved accuracy compared to the DML model (71.4% vs 66.4%). The traditional DL model had a higher sensitivity (94.8% vs 73.6 %) , but lower specificity (34.7% vs 55.1%) compared the DML model. Sub-analyses suggested the traditional DL model was more accurate on higher density breasts, whereas the DML model was more accurate on lower density breasts. Additionally, the traditional DL model had the highest accuracy on oval shaped lesions, compared to the DML model which was most accurate on irregularly shaped breast lesions. Conclusion: Our study suggests that addition of DML models with traditional DL models can improve diagnostic image classification performance in cancer. Our results suggest DML models may provide increased specificity and help with classification of unique populations often misclassified by traditional DL models. Further studied investigating the utility of DML on other cancer imaging tasks are necessary to successfully build more robust DL models in cancer imaging. Citation Format: Justin Du, Sachin Umrao, Enoch Chang, Marina Joel, Aidan Gilson, Guneet Janda, Rachel Choi, Yongfeng Hui, Sanjay Aneja. The utility of deep metric learning for breast cancer identification on mammographic images [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2021; 2021 Apr 10-15 and May 17-21. Philadelphia (PA): AACR; Cancer Res 2021;81(13_Suppl):Abstract nr 184.

  • Research Article
  • 10.3389/fonc.2026.1763859
Identification of KRAS mutation in rectal cancer based on a 2.5D deep learning model
  • Feb 25, 2026
  • Frontiers in Oncology
  • Chengmeng Zhang + 5 more

Objective To explore the utility of a 2.5D deep transfer learning (DTL) model for distinguishing between Kirsten rat sarcoma viral oncogene (KRAS) mutant and wild-type phenotypes in patients with rectal cancer (RC). Methods We retrospectively analyzed 138 patients with pathologically confirmed RC who underwent next-generation sequencing to detect KRAS mutations. Among these, 43 KRAS mutant and 95 wild-type cases were enrolled and divided randomly into a training set (30 mutant, 66 wild-type) and a validation set (13 mutant, 29 wild-type) in a 7:3 ratio. Tumor regions of interest (ROIs) were delineated manually slice-by-slice in thin-section arterial-phase computed tomography images. DTL and radiomic features were extracted from ROIs using 2.5D deep learning and traditional radiomic approaches, respectively. After feature-dimensionality reduction and selection, six machine learning models were employed to construct radiomic models and 2.5D deep learning models. The diagnostic performance of each model was evaluated using the area under the receiver operating characteristic curve (AUC). Results After feature selection, 10 radiomic features and 17 DTL features were included for model construction. The AUCs for the radiomic models ranged from 0.808–0.988 in the training set and 0.521–0.672 in the validation set, with the XGBoost classifier achieving the optimal performance (AUC = 0.672) in the validation set. The AUCs for the 2.5D deep learning models ranged from 0.950–1.000 in the training set and 0.788–0.913 in the validation set, with the support vector machine classifier demonstrating the best diagnostic efficacy (AUC = 0.913) in the validation set. Conclusion A 2.5D deep learning model can effectively distinguish between KRAS mutant and KRAS wild-type RC, outperforming traditional radiomic models. It provides a novel non-invasive approach for the preoperative assessment of KRAS mutation status.

  • Research Article
  • Cite Count Icon 39
  • 10.1016/j.aap.2020.105665
Efficient mapping of crash risk at intersections with connected vehicle data and deep learning models
  • Jul 16, 2020
  • Accident Analysis & Prevention
  • Jiajie Hu + 2 more

Efficient mapping of crash risk at intersections with connected vehicle data and deep learning models

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 47
  • 10.3390/vibration6010014
Deep Transfer Learning Models for Industrial Fault Diagnosis Using Vibration and Acoustic Sensors Data: A Review
  • Feb 17, 2023
  • Vibration
  • Md Roman Bhuiyan + 1 more

In order to evaluate final quality, nondestructive testing techniques for finding bearing flaws have grown in favor. The precision of image processing-based vision-based technology has greatly improved for defect identification, inspection, and classification. Deep Transfer Learning (DTL), a kind of machine learning, combines the superiority of Transfer Learning (TL) for knowledge transfer with the benefits of Deep Learning (DL) for feature representation. As a result, the discipline of Intelligent Fault Diagnosis has extensively developed and researched DTL approaches. They can improve the robustness, reliability, and usefulness of DL-based fault diagnosis techniques (IFD). IFD has been the subject of several thorough and excellent studies, although most of them have appraised important research from an algorithmic standpoint, neglecting real-world applications. DTL-based IFD strategies have also not yet undergone a full evaluation. It is necessary and imperative to go through the relevant DTL-based IFD publications in light of this. Readers will be able to grasp the most cutting-edge concepts and develop practical solutions to any IFD challenges they may encounter by doing this. The theory behind DTL is briefly discussed before describing how transfer learning algorithms may be included into deep learning models. This research study looks at a number of vision-based methods for defect detection and identification utilizing vibration acoustic sensor data. The goal of this review is to assess where vision inspection system research is right now. In this respect, image processing as well as deep learning, machine learning, transfer learning, few-shot learning, and light-weight approach and its selection were explored. This review addresses the creation of defect classifiers and vision-based fault detection systems.

  • Research Article
  • Cite Count Icon 1
  • 10.1016/j.irbm.2024.100831
A Novel QCT-Based Deep Transfer Learning Approach for Predicting Stiffness Tensor of Trabecular Bone Cubes
  • Mar 18, 2024
  • IRBM
  • Pengwei Xiao + 3 more

A Novel QCT-Based Deep Transfer Learning Approach for Predicting Stiffness Tensor of Trabecular Bone Cubes

  • Research Article
  • Cite Count Icon 71
  • 10.1016/j.eswa.2023.122159
Development of hybrid models based on deep learning and optimized machine learning algorithms for brain tumor Multi-Classification
  • Oct 18, 2023
  • Expert Systems with Applications
  • Muhammed Celik + 1 more

Development of hybrid models based on deep learning and optimized machine learning algorithms for brain tumor Multi-Classification

  • Research Article
  • 10.1186/s12885-025-14971-7
Deep multi-instance learning model based on gadoxetic acid-enhanced MRI for predicting microvascular invasion of hepatocellular carcinoma: a multicenter, retrospective study
  • Oct 22, 2025
  • BMC Cancer
  • Yi Luo + 7 more

ObjectiveMicrovascular invasion (MVI) is of great significance for the individualized treatment of hepatocellular carcinoma (HCC) and preoperative noninvasive prediction of MVI is still an urgent clinical problem. To explore the effects of different regions of interest (ROI) and image input dimensions on the performance of deep learning (DL) models, and to select the best result to develop and validate a DL model for preoperative prediction of MVI.Materials and methodsA total of 206 patients with pathologically confirmed HCC from three hospitals were retrospectively enrolled and divided into training, internal validation and external test set. Based on hepatobiliary phase images (HBP) of gadoxetic acid-enhanced MRI, 2D DL, 3D DL and 2.5D deep multi-instance learning (MIL) models were established. The receiver operating characteristic curve (ROC) was used to evaluate the predictive efficacy of the above models. Based on the optimal performance model, the T1WI-FS and T2WI-FS images were preprocessed correspondingly, and a multimodal prediction model including three sequences was constructed. The ROC, and decision curve were used to visualize the predictive ability of the model.ResultsCompared with 2D DL and 3D DL models, the 2.5D DL model based on all axial images of ROI had the highest performance, with the AUC values of 0.802 (95% CI, 0.669–0.936) and 0.759 (95% CI, 0.643–0.875) in the validation and test sets. The AUCs of the multimodal MRI model were 0.954 (95% CI, 0.920–0.989) in the training set, 0.857 (95% CI, 0.736–0.978) in the validation set, and 0.788 (95% CI, 0.681–0.895) in the test set.ConclusionThe DL model that selects all axial slices of intratumor and peritumor as input shows robust capability in predicting MVI, which is expected to help clinical decision-making of individualized treatment for HCC.Supplementary InformationThe online version contains supplementary material available at 10.1186/s12885-025-14971-7.

  • Research Article
  • 10.1093/humrep/deab130.259
P–260 Towards better explainable deep learning models for embryo selection in ART
  • Aug 6, 2021
  • Human Reproduction
  • Ashu Sharma + 4 more

Study question Can heatmaps generated by occlusion explain the patterns learned by deep learning (DL) models classifying the embryo viability in ART? Summary answer Occlusion experiments generate heatmaps that reveal which regions in frames of time-lapse video (TLV) are more discriminative for classification and prediction by the DL models. What is known already DL has widely been explored in ART for embryo selection. Depending upon input (video or image), different DL models classifying embryo viability are developed. However, whether the prediction is based on actual input features or random guessing is unknown. The embryo selection in ART is subjective. If the intention is using DL models’ prediction to transfer, freeze or discard the embryo, explanations of how they interpret embryonic development features brings transparency and trust. In other areas, heatmaps are used for explaining DL predictions. The heatmaps can be a tool to understand patterns learned by DL models for embryo selection. Study design, size, duration We trained two separate DL models for predicting the presence of fetal heartbeat for the transferred embryos. We further used occlusion generated heatmaps to explain the predictions. For training, retrospective data was used. The input dataset consisted of 136 TLVs and corresponding patient data for 132 participants (128: single embryo transfers and 8: double embryo transfer) from both IVF and ICSI treatment. Each video was assessed by an embryologist. Participants/materials, setting, methods DL models (A as ResNet–18, B as VGG16) are trained for predicting the presence of fetal heartbeat on a single frame extracted from TLV after day three or later. Model A has a better recall (0.7) compared to B (0.5). Heatmaps explain the reason behind models’ recall rate by visually representing patterns learned by them. Using occlusion filter size 30*30 with stride 14 and size 50*50 with stride 25, we generate heatmaps for both models. Main results and the role of chance The heatmaps generated using occlusion can represent visually the patterns discovered by the DL models when predicting the presence of a fetal heartbeat. Using occlusion filter size 30*30 with stride 14, we verified that Model B has lower recall because the heatmaps show that the model finds redundant features present outside the embryo region in many input frames. It could be interpreted that either the model has not learned relevant patterns or is more robust to noise. This representation of DL models equips us in better decision-making, whether to consider or discard the prediction or rather train the model further, preprocess training data or change network architecture. The heatmaps revealed that for frames where significant patterns learned by the models are within the embryo region, more weight was given to specific features like the inner cell mass, trophectoderm and some parts within the zona pellucida. Moreover, the heat maps generated using occlusion are independent of the underlying model’s architecture as the same experiment settings were used for both models. For occlusion filter size 50*50 with stride 25, the expanse of input regions (in or outside the embryo) considered relevant could be visualized for both models A and B. Limitations, reasons for caution Heatmaps generated by occluding input regions give a visual representation of features in individual frames not directly on videos. Explaining DL models by heatmaps besides occlusion, other techniques (Grad-Cam) exist but were not evaluated. Furthermore, there is no quantitative measure for evaluating whether heatmaps are a good explanation or not. Wider implications of the findings: The heatmaps make the patterns discovered by DL models visually recognized and bring forth the prominent portions of embryo regions. This will again improve understanding and trust in DL models’ predictions. Visual representation of DL models using heatmaps enables interpreting a prediction, performing model analysis and determining scope for improvement. Trial registration number Not applicable

  • Research Article
  • Cite Count Icon 29
  • 10.1111/1365-2478.13097
Learning from unlabelled real seismic data: Fault detection based on transfer learning
  • Jun 6, 2021
  • Geophysical Prospecting
  • Ruoshui Zhou + 3 more

ABSTRACTSignificant advances have been made towards fault detection using deep learning. However, the fault labelling of seismic data requires great human effort. The resulting small sample problem makes traditional deep learning methods difficult to achieve desired results. Existing research proposes to train a deep learning model with labelled synthetic seismic data to get good fault detection results. However, due to the complexity of the actual geological situation, there are inevitable differences between synthetic seismic data and real seismic data in many aspects such as seismic signal frequency, frequency of fault distribution and degree of noise disturbance, which lead to the fact that the deep learning model trained by synthetic seismic data is difficult to get good fault detection result in field data applications. We propose to use transfer learning to reduce the impact of data differences to solve this problem: part of the deep transfer learning model is used to learn fault‐related features. And the other part of the deep transfer learning model is used to mine common features between the real seismic data and the synthetic seismic data, which makes the deep transfer learning model more suitable for real seismic data. Compared with the latest research progress, our method can greatly improve the effect of fault detection without real data label, which can significantly save the cost of manual label processing.

  • Research Article
  • Cite Count Icon 2
  • 10.1016/j.surg.2024.03.054
Deep learning predicts postoperative opioids refills in a multi-institutional cohort of surgical patients
  • May 25, 2024
  • Surgery
  • Hojjat Salehinejad + 4 more

Deep learning predicts postoperative opioids refills in a multi-institutional cohort of surgical patients

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 2
  • 10.1038/s41598-024-67217-0
Predictive value of MRI-based deep learning model for lymphovascular invasion status in node-negative invasive breast cancer
  • Jul 13, 2024
  • Scientific Reports
  • Rong Liang + 8 more

To retrospectively assess the effectiveness of deep learning (DL) model, based on breast magnetic resonance imaging (MRI), in predicting preoperative lymphovascular invasion (LVI) status in patients diagnosed with invasive breast cancer who have negative axillary lymph nodes (LNs). Data was gathered from 280 patients, including 148 with LVI-positive and 141 with LVI-negative lesions. These patients had undergone preoperative breast MRI and were histopathologically confirmed to have invasive breast cancer without axillary LN metastasis. The cohort was randomly split into training and validation groups in a 7:3 ratio. Radiomics features for each lesion were extracted from the first post-contrast dynamic contrast-enhanced (DCE)-MRI. The Least Absolute Shrinkage and Selection Operator (LASSO) regression method and logistic regression analyses were employed to identify significant radiomic features and clinicoradiological variables. These models were established using four machine learning (ML) algorithms and one DL algorithm. The predictive performance of the models (radiomics, clinicoradiological, and combination) was assessed through discrimination and compared using the DeLong test. Four clinicoradiological parameters and 10 radiomic features were selected by LASSO for model development. The Multilayer Perceptron (MLP) model, constructed using both radiomic and clinicoradiological features, demonstrated excellent performance in predicting LVI, achieving a high area under the curve (AUC) of 0.835 for validation. The DL model (MLP-radiomic) achieved the highest accuracy (AUC = 0.896), followed by DL model (MLP-combination) with an AUC of 0.835. Both DL models were significantly superior to the ML model (RF-clinical) with an AUC of 0.720. The DL model (MLP), which integrates radiomic features with clinicoradiological information, effectively aids in the preoperative determination of LVI status in patients with invasive breast cancer and negative axillary LNs. This is beneficial for making informed clinical decisions.

  • Research Article
  • Cite Count Icon 19
  • 10.18280/ts.390515
Human Gender Prediction Based on Deep Transfer Learning from Panoramic Dental Radiograph Images
  • Nov 30, 2022
  • Traitement du Signal
  • Isa Ataş

Panoramic Dental Radiography (PDR) image processing is one of the most extensively used manual methods for gender determination in forensic medicine. With the assistance of the PDR images, a person's biological gender determination can be performed through analyzing skeletal structures expressing sexual dimorphism. Manual approaches require a wide range of mandibular parameter measurements in metric units. Besides being time-consuming, these methods also necessitate the employment of experienced professionals. In this context, deep learning models are widely utilized in the auto-analysis of radiological images nowadays, owing to their high processing speed, accuracy, and stability. In our study, a data set consisting of 24,000 dental panoramic images was prepared for binary classification, and the transfer learning method was used to accelerate the training and increase the performance of our proposed DenseNet121 deep learning model. With the transfer learning method, instead of starting the learning process from scratch, the existing patterns learned beforehand were used. Extensive comparisons were made using deep transfer learning (DTL) models VGG16, ResNet50, and EfficientNetB6 to assess the classification performance of the proposed model in PDR images. According to the findings of the comparative analysis, the proposed model outperformed the other approaches by achieving a success rate of 97.25% in gender classification.

  • Research Article
  • 10.1093/humrep/deaf097.481
P-172 Retrospective comparison of deep learning versus logistic regression for selecting the best embryo for transfer
  • Jun 1, 2025
  • Human Reproduction
  • T Vermilyea + 1 more

Study question How does the performance of a deep learning model compare to a logistic regression model for embryo ranking? Summary answer Top-ranked embryos via both deep learning and logistic regression showed higher pregnancy rates, but deep learning showed a greater improvement in pregnancy success. What is known already Artificial intelligence (AI) algorithms are now being utilized for embryo selection in IVF. However, there remains a debate about whether interpretable machine learning models are more suitable versus deep learning models, which are often considered “black-box” due to their complexity. Two previously developed models were compared in this study: 1) A deep learning model which utilizes CNNs to automatically analyze a static image of an embryo, and 2) A logistic regression model that incorporates embryo morphology grade (ie 5AB), embryo day (5, 6, or 7), and patient age. Both models were developed using 10,000+ embryo images from 11 U.S. clinics. Study design, size, duration A total of 4543 images and morphology grades of individual embryos were collected prospectively from 870 patients at two U.S. clinics using an embryo image capture software, from January - December 2024. Of these, 406 embryos were transferred. 90% of the transferred embryos were genetically tested. None of these embryos were used for training or testing either AI models. Participants/materials, setting, methods After removing aneuploid embryos, embryos were ranked within each patient’s cohort using both the deep learning and logistic regression models. We then compared pregnancy rates of embryos that were top-ranked in their cohort versus those that were lower-ranked. To reduce bias, we included only patients with multiple viable embryos to choose from and only considered first transfers. Differences in biochemical pregnancy and fetal heartbeat were compared for both approaches. Main results and the role of chance Retrospectively, the top-ranked deep learning embryo was transferred 43% of the time, whereas the top-ranked logistic regression embryo was transferred 76% of the time. Transferring the top-ranked embryo by deep learning was associated with an 8.9% higher pregnancy rate (76.1% vs. 67.2%, p = 0.08) and a 6.2% higher fetal heartbeat (60.0% vs 53.8%, p = 0.38). For logistic regression, the top-ranked embryo selection was associated with a 4.1% higher pregnancy rate (71.3% vs. 67.2%, p = 0.51) and a 4.1% higher fetal heartbeat (56.8% vs 52.7%, p = 0.45). P-values were >0.05 for all comparisons, indicating statistical non-significance. For all comparisons, there were no statistical or clinical differences in the average age of the patients between the two groups, nor were there differences in the average AI score of the top-ranked embryo in the cohort, suggesting that these comparisons did not introduce significant biases. Limitations, reasons for caution As this was a retrospective study, clinical decision making about which embryo to transfer was not influenced by either model rankings. The dataset was limited to two clinics, so further prospective validation is needed. Wider implications of the findings Both deep learning and logistic regression models show promise for selecting the top ranked embryo in a patient’s cohort. The simplicity and interpretability of the logistic regression may allow for faster adoption and clinical trust, while deep learning may further enhance success rates. Trial registration number No

  • Conference Article
  • Cite Count Icon 8
  • 10.3997/2214-4609.201901607
Progress and Challenges in Deep Learning Analysis of Geoscience Images
  • Jan 1, 2019
  • R Pires De Lima + 3 more

Summary Deep learning and deep convolutional neural network (CNN) models have shown promising results and are gaining popularity in the geoscientific community. In contrast to traditional machine learning methodologies based on a suite of carefully selected attributes, deep learning is based on the raw images themselves. Deep CNNs are currently the tools of choice for computer vision tasks such as self-driving cars. Unfortunately, deep learning is encumbered by jargon that is unfamiliar to most geoscientists, providing black box applications resulting in two common reactions: deep learning models are the solution for everything or deep learning models are a modern fad that discards the interpreter's insight or experience with a given problem. In this presentation, we show that CNN models are based on attributes similar to those we use in seismic interpretation and remote sensing. We also show that through a process called transfer learning based on the analysis of 2D colour images, we can exploit much of the previous work developed for image recognition applications to rocks. We illustrate the successful use of transfer learning to microfossil classification, core description, petrographic analysis, and hand specimen identification. We also discuss some of the challenges in CNN analysis of 3D seismic data volumes.

Save Icon
Up Arrow
Open/Close
  • Ask R Discovery Star icon
  • Chat PDF Star icon
Setting-up Chat
Loading Interface