Input Reconstruction Research Articles

The reliable and efficient estimation of uncertainty in artificial intelligence (AI) models remains an open challenge in many fields, including radiation therapy. AI models are intended to automate manual steps of the treatment planning workflow. This study focuses on dose prediction models, which predict an optimal dose trade-off for each new patient for a specific treatment modality. Such models can guide physicians during optimization, be part of automatic treatment plan generation, or support decisions on treatment indication. The most common uncertainty estimation methods are based on Bayesian approximations, such as Monte Carlo dropout (MCDO) or deep ensembling (DE). These two techniques, however, have a high inference time (i.e., they require multiple inference passes) and might not work for detecting out-of-distribution (OOD) data (i.e., the uncertainty estimates for in-distribution (ID) and OOD data overlap). In this study, we present a direct uncertainty estimation method and apply it to a dose prediction U-Net architecture. It can be used to flag OOD data and give information on the quality of the dose prediction. Our method adds a branch that decodes from the bottleneck and reconstructs the CT scan given as input. The input reconstruction error can be used as a surrogate of the model uncertainty. As a proof of concept, the method is applied to proton therapy dose prediction in head and neck cancer patients. A dataset of 60 oropharyngeal patients was used to train the network using a nested cross-validation approach with 11 folds (training: 50 patients, validation: 5 patients, test: 5 patients). For the OOD experiment, we used 10 extra patients with a different head and neck sub-location. Accuracy, time gain, and OOD detection are analyzed for our method in this particular application and compared with the popular MCDO and DE. The additional branch did not reduce the accuracy of the dose prediction model. The median absolute error is close to zero for the target volumes and less than 1% of the dose prescription for organs at risk. Our input reconstruction method showed a higher Pearson correlation coefficient with the prediction error (0.620) than DE (0.447) and MCDO (between 0.599 and 0.612). Moreover, our method allows an easier identification of OOD data (no overlap between ID and OOD data and a Z-score of 34.05). The uncertainty is estimated simultaneously with the regression task and therefore requires less time and fewer computational resources. This study shows that the error in the CT scan reconstruction can be used as a surrogate of the uncertainty of the model. The Pearson correlation coefficient with the dose prediction error is slightly higher than that of state-of-the-art techniques. OOD data can be detected more easily, and the uncertainty metric is computed simultaneously with the regression task, making it faster than MCDO or DE. The code and pretrained model are available on the GitLab repository: https://gitlab.com/ai4miro/ct-reconstruction-for-uncertainty-quatification-of-hdunet.
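The evaluation logic described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the error metric (mean absolute error over voxels), the Z-score definition (distance from the ID mean in ID standard deviations), the threshold of 3.0, and all function names are assumptions made here for illustration only.

```python
import math
from statistics import mean, stdev


def reconstruction_error(ct, ct_hat):
    """Mean absolute error between an input CT and its reconstruction.

    Both arguments are flat sequences of voxel values (an assumption;
    the abstract does not state which error metric is used).
    """
    return mean(abs(a - b) for a, b in zip(ct, ct_hat))


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def z_score(value, id_scores):
    """Distance of a score from the in-distribution mean, in ID standard deviations."""
    return (value - mean(id_scores)) / stdev(id_scores)


def is_ood(value, id_scores, threshold=3.0):
    """Flag a case as out-of-distribution (hypothetical threshold)."""
    return z_score(value, id_scores) > threshold
```

In such a setup, `pearson` would be applied to per-patient reconstruction errors and dose prediction errors, while `is_ood` would compare the reconstruction error of a new patient against the errors observed on in-distribution patients.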

Despite achieving exceptional performance, deep neural networks (DNNs) remain vulnerable to adversarial examples, which are produced by corrupting clean examples with tiny perturbations. Many powerful defense methods have been presented, such as training data augmentation and input reconstruction; these, however, usually rely on prior knowledge of the targeted models or attacks. A clean example and its adversarial version are very similar but have different high-level representations in a victim model. If we can obtain a space in which the representations of similar examples are also similar, then adversarial examples can be picked out by comparing the representations of input examples in this space and in the high-level space of the victim model. Inspired by this, we propose a novel approach for detecting adversarial images, which can protect any pre-trained DNN classifier and resist an endless stream of new attacks. Specifically, we first adopt a dual autoencoder to project images to a latent space. The dual autoencoder uses self-supervised learning to ensure that small modifications to samples do not significantly alter their latent representations. Next, mutual information neural estimation is utilized to enhance the discriminability of the latent representations. We then leverage prior distribution matching to regularize the latent representations. To easily compare the representations of examples in the two spaces without relying on prior knowledge of the targeted model, a simple fully connected neural network is used to embed the learned representations into an eigenspace that is consistent with the output eigenspace of the targeted model. Through the distribution similarity of an input example in the two eigenspaces, we can judge whether the input example is adversarial or not. Extensive experiments on MNIST, CIFAR-10, and ImageNet show that the proposed method has better defense performance and transferability than state-of-the-art methods.
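The final detection step, comparing the distribution of an input in the two output spaces, can be sketched as follows. The abstract does not name the similarity measure, so the Jensen-Shannon divergence, the softmax over logits, the threshold of 0.1, and the function names below are all assumptions chosen for illustration, not the method of the paper.

```python
import math


def softmax(logits):
    """Convert raw scores to a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl(p, q, eps=1e-12):
    """Kullback-Leibler divergence KL(p || q), smoothed by eps to avoid log(0)."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, bounded above by ln(2)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def looks_adversarial(victim_logits, detector_logits, threshold=0.1):
    """Flag an input when the two output distributions disagree strongly.

    victim_logits: output of the protected classifier (hypothetical name).
    detector_logits: output of the embedding network fed by the latent
    representation (hypothetical name).
    """
    p = softmax(victim_logits)
    q = softmax(detector_logits)
    return js_divergence(p, q) > threshold
```

A clean input would be expected to yield similar distributions in both spaces and hence a small divergence, whereas an adversarial input changes the victim model's output while leaving the latent representation largely intact.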

Articles published on Input Reconstruction

Learned scalable video coding for humans and machines

A systematic review of progenitor survival and maturation in Parkinsonian models

Can input reconstruction be used to directly estimate uncertainty of a dose prediction U-Net model?

Resilient Self-Triggered Model Predictive Control of Cyber-Physical Systems Under Two-Channel False Data Injection Attacks

Multi-scale input reconstruction network and one-stage instance segmentation for enhancing heart defect prediction rate

Efficient sparse spiking auto-encoder for reconstruction, denoising and classification

Fault detection of industrial processes using attention-based gated recurrent unit autoencoder with skip connection

Transformations establishing equivalence across neural networks: When have two networks learned the same task?

RECONSTRUCTING RETINAL VISUAL IMAGES FROM 3T FMRI DATA ENHANCED BY UNSUPERVISED LEARNING.

Optimal sensor placement for joint reconstruction of multiscale responses and unknown inputs using modal Kalman filter

Video anomaly detection based on a multi-layer reconstruction autoencoder with a variance attention strategy

D‐Unet: A symmetric architecture of convolutional neural network with two auxiliary outputs for dementia recognition

Vertical-horizontal latent space with iterative memory review network for multi-class anomaly detection

Orbital Controls on North Pacific Dust Flux During the Late Quaternary

A reconstruction bidirectional recurrent neural network‐based deinterleaving method for known radar signals in open‐set scenarios

Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-Shot Logical Reasoning Over Text.

Asymptotical left inversion for single-input linear systems

Human-in-the-Loop Formation-Containment Control for Multiagent Systems: An Observer-Based Distributed Unknown Input Reconstruction Method

Observer‐based event‐triggered consensus control for multi‐agent systems with nonlinearity and unknown inputs

Detecting Adversarial Examples on Deep Neural Networks With Mutual Information Neural Estimation
