Statistical Guarantees Research Articles

Incorporating learning based components in the current state-of-the-art cyber-physical systems (CPS) has been a challenge due to the brittleness of the underlying deep neural networks. On the bright side, if executed correctly with safety guarantees, this has the ability to revolutionize domains like autonomous systems, medicine, and other safety-critical domains. This is because it would allow system designers to use high-dimensional outputs from sensors like camera and LiDAR. The trepidation in deploying systems with vision and LiDAR components comes from incidents of catastrophic failures in the real world. Recent reports of self-driving cars running into difficult to handle scenarios is ingrained in the software components which handle such sensor inputs. The ability to handle such high-dimensional signals is due to the explosion of algorithms which use deep neural networks. Sadly, the reason behind the safety issues is also due to deep neural networks themselves. The pitfalls occur due to possible over-fitting and lack of awareness about the blind spots induced by the training distribution. Ideally, system designers would wish to cover as many scenarios during training as possible. However, achieving a meaningful coverage is impossible. This naturally leads to the following question: is it feasible to flag out-of-distribution (OOD) samples without causing too many false alarms? Such an OOD detector should be executable in a fashion that is computationally efficient. This is because OOD detectors often are executed as frequently as the sensors are sampled. Our aim in this article is to build an effective anomaly detector. To this end, we propose the idea of a memory bank to cache data samples which are representative enough to cover most of the in-distribution data. The similarity with respect to such samples can be a measure of familiarity of the test input. This is made possible by an appropriate choice of distance function tailored to the type of sensor we are interested in. Additionally, we adapt conformal anomaly detection framework to capture the distribution shifts with a guarantee of false alarm rate. We report the performance of our technique on two challenging scenarios: a self-driving car setting implemented inside the simulator CARLA with image inputs and autonomous racing car navigation setting with LiDAR inputs. From the experiments, it is clear that a deviation from the in-distribution setting can potentially lead to unsafe behavior. It should be noted that not all OOD inputs lead to precarious situations in practice, but staying in-distribution is akin to staying within a safety bubble and predictable behavior. An added benefit of our memory-based approach is that the OOD detector produces interpretable feedback for a human designer. This is of utmost importance since it recommends a potential fix for the situation as well. In other competing approaches, such feedback is difficult to obtain due to reliance on techniques which use variational autoencoders.

Read full abstract

In the past few years, Transformer has been widely adopted in many domains and applications because of its impressive performance. Vision Transformer (ViT), a successful and well-known variant, attracts considerable attention from both industry and academia thanks to its record-breaking performance in various vision tasks. However, ViT is also highly nonlinear like other classical neural networks and could be easily fooled by both natural and adversarial perturbations. This limitation could pose a threat to the deployment of ViT in the real industrial environment, especially in safety-critical scenarios. How to improve the robustness of ViT is thus an urgent issue that needs to be addressed. Among all kinds of robustness, patch robustness is defined as giving a reliable output when a random patch in the input domain is perturbed. The perturbation could be natural corruption, such as part of the camera lens being blurred. It could also be a distribution shift, such as an object that does not exist in the training data suddenly appearing in the camera. And in the worst case, there could be a malicious adversarial patch attack that aims to fool the prediction of a machine learning model by arbitrarily modifying pixels within a restricted region of an input image. This kind of attack is also called physical attack, as it is believed to be more real than digital attack. Although there has been some work on patch robustness improvement of Convolutional Neural Network, related studies on its counterpart ViT are still at an early stage as ViT is usually much more complex with far more parameters. It is harder to assess and improve its robustness, not to mention to provide a provable guarantee. In this work, we propose PatchCensor, aiming to certify the patch robustness of ViT by applying exhaustive testing. We try to provide a provable guarantee by considering the worst patch attack scenarios. Unlike empirical defenses against adversarial patches that may be adaptively breached, certified robust approaches can provide a certified accuracy against arbitrary attacks under certain conditions. However, existing robustness certifications are mostly based on robust training, which often requires substantial training efforts and the sacrifice of model performance on normal samples. To bridge the gap, PatchCensor seeks to improve the robustness of the whole system by detecting abnormal inputs instead of training a robust model and asking it to give reliable results for every input, which may inevitably compromise accuracy. Specifically, each input is tested by voting over multiple inferences with different mutated attention masks, where at least one inference is guaranteed to exclude the abnormal patch. This can be seen as complete-coverage testing, which could provide a statistical guarantee on inference at the test time. Our comprehensive evaluation demonstrates that PatchCensor is able to achieve high certified accuracy (e.g., 67.1% on ImageNet for 2%-pixel adversarial patches), significantly outperforming state-of-the-art techniques while achieving similar clean accuracy (81.8% on ImageNet). The clean accuracy is the same as vanilla ViT models. Meanwhile, our technique also supports flexible configurations to handle different adversarial patch sizes by simply changing the masking strategy.

Read full abstract

Statistical Guarantees Research Articles

Related Topics

Articles published on Statistical Guarantees

Distributed Estimation of Principal Support Vector Machines for Sufficient Dimension Reduction

Heterogeneous functional regression for subgroup analysis

Dynamic frequent subgraph mining algorithms over evolving graphs: a survey

Statistical Testing of Quantum Programs via Fixed-Point Amplitude Amplification

Net-Zero Scheduling of Multi-Energy Building Energy Systems: A Learning-Based Robust Optimization Approach With Statistical Guarantees

A Benchmark Proposal for Non‐Generative Fair Adversarial Learning Strategies Using a Fairness‐Utility Trade‐off Metric

End-to-End Statistical Model Checking for Parameterization and Stability Analysis of ODE Models

Leveraging conformal prediction to annotate enzyme function space with limited false positives.

Memory-based Distribution Shift Detection for Learning Enabled Cyber-Physical Systems with Statistical Guarantees

Optimistic Rates: A Unifying Theory for Interpolation Learning and Regularization in Linear Regression

Variable Importance in High-Dimensional Settings Requires Grouping

MeshPointNet: 3D Surface Classification Using Graph Neural Networks and Conformal Predictions on Mesh-Based Representations

Statistical guaranteed noisy tensor recovery by fusing low-rankness on all orientations in frequency–original domains

Identification and Auto-Debiased Machine Learning for Outcome-Conditioned Average Structural Derivatives

Finding Near-optimal Configurations in Colossal Spaces with Statistical Guarantees

Toward quantitative super-resolution microscopy: molecular maps with statistical guarantees.

Denoising cosine similarity: A theory-driven approach for efficient representation learning

Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness

A Doubly Stochastic Simulator with Applications in Arrivals Modeling and Simulation

PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Statistical Guarantees Research Articles

Related Topics

Articles published on Statistical Guarantees

Distributed Estimation of Principal Support Vector Machines for Sufficient Dimension Reduction

Heterogeneous functional regression for subgroup analysis

Dynamic frequent subgraph mining algorithms over evolving graphs: a survey

Statistical Testing of Quantum Programs via Fixed-Point Amplitude Amplification

Net-Zero Scheduling of Multi-Energy Building Energy Systems: A Learning-Based Robust Optimization Approach With Statistical Guarantees

A Benchmark Proposal for Non‐Generative Fair Adversarial Learning Strategies Using a Fairness‐Utility Trade‐off Metric

End-to-End Statistical Model Checking for Parameterization and Stability Analysis of ODE Models

Leveraging conformal prediction to annotate enzyme function space with limited false positives.

Memory-based Distribution Shift Detection for Learning Enabled Cyber-Physical Systems with Statistical Guarantees

Optimistic Rates: A Unifying Theory for Interpolation Learning and Regularization in Linear Regression

Variable Importance in High-Dimensional Settings Requires Grouping

MeshPointNet: 3D Surface Classification Using Graph Neural Networks and Conformal Predictions on Mesh-Based Representations

Statistical guaranteed noisy tensor recovery by fusing low-rankness on all orientations in frequency–original domains

Identification and Auto-Debiased Machine Learning for Outcome-Conditioned Average Structural Derivatives

Finding Near-optimal Configurations in Colossal Spaces with Statistical Guarantees

Toward quantitative super-resolution microscopy: molecular maps with statistical guarantees.

Denoising cosine similarity: A theory-driven approach for efficient representation learning

Deep nonparametric estimation of intrinsic data structures by chart autoencoders: Generalization error and robustness

A Doubly Stochastic Simulator with Applications in Arrivals Modeling and Simulation

PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing