Deep Learning Inference Research Articles

In this paper, we present a new AI (Artificial Intelligence) edge platform, called “MiniDeep”, which provides a standalone deep learning platform based on the cloud-edge architecture. This AI-Edge platform gives developers a complete deep learning development environment for setting up their deep learning life cycle processes, such as model training, model evaluation, model deployment, model inference, ground truth collection, data pre-processing, and training data management. To the best of our knowledge, such a complete deep learning development environment has not been built before. MiniDeep uses Amazon Web Services (AWS) as the backend platform of a deep learning tuning management model. On the edge device, OpenVINO provides deep learning inference acceleration. To perform a deep learning life cycle job, MiniDeep proposes a mini deep life cycle (MDLC) system, which is composed of several microservices spanning the cloud and the edge. MiniDeep provides the Train Job Creator (TJC) for training dataset management and model training scheduling, and the Model Packager (MP) for model package management; both are built on several AWS cloud services. On the edge device, MiniDeep provides the Inference Handler (IH), which handles deep learning inference by serving RESTful API (Application Programming Interface) requests and responses from the end device, and the Data Provider (DP), which is responsible for ground truth collection and dataset synchronization with the cloud. Using this deep learning capability, this paper applies the MiniDeep platform to implement a recommendation system for an AI-QSR (Quick Service Restaurant) KIOSK (interactive kiosk) application. AI-QSR uses the MiniDeep platform to train an LSTM (Long Short-Term Memory)-based recommendation system, which converts the KIOSK UI (User Interface) flow into a flow sequence and performs sequential recommendation of food suggestions.
Finally, the efficiency of the proposed MiniDeep platform is verified through real experiments. The experimental results demonstrate that the proposed LSTM-based scheme outperforms the rule-based scheme in terms of purchase hit accuracy, categorical cross-entropy, precision, recall, and F1 score.
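
As a rough illustration of three of the metrics named in this abstract, the sketch below computes precision, recall, and F1 score from binary labels. The abstract does not describe the evaluation protocol, so the function name `precision_recall_f1`, the interpretation of a label as "the suggested item was purchased", and the toy data are all assumptions made for this example only.

```python
def precision_recall_f1(y_true, y_pred):
    """Compute precision, recall, and F1 for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    # Guard against empty denominators (no predicted or no actual positives).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Hypothetical toy data: 1 means the suggested item was purchased.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 1, 1, 0, 0, 1]

precision, recall, f1 = precision_recall_f1(y_true, y_pred)
# For this toy data there are 3 true positives, 1 false positive, and
# 1 false negative, so precision, recall, and F1 are each 0.75.
print(precision, recall, f1)
```

The same function could be applied to the outputs of both a rule-based and a learned recommender to compare them on equal terms.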

Time-lapse electrical resistivity tomography (ERT) is a popular geophysical method for estimating three-dimensional (3D) permeability fields from electrical potential difference measurements. Traditional inversion and data assimilation methods are used to ingest this ERT data into hydrogeophysical models to estimate permeability. Due to ill-posedness and the curse of dimensionality, existing inversion strategies provide poor estimates and low resolution of the 3D permeability field. Recent advances in deep learning provide powerful algorithms to overcome this challenge. This paper presents a deep learning (DL) framework to estimate the 3D subsurface permeability from time-lapse ERT data. To test the feasibility of the proposed framework, we train DL-enabled inverse models on simulation data. Subsurface process models based on hydrogeophysics are used to generate this synthetic data. Each measurement in both synthetic and field data is standardized by removing the mean and scaling the time series to unit variance. This pre-processing step is necessary to bring simulation data closer to field observations. Training on limited simulation data caused the DL model to over-fit. To overcome this issue, an advanced data augmentation based on mixup is implemented to generate additional training samples. This mixup technique creates weakly labeled (low-fidelity) samples from strongly labeled (high-fidelity) data. The weakly labeled training data is then used to develop DL-enabled inverse models and reduce over-fitting. As both time-lapse ERT (1,133,048 features/realization) and 3D permeability (585,453 features/realization) data samples are from a high-dimensional space, principal component analysis (PCA) is employed to reduce dimensionality. Encoded ERT and encoded permeability are generated using the trained PCA estimators. A deep neural network is then trained to map the encoded ERT to encoded permeability.
This mixup training and unsupervised learning allowed us to build a fast and reasonably accurate DL-based inverse model under limited simulation data. Results show that the proposed weakly supervised learning can capture salient spatial features in the 3D permeability field. Quantitatively, the average mean squared error (in terms of the natural log) on the strongly labeled training, validation, and test datasets is less than 0.5. The R^2 score (global metric) is greater than 0.75, and the percent error in each cell (local metric) is less than 10%. An added benefit in terms of computational cost is that, once trained, the proposed DL-based inverse model is at least O(10^4) times faster than running a forward model. Data generation, DL model training, and hyperparameter tuning to identify optimal neural network architectures used high-performance computing resources, while DL inference is performed on a standard laptop. Approximately O(10^5) processor hours were used for generating data and for DL tuning and training. We acknowledge that data generation and DL model development are expensive. However, once a DL model is trained, it can be re-used for rapid inversion of the given system, with set physics and domain. Note that traditional inversion may require multiple forward model simulations (e.g., on the order of 10 to 1000), which are very expensive. At a speedup of at least O(10^4) per forward run, this amounts to computational savings of approximately O(10^5) to O(10^7), which makes the proposed DL-based inverse model attractive for subsurface imaging and real-time ERT monitoring applications because it gives fast yet reasonably accurate estimates of the permeability field.
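
The standardization and mixup steps described in this abstract can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not the authors' implementation: the function names `standardize` and `mixup_pair`, the Beta parameter `alpha=0.2`, and the tiny toy vectors are assumptions. The mixing rule shown is the standard mixup formulation, in which a weight drawn from a Beta(alpha, alpha) distribution forms a convex combination of two labeled samples.

```python
import random
import statistics


def standardize(series):
    """Remove the mean and scale a time series to unit variance."""
    mean = statistics.fmean(series)
    std = statistics.pstdev(series)  # population standard deviation
    if std == 0:
        # A constant series carries no variation; map it to zeros.
        return [0.0 for _ in series]
    return [(value - mean) / std for value in series]


def mixup_pair(x_i, y_i, x_j, y_j, alpha=0.2, rng=random):
    """Create one weakly labeled sample from two strongly labeled ones.

    lam ~ Beta(alpha, alpha); inputs and labels are mixed with the same lam.
    """
    lam = rng.betavariate(alpha, alpha)
    x_mix = [lam * a + (1 - lam) * b for a, b in zip(x_i, x_j)]
    y_mix = [lam * a + (1 - lam) * b for a, b in zip(y_i, y_j)]
    return x_mix, y_mix, lam


# Hypothetical toy data standing in for (ERT series, permeability) pairs.
rng = random.Random(0)  # seeded for reproducibility
ert_a = standardize([1.0, 2.0, 3.0, 4.0])
ert_b = standardize([2.0, 2.5, 5.0, 9.0])
perm_a = [0.1, 0.4]
perm_b = [0.3, 0.9]

ert_mix, perm_mix, lam = mixup_pair(ert_a, perm_a, ert_b, perm_b, rng=rng)
print(lam, ert_mix, perm_mix)
```

In a pipeline like the one described, mixed samples of this kind would be generated in bulk and then passed through dimensionality reduction before training the network; those later steps are omitted here.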

Articles published on Deep Learning Inference

Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review

Reformulating the direct convolution for high-performance deep learning inference on ARM processors

Impact of Embedded Deep Learning Optimizations for Inference in Wireless IoT Use Cases

Dnadna: a deep learning framework for population genetics inference.

Server load and network-aware adaptive deep learning inference offloading for edge platforms

Towards high-accuracy deep learning inference of compressible flows over aerofoils

Exploring Bitslicing Architectures for Enabling FHE-Assisted Machine Learning

OnSRAM: Efficient Inter-Node On-Chip Scratchpad Management in Deep Learning Accelerators

Minimizing Image Quality Loss After Channel Count Reduction for Plane Wave Ultrasound via Deep Learning Inference.

GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices Based on Fine-Grained Structured Weight Sparsity.

TensorRT-Based Framework and Optimization Methodology for Deep Learning Inference on Jetson Boards

Improved Secure Deep Neural Network Inference Offloading with Privacy-Preserving Scalar Product Evaluation for Edge Computing

Can differential privacy practically protect collaborative deep learning inference for IoT?

MiniDeep: A Standalone AI-Edge Platform with a Deep Learning-Based MINI-PC and AI-QSR System

Exploring Distributed Deep Learning Inference Using Raspberry Pi Spark Cluster

Deep learning to estimate permeability using geophysical data

Multimodal image translation via deep learning inference model trained in video domain

A novel image representation of GNSS correlation for deep learning multipath detection

An Industrial-Grade Solution for Crop Disease Image Detection Tasks.

Resource-Constrained Edge AI with Early Exit Prediction
