Layers Of Deep Neural Network Research Articles

This study presents a novel transfer learning (TL)-based multi-fidelity modeling approach for a set of granular material-filled particle dampers (PDs) with varying cavity height and particle filling ratio, with the aim of mitigating vibration and noise across a broad frequency band. The dynamic characteristics of such dampers are highly nonlinear and depend on a number of features, including particle material and size, cavity configuration, filling ratio, and excitation frequency and amplitude. While deep neural networks (DNNs) have demonstrated success in a variety of fields, including nonlinear dynamics, they are data-hungry and tend to yield inaccurate or inadequate models for high-dimensional nonlinear problems when data are scarce or expensive to collect. In this paper, we propose a multi-fidelity approach for characterizing the dynamics of granular material-filled PDs that combines low-fidelity data from an approximate governing/constitutive equation with high-fidelity experimental data in the context of deep TL. Using the low-fidelity data, a DNN is first trained to represent the mapping between the input parameters (cavity height, particle filling ratio, excitation frequency and amplitude) and the output parameter (damper energy loss factor). Then, in keeping with the deep TL philosophy, the weights and biases in all layers of the pre-trained DNN except a few outermost layers are frozen, while those in the outermost layers are re-trained using the experimental data to formulate a multi-fidelity DNN. The modeling capability of this multi-fidelity DNN is compared with that of a DNN with the same architecture but trained using only the experimental data. Results show that the multi-fidelity DNN model performs much better than the DNN model trained using only the experimental data in characterizing the PD dynamics across a broad frequency band from 100 to 2000 Hz. Since the formulated model adapts to varying cavity height and particle filling ratio and accommodates different excitation frequencies and amplitudes, it is well suited for use in the optimal design of PDs.
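The freeze-and-retrain step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the network has a single hidden layer and one input instead of the four PD parameters, and the functions `low_fidelity` and `high_fidelity` are hypothetical toy stand-ins for the approximate governing equation and the experimental measurements. All names, sizes, and learning rates are assumptions chosen only to keep the example small.

```python
import math
import random

def init_params(n_in, n_hidden, rng):
    """One tanh hidden layer followed by a linear output unit."""
    return {
        "W1": [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_hidden)],
        "b1": [0.0] * n_hidden,
        "W2": [rng.uniform(-1, 1) for _ in range(n_hidden)],
        "b2": 0.0,
    }

def hidden(params, x):
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(params["W1"], params["b1"])]

def predict(params, x):
    h = hidden(params, x)
    return sum(w * hi for w, hi in zip(params["W2"], h)) + params["b2"]

def mse(params, data):
    return sum((predict(params, x) - y) ** 2 for x, y in data) / len(data)

def train(params, data, epochs, lr, freeze_hidden=False):
    """Plain SGD on squared error. With freeze_hidden=True only the
    output layer (W2, b2) is updated; W1 and b1 stay untouched."""
    for _ in range(epochs):
        for x, y in data:
            h = hidden(params, x)
            err = sum(w * hi for w, hi in zip(params["W2"], h)) + params["b2"] - y
            if not freeze_hidden:
                # Backpropagate through tanh using the current output weights.
                for j, row in enumerate(params["W1"]):
                    delta = err * params["W2"][j] * (1.0 - h[j] ** 2)
                    for i in range(len(row)):
                        row[i] -= lr * delta * x[i]
                    params["b1"][j] -= lr * delta
            for j in range(len(h)):
                params["W2"][j] -= lr * err * h[j]
            params["b2"] -= lr * err
    return params

# Hypothetical toy functions (assumptions, not from the paper).
def low_fidelity(x):
    return math.sin(3.0 * x)

def high_fidelity(x):
    return 1.2 * math.sin(3.0 * x) + 0.3

rng = random.Random(0)
params = init_params(n_in=1, n_hidden=8, rng=rng)

# Plentiful low-fidelity data, scarce high-fidelity data.
low_data = [([i / 39], low_fidelity(i / 39)) for i in range(40)]
high_data = [([x], high_fidelity(x)) for x in (0.05, 0.3, 0.5, 0.7, 0.95)]

# Step 1: pre-train every layer on the low-fidelity data.
train(params, low_data, epochs=300, lr=0.05)

# Step 2: freeze the hidden layer and re-train only the output layer.
w1_before = [row[:] for row in params["W1"]]
b1_before = params["b1"][:]
mse_before = mse(params, high_data)
train(params, high_data, epochs=300, lr=0.05, freeze_hidden=True)
mse_after = mse(params, high_data)

print("high-fidelity MSE before fine-tuning:", mse_before)
print("high-fidelity MSE after fine-tuning: ", mse_after)
```

A baseline in the spirit of the comparison in the abstract could be obtained by calling `train` on a freshly initialized network with `high_data` alone; the sketch leaves that out and makes no claim about how such a baseline would perform.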

Edge systems are required to autonomously make real-time decisions based on large quantities of input data under strict power, performance, area, and other constraints. Meeting these constraints is only possible by specializing systems through hardware accelerators purposefully built for machine learning and data analysis algorithms. However, data science evolves at a quick pace, and manual design of custom accelerators has high non-recurring engineering costs: general solutions are needed to automatically and rapidly transition from the formulation of a new algorithm to the deployment of a dedicated hardware implementation. Our solution is the SOftware Defined Architectures (SODA) Synthesizer, an end-to-end, multi-level, modular, extensible compiler toolchain providing a direct path from machine learning tools to hardware. The SODA Synthesizer frontend is based on the Multi-Level Intermediate Representation (MLIR) framework; it ingests pre-trained machine learning models, identifies kernels suited for acceleration, performs high-level optimizations, and prepares them for hardware synthesis. In the backend, SODA leverages state-of-the-art high-level synthesis techniques to generate highly efficient accelerators, targeting both field-programmable gate arrays (FPGAs) and application-specific integrated circuits (ASICs). In this paper, we describe how the SODA Synthesizer can also assemble the generated accelerators (based on the finite state machine with datapath model) in a custom system driven by a distributed controller, building a coarse-grained dataflow architecture that does not require a host processor to orchestrate parallel execution of multiple accelerators. We show the effectiveness of our approach by automatically generating ASIC accelerators for layers of popular deep neural networks (DNNs). Our high-level optimizations result in up to 74x speedup on isolated accelerators for individual DNN layers, and our dynamically scheduled architecture yields an additional 3x performance improvement when combining accelerators to handle streaming inputs.

Articles published on Layers Of Deep Neural Network

Block Walsh–Hadamard Transform-based Binary Layers in Deep Neural Networks

Anomaly detection of adversarial examples using class-conditional generative adversarial networks

A fine-grained mixed precision DNN accelerator using a two-stage big–little core RISC-V MCU

Improving Fault Tolerance for Reliable DNN Using Boundary-Aware Activation

An Application-oblivious Memory Scheduling System for DNN Accelerators

TALIPOT: Energy-Efficient DNN Booster Employing Hybrid Bit Parallel-Serial Processing in MSB-First Fashion

Sparse Bayesian deep learning for dynamic system identification

Multi-Knowledge Aggregation and Transfer for Semantic Segmentation

Physics-guided, data-refined modeling of granular material-filled particle dampers by deep transfer learning

Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration

Fed2A: Federated Learning Mechanism in Asynchronous and Adaptive Modes

Energy-Efficient Offloading for DNN-Based Smart IoT Systems in Cloud-Edge Environments

EosDNN: An Efficient Offloading Scheme for DNN Inference Acceleration in Local-Edge-Cloud Collaborative Environments

Visual servoing with deep reinforcement learning for rotor unmanned helicopter

ADCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems

Visualizing Transform Relations of Multilayers in Deep Neural Networks for ISAR Target Recognition

Multi-Source Unsupervised Domain Adaptation via Pseudo Target Domain

End-to-End Synthesis of Dynamically Controlled Machine Learning Accelerators

Elastic-DF: Scaling Performance of DNN Inference in FPGA Clouds through Automatic Partitioning

A Lego-Based Neural Network Design Methodology With Flexible NoC
