Energy Overhead Research Articles

Computing servers play a key role in the development and process of emerging compute-intensive applications in recent years. However, they need to operate efficiently from an energy perspective viewpoint, while maximizing the performance and lifetime of the hottest server components (i.e., cores and cache). Previous methods focused on either improving energy efficiency by adopting new hybrid-cache architectures including the resistive random-access memory (RRAM) and static random-access memory (SRAM) at the hardware level, or exploring tradeoffs between lifetime limitation and performance of multicore processors under stable workloads conditions. Therefore, no work has so far proposed a co-optimization method with hybrid-cache-based server architectures for real-life dynamic scenarios taking into account scalability, performance, lifetime reliability, and energy efficiency at the same time. In this article, we first formulate a reliability model for the hybrid-cache architecture to enable precise lifetime reliability management and energy efficiency optimization. We also include the performance and energy overheads of cache switching, and optimize the benefits of hybrid-cache usage for better energy efficiency and performance. Then, we propose a runtime <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$q$ </tex-math></inline-formula> -learning-based reliability management and performance optimization approach for multicore microprocessors with the hybrid-cache architecture, jointly incorporated with a dynamic preemptive priority queue management method to improve the overall tasks’ performance by targeting to respect their end time limits. Experimental results show that our proposed method achieves up to 44% average performance (i.e., tasks execution time) improvement, while maintaining the whole system design lifetime longer than five years, when compared to the latest state-of-the-art energy efficiency optimization and reliability management methods for computing servers.

Read full abstract

An emerging use case of machine learning (ML) is to train a model on a high-performance system and deploy the trained model on energy-constrained embedded systems. Neuromorphic hardware platforms, which operate on principles of the biological brain, can significantly lower the energy overhead of an ML inference task, making these platforms an attractive solution for embedded ML systems. We present a design-technology tradeoff analysis to implement such inference tasks on the processing elements (PEs) of a non-volatile memory (NVM)-based neuromorphic hardware. Through detailed circuit-level simulations at scaled process technology nodes, we show the negative impact of technology scaling on the information-processing latency, which impacts the quality of service of an embedded ML system. At a finer granularity, the latency inside a PE depends on (1) the delay introduced by parasitic components on its current paths, and (2) the varying delay to sense different resistance states of its NVM cells. Based on these two observations, we make the following three contributions. First, on the technology front, we propose an optimization scheme where the NVM resistance state that takes the longest time to sense is set on current paths having the least delay, and vice versa, reducing the average PE latency, which improves the quality of service. Second, on the architecture front, we introduce isolation transistors within each PE to partition it into regions that can be individually power-gated, reducing both latency and energy. Finally, on the system-software front, we propose a mechanism to leverage the proposed technological and architectural enhancements when implementing an ML inference task on neuromorphic PEs of the hardware. Evaluations with a recent neuromorphic hardware architecture show that our proposed design-technology co-optimization approach improves both performance and energy efficiency of ML inference tasks without incurring high cost-per-bit.

Read full abstract

Energy Overhead Research Articles

Related Topics

Articles published on Energy Overhead

LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update

Reinforcement Learning-Based Joint Reliability and Performance Optimization for Hybrid-Cache Computing Servers

Design-Technology Co-Optimization for NVM-Based Neuromorphic Processing Elements

An Adaptive Multipath Routing Method Based on Improved GA and Information Entropy

Cloud–Edge Collaborative Resource Allocation for Blockchain-Enabled Internet of Things: A Collective Reinforcement Learning Approach

Efficient Integrity-Tree Structure for Convolutional Neural Networks through Frequent Counter Overflow Prevention in Secure Memories.

Near-Optimal Energy Management for Energy Harvesting IoT Devices Using Imitation Learning

PVoT: Reconfigurable Photovoltaic Array for Indoor Light Energy-Powered Batteryless Devices

Sorting in Memristive Memory

Intelligent energy-efficient scheduling with ant colony techniques for heterogeneous edge computing

Dataflow Driven Partitioning of Machine Learning Applications for Optimal Energy Use in Batteryless Systems

Energy optimization for deadline-constrained parallel applications on multi-ECU embedded systems

Energy aware resource control mechanism for improved performance in future green 6G networks

Nonuniform Compressive Sensing via Ohmic Voltage Attenuation: A Memristive Crossbar Design Approach Leveraging Intrinsic Computation

Adaptive Energy Management for Self-Sustainable Wearables in Mobile Health

Split PO for paging in B5G networks

Linear Error Correction Codec Implementation Based on an In-Memory Computing Architecture for Nonvolatile Memories

An FPGA-based Approach to Evaluate Thermal and Resource Management Strategies of Many-core Processors

Characterization and Costs of Integrating Blockchain and IoT for Agri-Food Traceability Systems

GLBR: A novel global load balancing routing scheme based on intelligent computing in partially disconnected wireless sensor networks

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Energy Overhead Research Articles

Related Topics

Articles published on Energy Overhead

LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update

Reinforcement Learning-Based Joint Reliability and Performance Optimization for Hybrid-Cache Computing Servers

Design-Technology Co-Optimization for NVM-Based Neuromorphic Processing Elements

An Adaptive Multipath Routing Method Based on Improved GA and Information Entropy

Cloud–Edge Collaborative Resource Allocation for Blockchain-Enabled Internet of Things: A Collective Reinforcement Learning Approach

Efficient Integrity-Tree Structure for Convolutional Neural Networks through Frequent Counter Overflow Prevention in Secure Memories.

Near-Optimal Energy Management for Energy Harvesting IoT Devices Using Imitation Learning

PVoT: Reconfigurable Photovoltaic Array for Indoor Light Energy-Powered Batteryless Devices

Sorting in Memristive Memory

Intelligent energy-efficient scheduling with ant colony techniques for heterogeneous edge computing

Dataflow Driven Partitioning of Machine Learning Applications for Optimal Energy Use in Batteryless Systems

Energy optimization for deadline-constrained parallel applications on multi-ECU embedded systems

Energy aware resource control mechanism for improved performance in future green 6G networks

Nonuniform Compressive Sensing via Ohmic Voltage Attenuation: A Memristive Crossbar Design Approach Leveraging Intrinsic Computation

Adaptive Energy Management for Self-Sustainable Wearables in Mobile Health

Split PO for paging in B5G networks

Linear Error Correction Codec Implementation Based on an In-Memory Computing Architecture for Nonvolatile Memories

An FPGA-based Approach to Evaluate Thermal and Resource Management Strategies of Many-core Processors

Characterization and Costs of Integrating Blockchain and IoT for Agri-Food Traceability Systems

GLBR: A novel global load balancing routing scheme based on intelligent computing in partially disconnected wireless sensor networks