Articles published on Supercomputer
- Research Article
- 10.51137/wrp.ijarbm.408
- Nov 6, 2025
- International Journal of Applied Research in Business and Management
- Reginald Mulalo Ndwamai + 1 more
Understanding how organisational support, perceived ease of use, and perceived usefulness influence the actual use of a High-Performance Computing (HPC) system sheds light on its adoption within a higher education institution. A quantitative approach with a descriptive design was employed, surveying 218 respondents, including Master's and PhD students and academic staff, using stratified and simple random sampling techniques. Data were collected through online self-administered questionnaires and analysed using SPSS version 29.0. The findings revealed strong positive correlations between organisational support, perceived ease of use, perceived usefulness, and actual use of the HPC system, supporting prior research on technology adoption (Davis, 1989; Venkatesh et al., 2003). Moderate positive correlations were found between organisational support and both perceived ease of use and perceived usefulness, suggesting that users who feel supported by their institution are more likely to find the system easy to use and valuable. Furthermore, moderate positive correlations were found between perceived ease of use and perceived usefulness, and between perceived ease of use and actual use, indicating that users who find the system simple to use are more likely to adopt it. Additionally, there was a moderate positive correlation between perceived usefulness and actual use, highlighting that users engage more with the system when they recognise its benefits. In conclusion, organisational support and user perceptions are key to successfully implementing and using new technologies.
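A minimal sketch of the kind of pairwise correlation analysis reported above, using hypothetical construct scores in place of the study's survey data (the study itself used SPSS 29.0 on 218 responses):

```python
# Minimal sketch of a pairwise Pearson correlation analysis over four
# hypothetical construct scores; these are stand-ins for averaged
# Likert-scale items, not the study's actual data.
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(0)
n = 218  # number of respondents reported in the abstract

support = rng.uniform(1, 5, n)                                 # organisational support
ease = 0.5 * support + rng.normal(0, 1, n)                     # perceived ease of use
usefulness = 0.4 * support + 0.4 * ease + rng.normal(0, 1, n)  # perceived usefulness
actual_use = 0.3 * ease + 0.5 * usefulness + rng.normal(0, 1, n)

constructs = {"support": support, "ease": ease,
              "usefulness": usefulness, "actual_use": actual_use}
names = list(constructs)
for i, a in enumerate(names):
    for b in names[i + 1:]:
        r, p = pearsonr(constructs[a], constructs[b])
        print(f"{a:>11} vs {b:<11} r = {r:+.2f} (p = {p:.3g})")
```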
- Research Article
- 10.1007/s11227-025-08013-z
- Nov 6, 2025
- The Journal of Supercomputing
- Takieddine Meriouma + 3 more
Abstract Getting a precise estimate of electric fields around extra-high-voltage (EHV) transmission lines is essential for keeping the public safe, ensuring environmental compliance, and planning infrastructure effectively. Unfortunately, traditional numerical methods often struggle with accuracy and can be slow to converge, which makes them less suitable for large-scale projects. This study introduces a hybrid computational framework that combines the Charge Simulation Method (CSM) with the Firefly Algorithm (FA). This combination helps optimize the number, position, and strength of simulation charges, leading to better modeling accuracy and efficiency. Additionally, we have trained three artificial intelligence (AI) models, a Multilayer Perceptron Neural Network (MLPNN), an Adaptive Neuro-Fuzzy Inference System (ANFIS), and a Least Squares Support Vector Machine (LS-SVM), on real-world field data to reliably predict electric field values. Notably, LS-SVM is used in this context for the first time and has been shown to outperform the other models in accuracy, generalization, and speed. We evaluated the proposed CSM-FA hybrid model alongside the AI predictions using metrics such as Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and the coefficient of determination (R²), revealing significant improvements over traditional methods. Given the heavy computational demands of the optimization and learning phases, we utilized high-performance computing (HPC) resources for implementation. This work not only advances algorithmic innovation and AI-assisted simulation but also enhances HPC applications, providing a scalable and precise solution for real-time field monitoring and regulatory assessments. The methodology aligns well with the scientific goals of The Journal of Supercomputing and fosters advanced research in intelligent power system modeling.
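The evaluation metrics named above are standard; a minimal sketch of RMSE, MAPE, and R² on hypothetical measured versus predicted field values:

```python
# Minimal sketch of the evaluation metrics named in the abstract (RMSE, MAPE,
# R^2), applied to hypothetical measured vs. predicted electric-field values.
import numpy as np

def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

def mape(y_true, y_pred):
    return float(np.mean(np.abs((y_true - y_pred) / y_true)) * 100.0)

def r2(y_true, y_pred):
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return float(1.0 - ss_res / ss_tot)

# Hypothetical field magnitudes (kV/m) along a lateral profile under a line.
measured = np.array([4.8, 5.6, 6.1, 5.9, 5.2, 4.4])
predicted = np.array([4.9, 5.5, 6.0, 6.1, 5.1, 4.5])

print(f"RMSE = {rmse(measured, predicted):.3f} kV/m")
print(f"MAPE = {mape(measured, predicted):.2f} %")
print(f"R^2  = {r2(measured, predicted):.4f}")
```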
- Research Article
- 10.1007/s11227-025-07986-1
- Nov 5, 2025
- The Journal of Supercomputing
- Edixon Parraga + 4 more
Abstract Distributed deep learning (DDL) applications generate heavy input/output (I/O) workloads that can create bottlenecks in high-performance computing (HPC) systems. Their optimal I/O configuration depends on factors such as access patterns, storage hardware, dataset size, and execution scale. This study proposes a systematic methodology for characterizing and optimizing I/O behavior in DDL applications, represented through the deep learning I/O benchmark (DLIO), and validated with the real DeepGalaxy application. We evaluate access modes, file formats, and Lustre file system configurations, demonstrating that stripe counts optimized for the access pattern and application scale can reduce I/O and execution times, achieving up to 18 GiB/s of bandwidth and a 5X increase in IOPS. HDF5 provides balanced performance, while TFRecord stands out in bandwidth-intensive scenarios. Shared access minimizes contention and improves scalability in multi-node executions. The results are consolidated into configuration guidelines that offer practical recommendations for practitioners to tune DDL applications for efficient execution in HPC environments.
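For readers tuning similar workloads, a hedged sketch of setting a Lustre stripe count on a dataset directory before a training run; it assumes the standard `lfs` client tool is available and that the path is on Lustre, and the stripe values are illustrative rather than the paper's:

```python
# Hedged sketch: apply Lustre striping to a dataset directory so that newly
# written files inherit the layout, in the spirit of the tuning described
# above. Path, stripe count, and stripe size are illustrative only.
import subprocess

def set_lustre_striping(path, stripe_count=8, stripe_size="1m"):
    """Set the default striping for a directory and return the new layout."""
    subprocess.run(
        ["lfs", "setstripe", "-c", str(stripe_count), "-S", stripe_size, path],
        check=True,
    )
    result = subprocess.run(["lfs", "getstripe", "-d", path],
                            capture_output=True, text=True, check=True)
    return result.stdout

if __name__ == "__main__":
    print(set_lustre_striping("/lustre/project/train_data", stripe_count=8))
```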
- Research Article
- 10.1021/acs.jcim.5c02081
- Nov 3, 2025
- Journal of chemical information and modeling
- Fengbo Yuan + 20 more
Artificial intelligence (AI) is reshaping computational science, but AI-driven workflows routinely span heterogeneous tasks executed across diverse high-performance computing (HPC) systems. We introduce DPDispatcher, an open-source Python framework for scalable, fault-tolerant task scheduling in such environments with an emphasis on lightweight submission, automatic retries, and robust resumption. DPDispatcher separates connection and file-staging concerns from scheduler control, supports multiple HPC job managers, and provides both local and secure shell (SSH) backends. DPDispatcher has been adopted by more than ten scientific packages. Representative use cases include active learning for machine-learning potentials, free-energy and thermodynamic integration workflows, large-scale materials screening, and large language model (LLM)-driven agents that launch HPC computations. Across these settings, DPDispatcher reduces operational overhead and error rates while improving portability and automation for reliable, high-throughput scientific computing.
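A hedged sketch of a typical submission following DPDispatcher's documented Machine/Resources/Task/Submission pattern; the host, paths, queue name, and field values are illustrative, and exact signatures may differ across versions:

```python
# Hedged sketch of batch submission with DPDispatcher; all hosts, paths, and
# resource values below are placeholders, not values from the paper.
from dpdispatcher import Machine, Resources, Task, Submission

machine = Machine(
    batch_type="Slurm",          # scheduler backend on the remote HPC system
    context_type="SSHContext",   # stage files and submit over SSH
    local_root="./",
    remote_root="/scratch/jobs",
    remote_profile={"hostname": "hpc.example.org", "username": "alice"},
)

resources = Resources(
    number_node=1, cpu_per_node=8, gpu_per_node=1,
    queue_name="gpu", group_size=4,   # pack 4 tasks into each scheduler job
)

tasks = [
    Task(command=f"python train.py --seed {i}",
         task_work_path=f"task_{i:03d}/",
         forward_files=["train.py"],
         backward_files=["model.pt", "log"])
    for i in range(16)
]

submission = Submission(work_base="work/", machine=machine,
                        resources=resources, task_list=tasks)
submission.run_submission()  # blocks until done; failed tasks are retried and
                             # an interrupted submission can be resumed on rerun
```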
- Research Article
- 10.1177/10943420251393314
- Nov 3, 2025
- The International Journal of High Performance Computing Applications
- Muhammad Rizwan + 2 more
This survey examines the optimization techniques and trends for the High-Performance Conjugate Gradient (HPCG) benchmark over the last 10 years. The HPCG benchmark was introduced to address the limitations of the High-Performance Linpack (HPL) benchmark and to provide a more realistic performance measure for modern supercomputer architectures. Our study evaluates HPCG optimizations performed by High-Performance Computing (HPC) researchers on diverse hardware architectures such as CPUs, GPUs, MICs, and FPGAs, focusing on how the reference HPCG code has been tuned in terms of data formats, parallelization strategies, and architecture-specific techniques. We review these optimizations and present a comprehensive analysis of them. This work offers the first comprehensive review of HPCG optimizations, discussing previous findings and providing a systematic analysis to inform future optimization efforts. It aims to guide researchers in identifying the most suitable directions for developing further optimization strategies for the HPCG benchmark.
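For context, a minimal sketch of the unpreconditioned conjugate gradient iteration that the benchmark is built around; the real HPCG solves a 3D 27-point-stencil problem with a multigrid-preconditioned CG:

```python
# Minimal unpreconditioned conjugate gradient on a sparse SPD matrix, as a
# stand-in for the iteration HPCG exercises (HPCG adds a multigrid
# preconditioner and a 3D 27-point stencil operator).
import numpy as np
from scipy.sparse import diags

def cg(A, b, tol=1e-8, max_iter=500):
    x = np.zeros_like(b)
    r = b - A @ x
    p = r.copy()
    rs_old = r @ r
    for _ in range(max_iter):
        Ap = A @ p
        alpha = rs_old / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) < tol:
            break
        p = r + (rs_new / rs_old) * p
        rs_old = rs_new
    return x

# 1D Poisson matrix in CSR format as a simple stand-in for HPCG's operator.
n = 1000
A = diags([-1.0, 2.0, -1.0], [-1, 0, 1], shape=(n, n), format="csr")
b = np.ones(n)
x = cg(A, b)
print("final residual norm:", np.linalg.norm(b - A @ x))
```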
- Research Article
- 10.1029/2024ms004884
- Nov 1, 2025
- Journal of Advances in Modeling Earth Systems
- Aaron Lattanzi + 10 more
Abstract High performance computing (HPC) architectures have undergone rapid development in recent years. As a result, established software suites face an ever-increasing challenge to remain performant on and portable across modern systems. Many of the widely adopted atmospheric modeling codes cannot fully (or in some cases, at all) leverage the acceleration provided by General-Purpose Graphics Processing Units (GPUs), leaving users of those codes constrained to increasingly limited HPC resources. Energy Research and Forecasting (ERF) is a regional atmospheric modeling code that leverages the latest HPC architectures, whether composed of only Central Processing Units (CPUs) or incorporating GPUs. ERF contains many of the standard discretizations and basic features needed to model general atmospheric dynamics. The modular design of ERF provides a flexible platform for exploring different physics parameterizations and numerical strategies. ERF is built on a state-of-the-art, well-supported software framework (AMReX) that provides a performance-portable interface and ensures ERF's long-term sustainability on next-generation computing systems. This paper details the numerical methodology of ERF, presents results for a series of verification/validation cases, and documents ERF's performance on current HPC systems. The roughly 5× speedup of ERF (using GPUs) over the Weather Research and Forecasting (WRF) model (CPUs only) for a 3D squall line test case highlights the significance of leveraging GPU acceleration.
- Research Article
- 10.1016/j.neunet.2025.107789
- Nov 1, 2025
- Neural networks : the official journal of the International Neural Network Society
- Hangming Zhang + 3 more
Combining aggregated attention and transformer architecture for accurate and efficient performance of Spiking Neural Networks.
- Research Article
- 10.1145/3774418
- Nov 1, 2025
- ACM Transactions on Architecture and Code Optimization
- Mathys Eliott Jam + 5 more
Many High-Performance Computing (HPC) libraries rely on decision trees to select the best kernel hyperparameters at runtime, depending on the input and environment. However, finding optimized configurations for each input and environment is challenging and requires significant manual effort and computational resources. This paper presents MLKAPS, a tool that automates this task using machine learning and adaptive sampling techniques. MLKAPS generates decision trees that tune HPC kernels’ design parameters to achieve efficient performance for any user input. MLKAPS scales to large input and design spaces, outperforming similar state-of-the-art auto-tuning tools in tuning time and mean speedup. We demonstrate the benefits of MLKAPS on the highly optimized Intel® MKL dgetrf LU kernel and show that MLKAPS finds blind spots in the manual tuning of HPC experts. It improves over 85% of the inputs with a geomean speedup of 1.31×. On the Intel® MKL dgeqrf QR kernel, MLKAPS improves performance on 85% of the inputs with a geomean speedup of 1.18×.
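Not MLKAPS itself, but a hedged sketch of the general pattern it automates: explore (input, parameter) configurations, measure kernel performance, and fit a decision tree that predicts a good parameter for unseen inputs; the runtime model and all names here are hypothetical:

```python
# Hedged sketch of decision-tree-based kernel tuning (not MLKAPS code):
# benchmark candidate block sizes over sampled input sizes, then fit a tree
# that maps an input size to the block size observed to be fastest.
import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)

def measured_runtime(n, block):
    """Hypothetical runtime model standing in for an actual kernel benchmark."""
    return n / block + 2e-3 * block * np.sqrt(n) + rng.normal(0, 0.5)

candidate_blocks = [16, 32, 64, 128, 256]
sizes = rng.integers(256, 8192, 200)

# For each sampled input size, benchmark every candidate and keep the fastest
# (a real tool would use adaptive sampling instead of an exhaustive sweep).
best_block = [min(candidate_blocks, key=lambda b: measured_runtime(n, b))
              for n in sizes]

tree = DecisionTreeRegressor(max_depth=4).fit(sizes.reshape(-1, 1), best_block)
print("suggested block size for n=3000:", int(tree.predict([[3000]])[0]))
```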
- Research Article
- 10.1016/j.cmpb.2025.108994
- Nov 1, 2025
- Computer methods and programs in biomedicine
- Wonjin Choi + 2 more
Developing a reduced order model for pulsatile blood flow simulations using minimal three-dimensional simulation data.
- Research Article
- 10.3390/civileng6040058
- Oct 31, 2025
- CivilEng
- Mu’Tasim Abdel-Jaber + 2 more
The incorporation of Basalt Fiber-Reinforced Polymer (BFRP) materials marks a significant advancement in the adoption of sustainable and high-performance technologies in structural engineering. This study investigates the flexural behavior of four-meter, two-span continuous reinforced concrete (RC) beams of low and medium compressive strengths (20 MPa and 32 MPa) strengthened or rehabilitated using near-surface mounted (NSM) BFRP ropes. Six RC beam specimens were tested, of which two were strengthened before loading and two were rehabilitated after being preloaded to 70% of their ultimate capacity. The experimental program was complemented by Finite Element Modeling (FEM) and analytical evaluations per ACI 440.2R-08 guidelines. The results demonstrated that the NSM-BFRP rope application led to a flexural strength increase ranging from 18% to 44% and improved ductility by approximately 9–11% in strengthened beams and 13–20% in rehabilitated beams, relative to the control specimens. Load-deflection responses showed close alignment between experimental and FEM results, with prediction errors ranging from 0.125% to 7.3%. This study uniquely contributes to the literature by evaluating both the strengthening and post-damage rehabilitation of continuous RC beams using NSM-BFRP ropes, a novel and eco-efficient retrofitting technique with proven performance in enhancing structural capacity and serviceability.
- Research Article
- 10.3390/electronics14214235
- Oct 29, 2025
- Electronics
- Hayong Jeong + 3 more
In modern high-performance computing (HPC) and large-scale data processing environments, the efficient utilization and scalability of memory resources are critical determinants of overall system performance. Architectures such as non-uniform memory access (NUMA) and tiered memory systems frequently suffer performance degradation due to remote accesses stemming from shared data among multiple tasks. This paper proposes LACX, a shared data migration technique leveraging Compute Express Link (CXL), to address these challenges. LACX preserves the migration cycle of automatic NUMA balancing (AutoNUMA) while identifying shared data characteristics and migrating such data to CXL memory instead of DRAM, thereby maximizing DRAM locality. The proposed method utilizes existing kernel structures and data to efficiently identify and manage shared data without incurring additional overhead, and it effectively avoids conflicts with AutoNUMA policies. Evaluation results demonstrate that, although remote accesses to shared data can degrade performance in low-tier memory scenarios, LACX significantly improves overall memory bandwidth utilization and system performance in high-tier memory and memory-intensive workload environments by distributing DRAM bandwidth. This work presents a practical, lightweight approach to shared data management in tiered memory environments and highlights new directions for next-generation memory management policies.
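A conceptual sketch (user-space Python, not kernel code) of the placement policy described above: during a migration pass, pages accessed by more than one task are directed to the CXL tier, keeping DRAM for task-private data; all names are hypothetical:

```python
# Conceptual sketch of a shared-data placement decision: pages sampled as
# accessed by multiple tasks go to CXL memory, private pages stay in DRAM.
from collections import defaultdict

def plan_migration(access_samples):
    """access_samples: iterable of (page_id, task_id) pairs from a sampling pass."""
    tasks_per_page = defaultdict(set)
    for page, task in access_samples:
        tasks_per_page[page].add(task)
    return {page: ("CXL" if len(tasks) > 1 else "DRAM (task-local)")
            for page, tasks in tasks_per_page.items()}

samples = [(0x1000, "A"), (0x1000, "B"), (0x2000, "A"), (0x3000, "B")]
for page, target in plan_migration(samples).items():
    print(hex(page), "->", target)
```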
- Research Article
- 10.54254/2755-2721/2026.ka28740
- Oct 28, 2025
- Applied and Computational Engineering
- Kunyang Pan + 1 more
Transistors are at the core of electronic technology, but traditional silicon-based transistors are facing performance bottlenecks. Carbon nanotubes (CNTs), with their excellent electrical properties and nanoscale dimensions, have become an ideal material for the next generation of transistors. This article reviews the fundamental characteristics of carbon nanotube semiconductor materials, their significant advantages over traditional silicon-based materials, the progress in transistor research and their application fields, as well as the remaining challenges faced by carbon nanotube transistors. The paper mainly introduces the preparation process of carbon nanotubes, innovations in device structure, and optimization strategies for electrical properties. In addition, the article explores the application potential of carbon nanotube transistors in fields such as integrated circuits, radio frequency electronics, display technology, high-performance computing, and sensors. Finally, the paper points out that issues of purity control, high-frequency performance bottlenecks, and integration uniformity still need to be addressed, and it outlines possible improvements and future development directions, providing a reference for the practical application of carbon nanotube transistors.
- Research Article
- 10.1364/oe.577010
- Oct 27, 2025
- Optics Express
- Ang Li + 13 more
The exponential growth of data center traffic, driven by artificial intelligence (AI) and high-performance computing, demands optical interconnect solutions that overcome the limitations of current packaging integration methods. The conventional bonding process often suffers from substantial parasitic effects, which degrade signal integrity and limit both bandwidth scalability and energy efficiency. Here, we present a monolithically integrated electronic-photonic transceiver fabricated on a 45 nm CMOS-SOI platform, featuring a co-designed Mach-Zehnder modulator (MZM), driver amplifier, Ge-Si photodetector (PD), and transimpedance amplifier (TIA) within a single chip. By eliminating bonding interfaces in optoelectronic integration, the transmitter achieves 64 Gbaud four-level pulse amplitude modulation (PAM-4) data transmission below the 5.8% overhead hard-decision (HD) forward error correction (FEC) bit error rate (BER) threshold of 3.8 × 10⁻³, while the receiver achieves 64 Gbaud PAM-4 data transmission below the 6.7% overhead KP4-FEC threshold of 2.4 × 10⁻⁴. The integrated transceiver consumes a total of 3.07 pJ/bit at 128 Gb/s. This work highlights the potential of silicon-based monolithic optoelectronic integration techniques for high-speed optical communication and interconnection, offering remarkable enhancements in system performance and scalability.
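A quick arithmetic check of the reported efficiency figure:

```python
# Energy efficiency sanity check: 3.07 pJ/bit at 128 Gb/s (64 Gbaud PAM-4,
# 2 bits per symbol) corresponds to roughly 0.39 W of transceiver power.
energy_per_bit_pj = 3.07
data_rate_gbps = 128
power_w = energy_per_bit_pj * 1e-12 * data_rate_gbps * 1e9
print(f"{power_w * 1e3:.0f} mW")  # ~393 mW
```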
- Research Article
- 10.1177/10943420251351424
- Oct 26, 2025
- The International Journal of High Performance Computing Applications
- Andreas Herten + 2 more
The field of High-Performance Computing (HPC) is defined by providing computing systems of the highest performance to a variety of demanding scientific users. The tight co-design relationship between HPC providers and users propels the field forward, paired with technological improvements, achieving continuously higher performance and resource utilization. A key tool for system architects, architecture researchers, and scientific users is the benchmark, which allows for well-defined assessment of hardware, software, and algorithms. Many benchmarks exist in the community, from individual niche benchmarks testing specific features to large-scale benchmark suites used for whole procurements. We survey the available HPC benchmarks, summarizing them in tabular form with key details and a concise categorization, also made available through an interactive website. For this categorization, we present a benchmark taxonomy that enables well-defined characterization of benchmarks.
- Research Article
- 10.1177/15485129251349540
- Oct 26, 2025
- The Journal of Defense Modeling and Simulation: Applications, Methodology, Technology
- Paul A Clement + 3 more
During the age of above-ground nuclear weapons testing, the effect of vegetation on radiation energy deposition in soil was computationally too challenging to study. Today, improvements in high-performance computing, mature and accurate radiation transport models, and user-friendly optimization and statistical analysis software packages enable investigations into modeling and quantifying the impact of forest vegetation on the prompt gamma-ray energy deposition within the soil from an atmospheric nuclear weapon detonation. Our approach uses CUBIT® for meshing and geometry creation, MCNP® for radiation transport simulation, and Dakota to amass and statistically analyze the results. Depending on the forest parameters, there is a 0.16%–1.3% change in photon energy deposition in the soil. This research demonstrates a methodology for streamlining a complex radiation modeling effort across multiple codes to quantify and answer a nuclear weapons effects question.
- Research Article
- 10.1134/s1063779625700388
- Oct 25, 2025
- Physics of Particles and Nuclei
- A Mirzoyan + 3 more
Armenian National Supercomputing Center: Bridging Science and Technology through High-Performance Computing
- Research Article
- 10.1177/10943420251384688
- Oct 24, 2025
- The International Journal of High Performance Computing Applications
- Alexandre Dutka + 3 more
High-fidelity computational fluid dynamics (CFD) enables the study of complex and subtle fluid dynamics phenomena, but remains to this day very computationally expensive. Taking full advantage of the raw compute power provided by high-performance computing (HPC) hardware evolutions, such as the rise of GPU computing, is therefore key to making high-fidelity CFD more affordable. However, considering the diverse and fast-evolving HPC hardware landscape, long-term sustainability and software maintainability can rapidly be compromised. The use of adequate numerical methods is also key to reducing the computational cost, and discontinuous high-order methods, which combine geometric flexibility with efficient hardware use in an increasingly bandwidth-bound HPC landscape, are very promising in this regard. This work reports the implementation of such a high-order CFD solver using the open-source library Kokkos to address the performance portability and sustainability issues. Performance is investigated over a broad range of CPU and GPU architectures, demonstrating the relevance of the approach. This work also highlights the fitness of the chosen numerical method to achieve high orders of accuracy without compromising performance or scalability.
- Research Article
- 10.1080/02533839.2025.2565361
- Oct 24, 2025
- Journal of the Chinese Institute of Engineers
- Yihong Wang + 3 more
ABSTRACT With the rapid development of high-performance computing, the demand for interconnection network (IN) performance is also increasing. Reliability is an important indicator for evaluating an IN. An IN with good properties (e.g., maximal connectivity, maximal diagnosability, and low diameter) can be designed to greatly improve information transmission and reduce IN costs. In this paper, we propose the multi-graph matching composition network (MMCN) and characterize several of its important parameters, including connectivity, diagnosability, and diameter. Specifically, we establish the connectivity, diagnosability, an upper bound on the diameter, the 1-extra and 2-extra connectivity, and the 1-extra and 2-extra conditional diagnosability of MMCNs. Finally, we apply our results to a number of well-known INs, including star graphs and pancake graphs, and obtain results for INs whose parameters were previously unknown.
- Research Article
- 10.54254/2753-8818/2026.hz28300
- Oct 23, 2025
- Theoretical and Natural Science
- Wanchen Wang
Gate-stacked double-gate (DG) MOSFETs, featuring a thin SiO₂ interfacial layer combined with high-k dielectrics, improve electrostatic control, suppress leakage, and mitigate short-channel effects, enhancing device performance. They are promising for low-power electronics, high-performance computing, and biosensing. Conventional MOSFET scaling faces critical bottlenecks, as high-k dielectrics alone suffer from leakage and interface issues, while structural innovations such as FinFETs cannot fully suppress short-channel effects at advanced nodes. Gate-all-around (GAA) architectures demonstrate good performance but are excessively costly. This work proposes co-optimizing materials (Al₂O₃, HfO₂, La₂O₃) and structures (strain engineering, dual-material gates, multigate topologies) in gate-stacked DG MOSFETs, integrating high-k stacks with multigate architectures to reinforce electrostatics and scalability. Such synergy ensures enhanced performance while meeting the demands of low-power electronics, high-performance computing, and emerging biosensing applications.
- Research Article
- 10.54254/2753-8818/2025.dl28336
- Oct 23, 2025
- Theoretical and Natural Science
- Mingze Sun
The rapid expansion of the Internet of Things (IoT) and connected objects has exposed significant security challenges in data transmission, device authentication, and privacy protection, especially in resource-constrained environments. This paper provides an in-depth look at the application of elliptic curve cryptography (ECC) as a critical cryptographic solution to these challenges. It explores the mathematical foundations underlying ECC, including the fundamental concepts of elliptic curves and the elliptic curve discrete logarithm problem. It also discusses the practical application of ECC to IoT security, focusing on robust device authentication, secure data transmission and storage, and improved privacy protection mechanisms. This analysis highlights the inherent benefits of ECC, such as strong security with short key lengths, high computational efficiency, and reduced communication overhead, while also addressing challenges such as implementation complexity and standardization. Finally, the paper provides insights into selecting appropriate ECC schemes for various IoT scenarios and discusses future research directions, including the integration of quantum-safe cryptography.
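A toy sketch of the elliptic-curve group law and a Diffie-Hellman exchange, purely to illustrate the discrete logarithm structure the abstract refers to; it uses a small textbook curve, whereas real IoT deployments use standardized curves (e.g., P-256 or Curve25519) through vetted libraries rather than hand-rolled arithmetic:

```python
# Toy elliptic-curve arithmetic over F_17 on the textbook curve
# y^2 = x^3 + 2x + 2, whose group has prime order 19 with generator G = (5, 1).
P, A = 17, 2                      # field prime and curve coefficient a

def add(p, q):
    """Group law: add two points (None is the point at infinity)."""
    if p is None: return q
    if q is None: return p
    (x1, y1), (x2, y2) = p, q
    if x1 == x2 and (y1 + y2) % P == 0:
        return None               # inverse points sum to the point at infinity
    if p == q:
        lam = (3 * x1 * x1 + A) * pow(2 * y1, -1, P) % P
    else:
        lam = (y2 - y1) * pow(x2 - x1, -1, P) % P
    x3 = (lam * lam - x1 - x2) % P
    return (x3, (lam * (x1 - x3) - y1) % P)

def mul(k, p):
    """Double-and-add scalar multiplication (the hard-to-invert ECDLP map)."""
    acc = None
    while k:
        if k & 1:
            acc = add(acc, p)
        p, k = add(p, p), k >> 1
    return acc

G = (5, 1)
a, b = 3, 7                                  # toy private keys
A_pub, B_pub = mul(a, G), mul(b, G)          # exchanged public points
assert mul(a, B_pub) == mul(b, A_pub)        # both sides derive the same secret
print("shared point:", mul(a, B_pub))        # (6, 3) on this toy curve
```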