Graphics Processing Units Research Articles

Named entity recognition (NER) models are essential for extracting structured information from unstructured medical texts by identifying entities such as diseases, treatments, and conditions, enhancing clinical decision-making and research. Innovations in machine learning, particularly those involving Bidirectional Encoder Representations From Transformers (BERT)-based deep learning and large language models, have significantly advanced NER capabilities. However, their performance varies across medical datasets due to the complexity and diversity of medical terminology. Previous studies have often focused on overall performance, neglecting specific challenges in medical contexts and the impact of macrofactors like lexical composition on prediction accuracy. These gaps hinder the development of optimized NER models for medical applications. This study aims to meticulously evaluate the performance of various NER models in the context of medical text analysis, focusing on how complex medical terminology affects entity recognition accuracy. Additionally, we explored the influence of macrofactors on model performance, seeking to provide insights for refining NER models and enhancing their reliability for medical applications. This study comprehensively evaluated 7 NER models-hidden Markov models, conditional random fields, BERT for Biomedical Text Mining, Big Transformer Models for Efficient Long-Sequence Attention, Decoding-enhanced BERT with Disentangled Attention, Robustly Optimized BERT Pretraining Approach, and Gemma-across 3 medical datasets: Revised Joint Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA), BioCreative V CDR, and Anatomical Entity Mention (AnatEM). The evaluation focused on prediction accuracy, resource use (eg, central processing unit and graphics processing unit use), and the impact of fine-tuning hyperparameters. The macrofactors affecting model performance were also screened using the multilevel factor elimination algorithm. The fine-tuned BERT for Biomedical Text Mining, with balanced resource use, generally achieved the highest prediction accuracy across the Revised JNLPBA and AnatEM datasets, with microaverage (AVG_MICRO) scores of 0.932 and 0.8494, respectively, highlighting its superior proficiency in identifying medical entities. Gemma, fine-tuned using the low-rank adaptation technique, achieved the highest accuracy on the BioCreative V CDR dataset with an AVG_MICRO score of 0.9962 but exhibited variability across the other datasets (AVG_MICRO scores of 0.9088 on the Revised JNLPBA and 0.8029 on AnatEM), indicating a need for further optimization. In addition, our analysis revealed that 2 macrofactors, entity phrase length and the number of entity words in each entity phrase, significantly influenced model performance. This study highlights the essential role of NER models in medical informatics, emphasizing the imperative for model optimization via precise data targeting and fine-tuning. The insights from this study will notably improve clinical decision-making and facilitate the creation of more sophisticated and effective medical NER models.

Traffic simulation is a critical tool for congestion analysis, travel time estimation, and route optimization in urban planning, benefiting navigation apps, transportation network companies, and state agencies. Traditionally, traffic micro-simulation frameworks are based on road segments and can only support a limited number of main roads. Efficient traffic simulation on a regional scale remains a significant challenge due to the complexity of urban mobility and the large scale of spatiotemporal data. This paper introduces a Large Scale Multi-GPU Parallel Computing based Regional Scale Traffic Simulation Framework (LPSim), which leverages graphical processing unit (GPU) parallel computing to address these challenges. LPSim utilizes a multi-GPU architecture to simulate extensive and dynamic traffic networks with high fidelity and reduced computation time. Using the parallel processing capabilities of GPUs, LPSim can perform tens of millions of individual vehicle dynamics simulations simultaneously, significantly outperforming traditional CPU-based approaches. The framework is designed to be scalable and can easily accommodate the increasing complexity of traffic simulations. We present the theory behind GPU-based traffic simulation, the architecture of single- and multi-GPU based simulations, and the graph partition strategies that enhance computation resource load balance. Our experimental results demonstrate the effectiveness of LPSim in simulating large-scale traffic scenarios. LPSim is capable of completing simulations of 2.82 million trips in just 6.28 min on a single GPU machine equipped with 5120 CUDA cores (Tesla V100-SXM2). Furthermore, utilizing a Google Cloud instance with two NVIDIA V100 GPUs, which collectively offer 10240 CUDA cores, LPSim successfully simulates 9.01 million trips within 21.16 min. We further tested our simulator with the same demand on dual NVIDIA A100-PCIE-40GB GPUs, which finished the simulation in 0.0398 h, approximately 113 times faster than the same simulation scenario running on an Intel(R) Xeon(R) Gold 6326 CPU @ 2.90 GHz, which takes 4.49 h to complete. This performance not only demonstrates its speed and scalability advantages over traditional simulation techniques but also highlights LPSim’s unique position as the first traffic simulation framework that is scalable for both single- and multiple-GPU configurations. Consequently, LPSim provides an invaluable tool for individuals and extensive research teams alike, enabling the acquisition of large-scale traffic simulation results in a time-efficient manner. LPSim code is available at: https://github.com/Xuan-1998/LPSim

Graphics Processing Units Research Articles

Related Topics

Articles published on Graphics Processing Units

Tensor power flow formulations for multidimensional analyses in distribution systems

Immersed boundary method for dynamic simulation of polarizable colloids of arbitrary shape in explicit ion electrolytes.

Improved modularity and new features in ipie: Toward even larger AFQMC calculations on CPUs and GPUs at zero and finite temperatures.

Fault3DNnet: A lightweight 3D seismic fault detection network with bidirectional decoding

High-throughput software LDPC decoder on GPU

Reducing the replication time for structural estimations: A successful replication of “An Anatomy of International Trade” using GPU computing

Big data research is everyone's research-Making epilepsy data science accessible to the global community: Report of the ILAE big data commission.

An explicit time integration method for Boussinesq approximation

An artifactual fibre overlap removal algorithm for micro-computed tomography image post-processing and 3D microstructure generation with graphics processing unit acceleration

All-to-all reconfigurability with sparse and higher-order Ising machines

Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.

Large scale multi-GPU based parallel traffic simulation for accelerated traffic assignment and propagation

HybridSA: GPU Acceleration of Multi-pattern Regex Matching using Bit Parallelism

Accelerating Neural Network Inference in Handwritten Digit Recognition — Comparative Study

A GPU-Based Lattice Boltzmann Method for Predicting Near- and Far-Field Jet Noise

Very-Large-Scale GPU-Accelerated Nuclear Gradient of Time-Dependent Density Functional Theory with Tamm-Dancoff Approximation and Range-Separated Hybrid Functionals.

Efficient Random Field Generation With Rotational Anisotropy for Probabilistic SPH Analysis of Slope Failure

Dynamic Load Balancing of Multi-GPU Parallelization for VULCANO VE-U7 Corium Spreading Analysis Using SOPHIA

GPU implementation of a fast active noise control algorithm with multiple reference signals

Direct aeroacoutic simulation of a ducted axial fan with acoustic liners using a volume penalization lattice Boltzmann solver on heterogeneous massively parallel platforms

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Graphics Processing Units Research Articles

Related Topics

Articles published on Graphics Processing Units

Tensor power flow formulations for multidimensional analyses in distribution systems

Immersed boundary method for dynamic simulation of polarizable colloids of arbitrary shape in explicit ion electrolytes.

Improved modularity and new features in ipie: Toward even larger AFQMC calculations on CPUs and GPUs at zero and finite temperatures.

Fault3DNnet: A lightweight 3D seismic fault detection network with bidirectional decoding

High-throughput software LDPC decoder on GPU

Reducing the replication time for structural estimations: A successful replication of “An Anatomy of International Trade” using GPU computing

Big data research is everyone's research-Making epilepsy data science accessible to the global community: Report of the ILAE big data commission.

An explicit time integration method for Boussinesq approximation

An artifactual fibre overlap removal algorithm for micro-computed tomography image post-processing and 3D microstructure generation with graphics processing unit acceleration

All-to-all reconfigurability with sparse and higher-order Ising machines

Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.

Large scale multi-GPU based parallel traffic simulation for accelerated traffic assignment and propagation

HybridSA: GPU Acceleration of Multi-pattern Regex Matching using Bit Parallelism

Accelerating Neural Network Inference in Handwritten Digit Recognition — Comparative Study

A GPU-Based Lattice Boltzmann Method for Predicting Near- and Far-Field Jet Noise

Very-Large-Scale GPU-Accelerated Nuclear Gradient of Time-Dependent Density Functional Theory with Tamm-Dancoff Approximation and Range-Separated Hybrid Functionals.

Efficient Random Field Generation With Rotational Anisotropy for Probabilistic SPH Analysis of Slope Failure

Dynamic Load Balancing of Multi-GPU Parallelization for VULCANO VE-U7 Corium Spreading Analysis Using SOPHIA

GPU implementation of a fast active noise control algorithm with multiple reference signals

Direct aeroacoutic simulation of a ducted axial fan with acoustic liners using a volume penalization lattice Boltzmann solver on heterogeneous massively parallel platforms