Re-engineering the ant colony optimization for CMP architectures

José M Cecilia,José M García

doi:10.1007/s11227-019-02869-8

Abstract

The ant colony optimization (ACO) is inspired by the behavior of real ants, and as a bioinspired method, its underlying computation is massively parallel by definition. This paper shows re-engineering strategies to migrate the ACO algorithm applied to the Traveling Salesman Problem to modern Intel-based multi- and many-core architectures in a step-by-step methodology. The paper provides detailed guidelines on how to optimize the algorithm for the intra-node (thread and vector) parallelization, showing the performance scalability along with the number of cores on different Intel architectures, reporting up to 5.5x speedup factor between the Intel Xeon Phi Knights Landing and Intel Xeon v2. Moreover, parallel efficiency is provided for all targeted architectures, finding that core load imbalance, memory bandwidth limitations, and NUMA effects on data placement are some of the key factors limiting performance. Finally, a distributed implementation is also presented, reaching up to 2.96x speedup factor when running the code on 3 nodes over the single-node counterpart version. In the latter case, the parallel efficiency is affected by the synchronization frequency, which also affects the quality of the solution found by the distributed implementation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Journal of Supercomputing	Publication Date: Apr 30, 2019
Citations: 4	License type: other-oa

R Discovery Prime

R Discovery Prime

Re-engineering the ant colony optimization for CMP architectures

Abstract

Talk to us

Similar Papers

More From: The Journal of Supercomputing

Lead the way for us

Similar Papers

On the Mitigation of Cache Hostile Memory Access Patterns on Many-Core CPU Architectures
Tom Deakin ... Simon Mcintosh-Smith
-
Tom Deakin, et. al.Tom Deakin ... Simon Mcintosh-Smith
01 Jan 2017
01 Jan 2017

Performance and Scalability Study of FMM Kernels on Novel Multi- and Many-core Architectures
Antón Rey ... Jan F Prins
Procedia Computer Science | VOL. 108
Antón Rey, et. al.Antón Rey ... Jan F Prins
01 Jan 2017
Procedia Computer Science | VOL. 108

Scalability of Hybrid SpMV on Intel Xeon Phi Knights Landing
Brian A Page ... Peter M Kogge
-
Brian A Page, et. al.Brian A Page ... Peter M Kogge
01 Jul 2019
01 Jul 2019

Some useful optimisations for unstructured computational fluid dynamics codes on multicore and manycore architectures
Ioan Hadade ... Luca Di Mare
Computer Physics Communications | VOL. 235
Ioan Hadade, et. al.Ioan Hadade ... Luca Di Mare
18 Jul 2018
Computer Physics Communications | VOL. 235

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Re-engineering the ant colony optimization for CMP architectures

Abstract

Talk to us

Similar Papers

More From: The Journal of Supercomputing