Solving Matrix Equations on Multi-Core and Many-Core Architectures

Benner, Peter; Ezzatti, Pablo; Mena, Hermann; Quintana-Ortí, Enrique S.; Remón, Alfredo
December 2013
Algorithms;Dec2013, Vol. 6 Issue 4, p857
Academic Journal
We address the numerical solution of Lyapunov, algebraic and differential Riccati equations, via the matrix sign function, on platforms equipped with general-purpose multicore processors and, optionally, one or more graphics processing units (GPUs). In particular, we review the solvers for these equations, as well as the underlying methods, analyze their concurrency and scalability and provide details on their parallel implementation. Our experimental results show that this class of hardware provides sufficient computational power to tackle large-scale problems, which only a few years ago would have required a cluster of computers.


Related Articles

  • Un sistema heterogéneo Multicore/GPU para acelerar la búsqueda por similitud en estructuras métricas. Uribe-Paredes, Roberto; Cazorla, Diego; Arias, Enrique; Sánchez, José Luis // INGENIARE - Revista Chilena de Ingeniería;jan-mar2014, Vol. 22 Issue 1, p26 

    Nowadays, similarity search on metric spaces is becoming a research field of interest due to the fact of its application to different scientific areas. However, when these applications produce a huge amount of data, it is necessary to accelerate the searching process by means of parallel...

  • Comparison of Selected Parallel Path Planning Algorithms on GPGPUs and Multi-Core Processors. Seidler, Ralf; Schmidt, Michael; Schäfer, Andreas; Fey, Dietmar // Annual International Conference on Advances in Distributed & Par;Nov2010, pR133 

    In robot systems several computationally intensive tasks can be found, with path planning being one of them. Especially in dynamically changing environments, it is difficult to meet real-time constraints with a serial processing approach. For those systems employing standard computers, a viable...

  • GPU ACCELERATED PARALLEL BRANCH PREDICTION FOR MULTI/MANY-CORE PROCESSOR SIMULATION. Liqiang He; Guangyong Zhang; Jingdong Jiang // International Journal of Numerical Analysis & Modeling;May2012, Vol. 9 Issue 2, p193 

    Branch Prediction is a common function in nowadays microprocessors. Branch predictor is duplicated in each core of a multi/many-core processor and makes prediction for multiple concurrent running programs respectively. To evaluate the parallel branch prediction in a multi/many-core processor,...

  • Parallelizing RSA Algorithm on Multicore CPU and GPU. Fadhil, Heba Mohammed; Younis, Mohammed Issam // International Journal of Computer Applications;Feb2014, Vol. 87, p15 

    Public key algorithms are extensively known to be slower than symmetric key alternatives in the area of cryptographic algorithms for the reason of their basis in modular arithmetic. The most public key algorithm widely used is the RSA. Therefore, how to enhance the speed of RSA algorithm has...

  • Parallelization Research of Circle Detection Based on Hough Transform. Suping Wu; Xiangjiao Liu // International Journal of Computer Science Issues (IJCSI);Nov2012, Vol. 9 Issue 6, p481 

    There is a problem of too long computation time in Circle detection of Hough transform. In this paper, two paralleled methods are given based on Threading Building Blocks (TBB) and CUDA, by utilizing multi-core and GPU, the most timeconsuming part of circle detection is coped with...

  • FPGA on FPGA: Implementation of Fine-grained Parallel Genetic Algorithm on Field Programmable Gate Array. AL-Marakeby, A. // International Journal of Computer Applications;Oct2013, Vol. 80, p29 

    Many optimization problems have complex search space, which either increase the solving problem time or finish searching without obtaining the best solution. Genetic Algorithm (GA) is an optimization technique used in solving many practical problems in science, engineering, and business domains....

  • Frequent Pattern-growth Algorithm on Multi-core CPU and GPU Processors. Arour, Khedija; Belkahla, Amani // Journal of Computing & Information Technology;2014, Vol. 22 Issue 3, p159 

    Discovering association rules that identify relationships among sets of items is an important problem in data mining. It's a two steps process, the first step finds all frequent itemsets and the second one constructs association rules from these frequent sets. Finding frequent itemsets is...

  • Automatic routine tuning to represent landform attributes on multicore and multi-GPU systems. Boratto, Murilo; Alonso, Pedro; Gimenéz, Domingo; Barreto, Marcos // Journal of Supercomputing;Nov2014, Vol. 70 Issue 2, p733 

    Auto-tuning techniques have been used in the design of routines in recent years. The goal is to develop routines which automatically adapt to the conditions of the computational system in such a way that efficient executions are obtained independently of the end-user experience. This paper aims...

  • Design patterns percolating to parallel programming framework implementation. Aldinucci, M.; Campa, S.; Danelutto, M.; Kilpatrick, P.; Torquati, M. // International Journal of Parallel Programming;Dec2014, Vol. 42 Issue 6, p1012 

    Structured parallel programming is recognised as a viable and effective means of tackling parallel programming problems. Recently, a set of simple and powerful parallel building blocks ( $$\mathsf{RISC\text{- }pb^2l}$$ ) has been proposed to support modelling and implementation of parallel...


Read the Article


Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics