Developing a High Performance Software Library with MPI and CUDA for Matrix Computations

Nowadays, the paradigm of parallel computing is changing. CUDA is now a popular programming model for general purpose computations on GPUs and a great number of applications were ported to CUDA obtaining speedups of orders of magnitude comparing to optimized CPU implementations. Hybrid approaches th...

Full description

Bibliographic Details
Main Authors: Bogdan Oancea, Tudorel Andrei
Format: Article
Language:English
Published: "Nicolae Titulescu" University of Bucharest 2014-04-01
Series:Computational Methods in Social Sciences
Subjects:
Online Access:http://cmss.univnt.ro/wp-content/uploads/vol/split/vol_I_issue_2/CMSS_vol_I_issue_2_art_001.pdf