Massively parallel polar decomposition on distributed-memory systems

We present a high-performance implementation of the Polar Decomposition (PD) on distributed-memory systems. Building on the QR-based Dynamically Weighted Halley (QDWH) algorithm, the key idea lies in finding the best rational approximation for the scalar sign function, which also corresponds to...

Bibliographic Details
Main Authors: Ltaief, H, Sukkari, D, Esposito, A, Nakatsukasa, Y, Keyes, D
Format: Journal article
Language: English
Published: Association for Computing Machinery 2019