Massively parallel polar decomposition on distributed-memory systems
We present a high-performance implementation of the Polar Decomposition (PD) on distributed-memory systems. Building upon on the QR-based Dynamically Weighted Halley (QDWH) algorithm, the key idea lies in finding the best rational approximation for the scalar sign function, which also corresponds to...
Main Authors: | , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: |
Association for Computing Machinery
2019
|