The best approach to parallelize multidimensional FFT algorithms has long been under debate, Distributed transposes are widely used, but they also vary in communication policies and hence performance. In this work we analyze the impact of different redistribution strategies on the performance of parallel FFT, on various machine architectures. We found that some redistribution strategies were consistently superior, while some others were unexpectedly inferior, An in-depth investigation into the reasons for this behavior is included in this work. Copyright (C) 2001 John Wiley & Sons, Ltd.

Redistribution strategies for portable parallel FFT: A case study

Tessera D.
2001-01-01

Abstract

The best approach to parallelize multidimensional FFT algorithms has long been under debate, Distributed transposes are widely used, but they also vary in communication policies and hence performance. In this work we analyze the impact of different redistribution strategies on the performance of parallel FFT, on various machine architectures. We found that some redistribution strategies were consistently superior, while some others were unexpectedly inferior, An in-depth investigation into the reasons for this behavior is included in this work. Copyright (C) 2001 John Wiley & Sons, Ltd.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11571/1522795
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 7
social impact