Abstract
This work presents a highly optimized computational framework for the Discrete Dipole Approximation, a numerical method for calculating the optical properties associated with a target of arbitrary geometry that is widely used in atmospheric, astrophysical and industrial simulations. Core optimizations include the bit-fielding of integer data and iterative methods that complement a new Discrete Fourier Transform (DFT) kernel, which efficiently calculates the matrix' vector products required by these iterative solution schemes. The new kernel performs the requisite 3-D DFTs as ensembles of 1-D transforms, and by doing so, is able to reduce the number of constituent 1-D transforms by 60% and the memory by over 80%. The optimizations also facilitate the use of parallel techniques to further enhance the performance. Complete OpenMP-based shared-memory and MPI-based distributed-memory implementations have been created to take full advantage of the various architectures. Several benchmarks of the new framework indicate extremely favorable performance and scalability.
Original language | English (US) |
---|---|
Pages (from-to) | 42-61 |
Number of pages | 20 |
Journal | International Journal of High Performance Computing Applications |
Volume | 23 |
Issue number | 1 |
DOIs | |
State | Published - Mar 2009 |
Keywords
- CG-FFT
- Discrete dipole approximation
- Matrix-vector product
- Optimization
- Parallel algorithm
ASJC Scopus subject areas
- Software
- Theoretical Computer Science
- Hardware and Architecture