Vol. 23
Latest Volume
All Volumes
PIERM 125 [2024] PIERM 124 [2024] PIERM 123 [2024] PIERM 122 [2023] PIERM 121 [2023] PIERM 120 [2023] PIERM 119 [2023] PIERM 118 [2023] PIERM 117 [2023] PIERM 116 [2023] PIERM 115 [2023] PIERM 114 [2022] PIERM 113 [2022] PIERM 112 [2022] PIERM 111 [2022] PIERM 110 [2022] PIERM 109 [2022] PIERM 108 [2022] PIERM 107 [2022] PIERM 106 [2021] PIERM 105 [2021] PIERM 104 [2021] PIERM 103 [2021] PIERM 102 [2021] PIERM 101 [2021] PIERM 100 [2021] PIERM 99 [2021] PIERM 98 [2020] PIERM 97 [2020] PIERM 96 [2020] PIERM 95 [2020] PIERM 94 [2020] PIERM 93 [2020] PIERM 92 [2020] PIERM 91 [2020] PIERM 90 [2020] PIERM 89 [2020] PIERM 88 [2020] PIERM 87 [2019] PIERM 86 [2019] PIERM 85 [2019] PIERM 84 [2019] PIERM 83 [2019] PIERM 82 [2019] PIERM 81 [2019] PIERM 80 [2019] PIERM 79 [2019] PIERM 78 [2019] PIERM 77 [2019] PIERM 76 [2018] PIERM 75 [2018] PIERM 74 [2018] PIERM 73 [2018] PIERM 72 [2018] PIERM 71 [2018] PIERM 70 [2018] PIERM 69 [2018] PIERM 68 [2018] PIERM 67 [2018] PIERM 66 [2018] PIERM 65 [2018] PIERM 64 [2018] PIERM 63 [2018] PIERM 62 [2017] PIERM 61 [2017] PIERM 60 [2017] PIERM 59 [2017] PIERM 58 [2017] PIERM 57 [2017] PIERM 56 [2017] PIERM 55 [2017] PIERM 54 [2017] PIERM 53 [2017] PIERM 52 [2016] PIERM 51 [2016] PIERM 50 [2016] PIERM 49 [2016] PIERM 48 [2016] PIERM 47 [2016] PIERM 46 [2016] PIERM 45 [2016] PIERM 44 [2015] PIERM 43 [2015] PIERM 42 [2015] PIERM 41 [2015] PIERM 40 [2014] PIERM 39 [2014] PIERM 38 [2014] PIERM 37 [2014] PIERM 36 [2014] PIERM 35 [2014] PIERM 34 [2014] PIERM 33 [2013] PIERM 32 [2013] PIERM 31 [2013] PIERM 30 [2013] PIERM 29 [2013] PIERM 28 [2013] PIERM 27 [2012] PIERM 26 [2012] PIERM 25 [2012] PIERM 24 [2012] PIERM 23 [2012] PIERM 22 [2012] PIERM 21 [2011] PIERM 20 [2011] PIERM 19 [2011] PIERM 18 [2011] PIERM 17 [2011] PIERM 16 [2011] PIERM 14 [2010] PIERM 13 [2010] PIERM 12 [2010] PIERM 11 [2010] PIERM 10 [2009] PIERM 9 [2009] PIERM 8 [2009] PIERM 7 [2009] PIERM 6 [2009] PIERM 5 [2008] PIERM 4 [2008] PIERM 3 [2008] PIERM 2 [2008] PIERM 1 [2008]
2012-01-08
Graphics Processor Unit (GPU) Acceleration of Finite-Difference Frequency-Domain (Fdfd) Method
By
Progress In Electromagnetics Research M, Vol. 23, 29-51, 2012
Abstract
Recently, many numerical methods that are developed for the solution of electromagnetic problems have greatly benefited from the hardware accelerated scientific computing capability provided by graphics processing units (GPUs) and orders of magnitude speed-up factors have been reported. Among these methods, the finite-difference frequency-domain (FDFD) method as well can be accelerated substantially by utilizing an efficient algorithm customized for GPU computing. In this contribution, an algorithm is presented that treats iterative solution of the FDFD linear equation system similar to solution of three-dimensional Finite-Difference Time-Domain (FDTD) method, which inherently yields itself to high level parallelization. The presented algorithm uses BICGSTAB iterative solver. Integrated with BICGSTAB, an efficient method of performing matrix-vector products for the linear system of FDFD equations is adapted and implemented in Compute Unified Device Architecture (CUDA). It is shown that FDFD can be solved with a speed-up factor of more than 20 on a GPU compared with the solution on a central processing unit (CPU), while memory usage as well can be reduced substantially with the presented algorithm.
Citation
Veysel Demir, "Graphics Processor Unit (GPU) Acceleration of Finite-Difference Frequency-Domain (Fdfd) Method," Progress In Electromagnetics Research M, Vol. 23, 29-51, 2012.
doi:10.2528/PIERM11090909
References

1. Sypek, P., A. Dziekonski, and M. Mrozowski, "How to render FDTD computations more effective using a graphics accelerator," IEEE Transactions on Magnetics, Vol. 45, No. 3, 1324-1327, 2009.
doi:10.1109/TMAG.2009.2012614

2. Demir, V. and A. Z. Elsherbeni, "Compute unified device architecture (CUDA) based finite-difference time-domain (FDTD) implementation," Journal of the Applied Computational Electromagnetics Society (ACES), Vol. 25, No. 4, 303-314, April 2010.

3. Ong, C. Y., M. Weldon, S. Quiring, L. Maxwell, M. C. Hughes, C. Whelan, and M. Okoniewski, "Speed it up," IEEE Microwave Magazine, Vol. 11, No. 2, 70-78, April 2010.
doi:10.1109/MMM.2010.935776

4. De Donno, D., A. Esposito, L. Tarricone, and L. Catarinucci, "Introduction to GPU computing and CUDA programming: A case study on FDTD," IEEE Antennas and Propagation Magazine, EM Programmer's Notebook, Vol. 52, No. 3, 116-122, June 2010.

5. Godel, N., N. Nunn, T. Warburton, and T. Clemens, "Scalability of higher-order discontinuous galerkin FEM computations for solving electromagnetic wave propagation problems on GPU clusters," IEEE Transactions on Magnetics, Vol. 46, No. 8, August 2010.

6. De Donno, D., A. Esposito, G. Monti, and L. Tarricone, "GPU-based acceleration of the MPIE/MoM matrix calculation for the analysis of microstrip circuits," Proceedings of the European Conference on Antennas and Propagation (EuCAP 2011), Rome, Italy, April 2011.

7. Al Sharkawy, M. H., V. Demir, and A. Z. Elsherbeni, Electromagnetic Scattering Using the Iterative Multi-Region Technique (Synthesis Lectures on Computational Electromagnetics), Morgan and Claypool Publishers, November 2007.

8. Al Sharkawy, M. H., V. Demir, and A. Z. Elsherbeni, "The iterative multi-region algorithm using a hybrid finite difference frequency domain and method of moments techniques," Progress In Electromagnetics Research, Vol. 57, 19-32, 2006.
doi:10.2528/PIER05071001

9. Al Sharkawy, M. H., V. Demir, and A. Z. Elsherbeni, "Plane wave scattering from three dimensional multiple objects using the Iterative Multi-Region technique based on the FDFD method," IEEE Transactions on Antennas and Propagation, Vol. 54, No. 2, 666-673, February 2006.
doi:10.1109/TAP.2005.863129

10. Kuzu, L., V. Demir, A. Z. Elsherbeni, and E. Arvas, "Electromagnetic scattering from arbitrarily shaped chiral objects using the finite difference frequency domain method," Progress In Electromagnetics Research, Vol. 67, 1-24, 2007.
doi:10.2528/PIER06083104

11. Alkan, E., V. Demir, A. Z. Elsherbeni, and E. Arvas, "Double-grid finite-difference frequency-domain method for modeling chiral medium," IEEE Transactions on Antennas and Propagation, Vol. 58, No. 3, 817-823, March 2010.
doi:10.1109/TAP.2009.2039297

12. Zainud-Deen, S. H., E. El-Deen, M. S. Ibrahim, K. H. Awadalla, and A. Z. Botros, "Electromagnetic scattering using GPU-based finite difference frequency domain method," Progress In Electromagnetics Research B, Vol. 16, 351-369, 2009.
doi:10.2528/PIERB09060703

13. De Donno, D., A. Esposito, G. Monti, and L. Tarricone, "Iterative solution of linear systems in electromagnetics (and not only): Experiences with CUDA," UnConventional High Performance Computing, Lecture Notes in Computer Science (LNCS), Vol. 6586, Springer 2011.

14. Demir, V., E. Alkan, A. Z. Elsherbeni, and E. Arvas, "An algorithm for effcient solution of finite-difference frequency-domain (FDFD) methods," IEEE Antennas and Propagation Magazine, Vol. 61, No. 6, 143-150, December 2009.
doi:10.1109/MAP.2009.5433120

15. Kunz, K. S. and R. J. Luebbers, The Finite Difference Time Domain Method for Electromagnetics, CRC Press, 1993.

16. Berenger, J. P., "A perfectly matched layer for the absorption of electromagnetic waves," Journal of Computational Physics, Vol. 114, No. 2, 185-200, October 1994.
doi:10.1006/jcph.1994.1159

17. Yee, K. S., "Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media," IEEE Transactions on Antennas and Propagation, Vol. 14, 302-307, May 1966.

18. Saad, Y., Iterative Methods for Sparse Linear Systems, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, USA, 2003.
doi:10.1137/1.9780898718003

19. Barrett, R., M. Berry, T. F. Chan, J. Demmel, J. Donato, J. Dongarra, V. Eijkhout, R. Pozo, C. Romine, and H. Van der Vorst, Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods, 2nd Edition, Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA, USA, 1994.
doi:10.1137/1.9781611971538

20. Saad, Y. and M. H. Schultz, "GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems," SIAM Journal on Scientific and Statistical Computing, Vol. 7, No. 3, 856-869, July 1986.
doi:10.1137/0907058

21. Van der Vorst, H. A., "BI-CGSTAB: A fast and smoothly converging variant of BI-CG for the solution of nonsymmetric linear systems," SIAM Journal on Scientific and Statistical Computing, Vol. 13, No. 2, 631-644, March 1992.
doi:10.1137/0913035

22. Sleijpen, G. L. G. and D. R. Fokkema, "BICGSTAB(l) for linear equations involving unsymmetric matrices with complex spectrum," Electronic Transactions on Numerical Analysis, Vol. 1, 11-32, 1993.

23., CUDA 2.1 Programming Guide, http://www.nvidia.com/object/-cuda develop.html..

24., http://www.staff.science.uu.nl/»vorst102/software.html..

25. Blackford, L. S., J. Demmel, J. Dongarra, I. Duff, S. Hammarling, G. Henry, M. Heroux, L. Kaufman, A. Lumsdaine, A. Petitet, R. Pozo, K. Remington, and R. C. Whaley, "An updated set of basic linear algebra subprograms (BLAS)," ACM Trans. Math. Soft., Vol. 28, No. 2, 135-151, 2002.
doi:10.1145/567806.567807

26. Demir, V., A. Z. Elsherbeni, D. Worasawate, and E. Arvas, "A graphical user interface (GUI) for plane wave scattering from a conducting, dielectric or a chiral sphere," IEEE Antennas and Propagation Magazine, Vol. 46, No. 5, 94-99, October 2004.
doi:10.1109/MAP.2004.1388838

27. Zubal, I. G., C. R. Harrell, E. O. Smith, Z. Rattner, G. R. Gindi, and P. B. Hoffer, "Computerized three-dimensional segmented human anatomy," Medical Physics, Vol. 21, No. 2, 299-302, February 1994.
doi:10.1118/1.597290

28., The Zubal Phantom, http://noodle.med.yale.edu/zubal/index.htm..
doi:10.1118/1.597290

29. "An internet resource for the calculation of the dielectric properties of body tissues in the frequency range 10 Hz-100 GHz,", 2011, http://niremf.ifac.cnr.it/tissprop/..
doi:10.1118/1.597290

30. Benzi, M., "Preconditioning techniques for large linear systems: A survey," Journal of Computational Physics, Vol. 182, No. 2, 418-477, November 2002.
doi:10.1006/jcph.2002.7176

31. Li, R. and Y. Saad, "GPU-accelerated preconditioned iterative linear solvers,", CUDA ITSOL Technical Report, available at http://www-users.cs.umn.edu/»rli/, 2011.