PIER
 
Progress In Electromagnetics Research
ISSN: 1070-4698, E-ISSN: 1559-8985
Home | Search | Notification | Authors | Submission | PIERS Home | EM Academy
Home > Vol. 112 > pp. 199-213

ON OPENMP PARALLELIZATION OF THE MULTILEVEL FAST MULTIPOLE ALGORITHM

By X.-M. Pan, W.-C. Pi, and X.-Q. Sheng

Full Article PDF (381 KB)

Abstract:
Compared with MPI, OpenMP provides us an easy way to parallelize the multilevel fast multipole algorithm (MLFMA) on shared-memory systems. However, the implementation of OpenMP parallelization has many pitfalls because different parts of the MLFMA have distinct numerical characteristics due to its complicated algorithm structure. These pitfalls often cause very low efficiency, especially when many threads are employed. Through an in-depth investigation on these pitfalls with analysis and numerical experiments, we propose an efficient OpenMP parallel MLFMA. Three strategies are proposed in the parallelization, including: 1) the choice of OpenMP schedule manners; 2) loop reorganization for far-field interaction in the MLFMA; 3) determination of a transition level. Numerical experiments on large scale targets show the proposed OpenMP parallel scheme can perform as efficiently as the MPI counterpart, and much more efficiently than the straightforward OpenMP parallel one.

Citation:
X.-M. Pan, W.-C. Pi, and X.-Q. Sheng, "On openmp parallelization of the multilevel fast multipole algorithm," Progress In Electromagnetics Research, Vol. 112, 199-213, 2011.
doi:10.2528/PIER10120802
http://www.jpier.org/pier/pier.php?paper=10120802

References:
1. Song, J. M., C. C. Lu, and W. C. Chew, "MLFMA for electromagnetic scattering by large complex objects," IEEE Trans. Antennas Propagat., Vol. 45, 1488-1493, Oct. 1997.
doi:10.1109/8.633855

2. Ayestaran, R. G., J. Laviada-Martinez, and F. Las-Heras, "Realistic antenna array synthesis in complex environments using a Mom-SVR approach," Journal of Electromagnetic Waves and Applications, Vol. 23, No. 1, 97-108, 2009.
doi:10.1163/156939309787604670

3. Lai, B., N. Wang, H. B. Yuan, and C. H. Liang, "Hybrid method of higher-order MoM and nyström disretization PO for 3D PECproblems," Progress In Electromagnetics Research, Vol. 109, 381-398, 2010.
doi:10.2528/PIER10081401

4. Hou, Z. G., C. Wang, and H. C. Yin, "Multilevel thresholding method for a sparse representation of reduced matrix in CBFM," Journal of Electromagnetic Waves and Applications, Vol. 24, No. 17--18, 2605-2614, 2010.
doi:10.1163/156939310793675673

5. Ling, J., S. X. Gong, S. T. Qin, W. T. Wang, and Y. J. Zhang, "Wide-band analysis of on-platform antenna using MoM-PO combined with maehly approximation," Journal of Electromagnetic Waves and Applications, Vol. 24, No. 4, 475-484, 2010.

6. Ergul, O. and L. Gurel, "Improving iterative solutions of the electric-field integral equation via transformations into normal equations," Journal of Electromagnetic Waves and Applications, Vol. 24, 2129-2138, 2010.
doi:10.1163/156939310793699082

7. Ping, X. W., T. J. Cui, and W. B. Lu, "The combination of bcgstab with multifrontal algorithm to solve febi-mlfma linear systems arising from inhomogeneous electromagnetic scattering problems," Progress In Electromagnetics Research, Vol. 93, 91-105, 2009.
doi:10.2528/PIER09050604

8. Peng, Z., X. Q. Sheng, and F. Yin, "An efficient twofold iterative algorithm of Fe-Bi-MLFMA using multilevel inverse-based ilu preconditioning," Progress In Electromagnetics Research, Vol. 93, 369-384, 2009.
doi:10.2528/PIER09060305

9. Eibert, T. F., Ismatullah, E. Kaliyaperumal, and C. H. Schmidt, "Inverse equivalent surface current method with hierarchical higher order basis functions, full probe correction and multilevel fast multipole acceleration," Progress In Electromagnetics Research, Vol. 106, 377-394, 2010.
doi:10.2528/PIER10061604

10. Wu, G., X. Zhang, and B. Liu, "A hybrid method for predicting the shielding effectiveness of rectangular metallic enclosures with thickness apertures," Journal of Electromagnetic Waves and Applications, Vol. 24, No. 8--9, 1157-1169, 2010.
doi:10.1163/156939310791585972

11. Cui, Z., Y. Han, Q. Xu, and M. Li, "Parallel MoM solution of jmcfie for scattering by 3-d electrically large dielectric objects," Progress In Electromagnetics Research M, Vol. 12, 217-228, 2010.
doi:10.2528/PIERM10042607

12. Donepudi, K. C., J. M. Jin, S. Velamparambil, J. M. Song, and W. C. Chew, "A higher order parallelized multilevel fast multipole algorithm for 3-D scattering," IEEE Trans. Antennas Propag., Vol. 49, 1069-1078, Jul. 2001.

13. Velamparambil, S., W. C. Chew, and J. M. Song, "10 million unknowns: Is it that big?," IEEE Antennas Propagat. Mag., Vol. 45, 43-58, Apr. 2003.
doi:10.1109/MAP.2003.1203119

14. Velamparambil, S. and W. C. Chew, "Analysis and performance of a distributed memory multilevel fast multipole algorithm," IEEE Trans. Antennas Propagat., Vol. 53, 2719-2727, Aug. 2005.
doi:10.1109/TAP.2005.851859

15. Pan, X. M. and X. Q. Sheng, "A highly efiicient parallel approach of multi-level fast multipole algorithm," Journal of Electromagnetic Waves and Applications, Vol. 20, No. 8, 1081-1092, 2006.
doi:10.1163/156939306776930321

16. Pan, X. M. and X. Q. Sheng, "A sophisticated parallel MLFMA for scattering by extremely large targets," IEEE Antennas Propag. Mag., Vol. 50, No. 3, 129-138, Jun. 2008.
doi:10.1109/MAP.2008.4563583

17. Ergül, Ö. and L. Gürel, "Efficient parallelization of the multilevel fast multipole algorithm for the solution of large-scale scattering problems," IEEE Trans. Antennas Propagat., Vol. 56, 2335-2345, Aug. 2008.
doi:10.1109/TAP.2008.926757

18. Ergül, Ö. and L. Gürel, "A hierarchical partitioning strategy for an efficient parallelization of the multilevel fast multipole algorithm," IEEE Trans. Antennas Propag., Vol. 57, No. 6, 1740-1750, Jun. 2009.
doi:10.1109/TAP.2009.2019913

19. Li, W. D., W. Hong, and H.-X. Zhou, "An IE-ODDM-MLFMA scheme with DILU preconditioner for analysis of electromagnetic scattering from large complex objects," IEEE Trans. Antennas Propag., Vol. 56, 1368-1380, May 2008.
doi:10.1109/TAP.2008.922608

20. Yang, M. L. and X. Q. Sheng, "Parallel high-order Fe-Bi-MLFMA for scattering by large and deep coated cavities loaded with obstacles," Journal of Electromagnetic Waves and Applications, Vol. 23, No. 13, 1813-1823, 2009.
doi:10.1163/156939309789566932

21. Taboada, J. M., M. G. Araujo, F. Obelleiro, J. M. Bertolo, L. Landesa, J. Rivero, and J. L. Rodriguez, "Supercomputer aware approach for the solution of challenging electromagnetic problems," Progress In Electromagnetics Research, Vol. 101, 241-256, 2010.

22. Buchau, A., S. M. Tsafak, W. Hafla, and W. M. Rucker, "Parallelization of a fast multipole boundary element method with cluster OpenMP," IEEE Trans. Magn., Vol. 44, 1338-1341, Jun. 2008.
doi:10.1109/TMAG.2007.916262

23. Zhang, H. W., X. W. Zhao, Y. Zhang, D. G. Donoro, W. X. Zhao, and C. H. Liang, "Analysis of a large scale narrow-wall slotted waveguide array by parallel MoM out-of-core solver using the higher order basis functions," Journal of Electromagnetic Waves and Applications, Vol. 24, No. 14--15, 1953-1965, 2010.

24. Gao, P. C., Y.-B. Tao, and H. Lin, "Fast RCS prediction using multiresolution shooting and bouncing ray method on the GPU," Progress In Electromagnetics Research, Vol. 107, 187-202, 2010.
doi:10.2528/PIER10061807

25. Jiang, W. Q., M. Zhang, and Y. Wang, "CUDA-based radiative transfer method with application to the em scattering from a two-layer canopy model," Journal of Electromagnetic Waves and Applications, Vol. 24, No. 17--18, 2509-2521, 2010.
doi:10.1163/156939310793675772

26. Tay, W. C., D. Y. Heh, and E. L. Tan, "Gpu-accelerated fundamental adi-FDTD with complex frequency shifted convolutional perfectly matched layer," Progress In Electromagnetics Research M, Vol. 14, 177-192, 2010.
doi:10.2528/PIERM10090605

27. Qinn, M. J., Parallel Programming in C with MPI and OpenMP,, McGraw Hill, 2004.

28. http://www.sccas.cn/gb/index.html.,.


© Copyright 2014 EMW Publishing. All Rights Reserved