The University of Southampton
University of Southampton Institutional Repository

Speed efficiency optimization for GPU accelerated rigorous coupled-wave analysis program

Speed efficiency optimization for GPU accelerated rigorous coupled-wave analysis program
Speed efficiency optimization for GPU accelerated rigorous coupled-wave analysis program
Rigorous Coupled Wave Analysis (RCWA) method is highly efficient for the simulation of diffraction efficiency and field distribution patterns in periodic structures and textured optoelectronic devices such as VCSELs, LEDs, and DOEs. RCWA provides exact solutions provided the Fourier expansion has infinite order. In practice, the Fourier expansion must be truncated due to computer memory limitations. Researchers are trying to utilize fast convergence algorithms such as the ‘normal vector method’ and ‘Li’s rule’ which could obtain accurate TM mode results with fewer Fourier orders. However, to thoroughly investigating the behavior of a structure usually requires thousand and even millions of RCWA simulations which may last hours and days. GPU is highly suitable for solutions of complex systems allowing large-scale multi-threaded parallel programming (< 1000 / low-end GPU, <5k / high-end GPU) to speed up matrix computations significantly. In this paper, we present a high-speed RCWA program utilizing optimized CUDA-GPU code and MAGMA libraries. It achieves 2-6 X speedup compared to conventional multithreaded CPU-based code utilizing the Intel MKL library running on IRIDIS 5 super-computer (1 NVIDIA v100 GPU, 40 Intel Xeon Gold 6138 2.0GHz cores CPU).
0277-786X
SPIE
Xu, Jingxiao
6a01b40f-4a5c-4908-a2b9-61433a03757e
Charlton, Martin D.B.
fcf86ab0-8f34-411a-b576-4f684e51e274
Witzigmann, Bernd
Osiński, Marek
Arakawa, Yasuhiko
Xu, Jingxiao
6a01b40f-4a5c-4908-a2b9-61433a03757e
Charlton, Martin D.B.
fcf86ab0-8f34-411a-b576-4f684e51e274
Witzigmann, Bernd
Osiński, Marek
Arakawa, Yasuhiko

Xu, Jingxiao and Charlton, Martin D.B. (2023) Speed efficiency optimization for GPU accelerated rigorous coupled-wave analysis program. Witzigmann, Bernd, Osiński, Marek and Arakawa, Yasuhiko (eds.) In Physics and Simulation of Optoelectronic Devices XXXI. vol. 12415, SPIE. 10 pp . (doi:10.1117/12.2650022).

Record type: Conference or Workshop Item (Paper)

Abstract

Rigorous Coupled Wave Analysis (RCWA) method is highly efficient for the simulation of diffraction efficiency and field distribution patterns in periodic structures and textured optoelectronic devices such as VCSELs, LEDs, and DOEs. RCWA provides exact solutions provided the Fourier expansion has infinite order. In practice, the Fourier expansion must be truncated due to computer memory limitations. Researchers are trying to utilize fast convergence algorithms such as the ‘normal vector method’ and ‘Li’s rule’ which could obtain accurate TM mode results with fewer Fourier orders. However, to thoroughly investigating the behavior of a structure usually requires thousand and even millions of RCWA simulations which may last hours and days. GPU is highly suitable for solutions of complex systems allowing large-scale multi-threaded parallel programming (< 1000 / low-end GPU, <5k / high-end GPU) to speed up matrix computations significantly. In this paper, we present a high-speed RCWA program utilizing optimized CUDA-GPU code and MAGMA libraries. It achieves 2-6 X speedup compared to conventional multithreaded CPU-based code utilizing the Intel MKL library running on IRIDIS 5 super-computer (1 NVIDIA v100 GPU, 40 Intel Xeon Gold 6138 2.0GHz cores CPU).

Text
124150J - Version of Record
Available under License Creative Commons Attribution.
Download (545kB)

More information

Published date: 10 March 2023
Venue - Dates: Physics and Simulation of Optoelectronic Devices XXXI, , San Francisco, United States, 2023-01-28 - 2023-02-03

Identifiers

Local EPrints ID: 490035
URI: http://eprints.soton.ac.uk/id/eprint/490035
ISSN: 0277-786X
PURE UUID: e7787f1a-9828-4c42-9110-168648f3b6b2
ORCID for Jingxiao Xu: ORCID iD orcid.org/0000-0001-6116-0057

Catalogue record

Date deposited: 14 May 2024 16:30
Last modified: 13 Jun 2024 01:57

Export record

Altmetrics

Contributors

Author: Jingxiao Xu ORCID iD
Author: Martin D.B. Charlton
Editor: Bernd Witzigmann
Editor: Marek Osiński
Editor: Yasuhiko Arakawa

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×