OptoGPT: Foundation Models for Multilayer Film Design

Compuscript Ltd

A new publication from Opto-Electronic Advances; DOI 10.29026/oea.2024.240062, discusses OptoGPT, a new inverse design algorithm.

Optical multilayer thin film structure is one of the most important photonic structures widely used in many applications, including color filters, absorbers, optical cavities or resonators, photovoltaic and radiative cooling, special mirrors for extreme UV lithography and for space telescopes. Designing these structures requires much training and expertise as identifying the best combination of materials and the thickness at each layer is not an easy task: considering the large number of material choices, the possible searching space for designing such a multilayer structure can be extremely large. Different researchers may care about different types of optical responses, such as reflection, transmission, polarization, angled spectrum, etc. Therefore a general method that can inverse design structures to produce these different responses is highly desirable and can benefit all researchers in the field. How to achieve such a goal, however, is still unknown.

The authors of this article introduce a new inverse design algorithm called OptoGPT, inspired from the popular and powerful GPT models in natural language processing, such as OpenAI's ChatGPT and Google's Bard. In these GPT models, given an input prompt such as questions or guidance, they can output desirable and meaningful sentences as answers. The proposed OptoGPT model works in a similar way. When given a specific type of optical responses that a user desires, the model can output a designed multilayer structure directly.

To make OptoGPT work, the researchers first introduce the idea of structure tokenization to convert the material and thickness information at each layer into a single token. Then, they use the structure serialization to convert a multilayer thin film structure to a sequence of tokens as an input to the GPT model. In this way, their OptoGPT model can effectively design for both materials and thickness information. On the other hand, to make their model suited for different types of optical responses, they propose to design for both transmission and reflection spectrum and use multiple techniques to extend to diverse responses. Details can be found in Figure 1.

Using OptoGPT, the authors have successfully demonstrated a unified inverse design in different applications, including transmission filter, high reflection filter, perfect absorber, arbitrary absorber, reflective/transmissive structural color, to name a few. During design process, the OptoGPT model can automatically determine the best material as well as its thickness at each layer. In addition, each design process is extremely fast, and can be done within 0.1 second. Examples can be found in Figure 2.

In addition, the authors further propose the idea of "probability resampling" to design structure that can satisfy arbitrary constraints during fabrication or other practical requirements. By using finetuning and "mixed sampling", their model can work for different incident angles and different polarizations, as well as simultaneous design that can satisfy multiple angles and polarization requirements. In this way, ultimately the team build a model that can unify the multilayer design for different materials and thickness, and for different types of optical responses, including transmission, reflection, absorber, structural color, angles, and polarization states, which significantly simplify the inverse design process, making them easily accessible to researchers and engineers.

Keywords: multilayer thin film structure / inverse design / foundation models / deep learning / structural color

/Public Release. This material from the originating organization/author(s) might be of the point-in-time nature, and edited for clarity, style and length. Mirage.News does not take institutional positions or sides, and all views, positions, and conclusions expressed herein are solely those of the author(s).View in full here.