Inverse design optimization of a metalens

Inverse design optimization of a metalens#

In this notebook, we will use inverse design and the Tidy3D autograd feature to design a high numerical aperture (NA) metalens for optimal focusing to a point. This demo also introduces how to use automatic differentiation in tidy3d for objective functions that depend on the FieldMonitor outputs.

We will follow the basic set up from Mansouree et al. “Large-Scale Parametrized Metasurface Design Using Adjoint Optimization”. The published paper can be found here and the arxiv preprint can be found here.

Schematic of the metalens

Setup#

We first perform basic imports of the packages needed.

[1]:

import matplotlib.pyplot as plt
import numpy as np
import optax
import tidy3d as td
from autograd import value_and_grad
from tidy3d import web

The metalens design consists of a rectangular array of Si rectangular prisms sitting on an SiO2 substrate.

Here we define all of the basic parameters of the setup, including the wavelength, NA, geometrical dimensions, and material properties.

[2]:

# 1 nanometer in units of microns (for conversion)
nm = 1e-3

# free space central wavelength
wavelength = 850 * nm
f0 = td.C_0 / wavelength
fwidth = f0 / 20

# desired numerical aperture
NA = 0.78

# shape parameters of metalens unit cell (um) (refer to image above and see paper for details)
H = 430 * nm
S = 320 * nm

# minimum and maximum radius of cylinder in unit cell
rmin = 50 * nm
rmax = S / 2 - 10 * nm  # to avoid touching cylinders

# space above and below metalens (in substrate and air, respectively)
buffer_z = wavelength / 2

# buffer region in x and y
buffer_xy = wavelength

# diameter of entire metalens (um)
diameter = 10

# Define material properties at 850 nm
n_Si = 3.84  # aSi
n_SiO2 = 1.45
air = td.Medium(permittivity=1.0)
SiO2 = td.Medium(permittivity=n_SiO2**2)
Si = td.Medium(permittivity=n_Si**2)

# define symmetry
symmetry = (-1, 1, 0)

# simulation run time
run_time = 100 / fwidth
min_steps_per_wvl = 10

Next, we will also define some derived parameters:

[3]:

# Compute the domain size in x, y
length_xy = diameter + 2 * buffer_xy

# focal length given diameter and numerical aperture
focal_length = length_xy / 2 / NA * np.sqrt(1 - NA**2)

# total domain size in z: unit cells + buffer in z
length_z = H + 2 * buffer_z

# construct simulation size array
sim_size = (length_xy, length_xy, length_z)

Create Metalens Geometry#

Now we will define the structures in our simulation. We will first write a function to get the coordinates of the centers of the cylinders in the metalens.

[4]:

def get_cylinder_centers(diameter, spacing, full_circle: bool = False):
    r_eff = diameter / 2 - spacing / 2  # max radius of centers
    coords = np.arange(0, r_eff, spacing)

    if full_circle:
        coords = np.concatenate([-coords[::-1], coords])

    x, y = np.meshgrid(coords, coords)
    points = np.vstack((x.flat, y.flat)).T

    # Create a boolean mask for points within the circle
    mask = x**2 + y**2 <= r_eff**2

    # Apply the mask to get the final points
    return points[mask.flat]

[5]:

centers_quarter = get_cylinder_centers(diameter, S, full_circle=False)
centers_full = get_cylinder_centers(diameter, S, full_circle=True)
N = len(centers_full)

print(f"For a diameter of {diameter:.1f} µm, there are {N} cylinders.")
print(
    f"The metalens has an area of {np.pi * (diameter / 2) ** 2:.1f} µm² and a focal length of {focal_length:.1f} µm."
)

For a diameter of 10.0 µm, there are 780 cylinders.
The metalens has an area of 78.5 µm² and a focal length of 4.7 µm.

Let’s visualize the cell centers.

[6]:

fig, ax = plt.subplots(1, 1, tight_layout=True)
ax.scatter(*get_cylinder_centers(diameter, S, full_circle=True).T, s=1)
circle = plt.Circle((0, 0), diameter / 2, color="r", fill=False)
ax.add_artist(circle)
ax.set_xlabel("x (µm)")
ax.set_ylabel("y (µm)")
ax.set_aspect("equal")
plt.show()

../_images/notebooks_Autograd7Metalens_11_0.png

Now, we will start defining the structures in our simulation, starting with the substrate.

[7]:

substrate = td.Structure(
    geometry=td.Box.from_bounds(
        rmin=(-td.inf, -td.inf, -1000),
        rmax=(+td.inf, +td.inf, -H / 2),
    ),
    medium=SiO2,
)

aperture = [
    td.Structure(
        geometry=td.Box.from_bounds(
            rmin=(-td.inf, -td.inf, -H / 2),
            rmax=(+td.inf, +td.inf, -H / 2 + 0.2),
        ),
        medium=td.PECMedium(),
    ),
    td.Structure(
        geometry=td.Cylinder(
            center=(0, 0, 0),
            radius=diameter / 2 + buffer_xy / 4,
            length=H,
        ),
        medium=air,
    ),
]

And we will write a function to make a td.Structure containing a td.GeometryGroup with a td.Cylinder for each unit cell.

Note: while one could create a separate td.Structure for each td.Cylinder, using td.GeometryGroup leads to performance improvements, especially for the gradient processing.

[8]:

def make_cylinders(params):
    """Make the metalens unit cell structures."""
    # scale the parameters to be between rmin and rmax
    radii = rmin + (rmax - rmin) / (1 + np.exp(-params))

    geometries = []
    for r, (x, y) in zip(radii, centers_quarter):
        geometry = td.Cylinder(center=(x, y, 0), radius=r, length=H)
        geometries.append(geometry)
    geo_group = td.GeometryGroup(geometries=geometries)
    medium = td.Medium(permittivity=n_Si**2)

    return td.Structure(medium=medium, geometry=geo_group)

Define Source#

Now we define the incident fields. We simply use an x-polarized, normally incident plane wave with Gaussian time dependence centered at our central frequency. For more details, see the plane wave source documentation and the gaussian source documentation

[9]:

# time dependence of source
gaussian = td.GaussianPulse(freq0=f0, fwidth=fwidth, phase=0)

source = td.PlaneWave(
    source_time=gaussian,
    size=(td.inf, td.inf, 0),
    center=(0, 0, -H / 2 - buffer_z / 2),
    direction="+",
    pol_angle=0,
)

Define Monitors#

Now we define the monitor that measures field output from the FDTD simulation. For simplicity, we use measure the fields at the central frequency at the focal spot.

This will be the monitor that we use in our objective function.

We additionally define the monitor used for the far field projection here, but note that it is not included in the simulation as we will perform the far field projection locally using the near field monitor.

[10]:

monitor_near = td.FieldMonitor(
    center=(0, 0, H / 2 + buffer_z / 2),
    size=(td.inf, td.inf, 0),
    freqs=[f0],
    name="near_fields",
    colocate=False,
)

monitor_far = td.FieldProjectionAngleMonitor(
    center=monitor_near.center,
    size=monitor_near.size,
    freqs=monitor_near.freqs,
    name="far_fields",
    phi=[0],
    theta=[0],
    proj_distance=focal_length - buffer_z / 2,
    far_field_approx=False,
)

Create Simulation#

Now we can put everything together and define a Simulation object to be run.

Note: we add symmetry of (-1, 1, 0) to speed up the simulation by approximately 4x taking into account the symmetry in our source and dielectric function.

[11]:

def make_sim(params):
    structures = [substrate, *aperture]
    if params is not None:
        structures.append(make_cylinders(params))
    sim = td.Simulation(
        size=sim_size,
        structures=structures,
        sources=[source],
        monitors=[monitor_near],
        run_time=run_time,
        boundary_spec=td.BoundarySpec.all_sides(td.PML()),
        grid_spec=td.GridSpec.auto(min_steps_per_wvl=min_steps_per_wvl),
        symmetry=symmetry,
    )
    return sim

Let’s define some initial parameters for the metalens.

[12]:

params0 = np.zeros(len(centers_quarter))
sim = make_sim(params0)

Visualize Geometry#

Lets take a look and make sure everything is defined properly. Note that we see only the upper right quadrant of the metalens, but this will be reflected across the other quadrants to create the full metalens due to the symmetry that we specified in the simulation.

[13]:

fig, (ax1, ax2, ax3) = plt.subplots(1, 3, figsize=(14, 6))

sim.plot(x=0, ax=ax1)
sim.plot(y=0, ax=ax2)
sim.plot(z=-H / 2, ax=ax3)  # so we can see the aperture
plt.show()

../_images/notebooks_Autograd7Metalens_25_0.png

Objective Function#

Now that our simulation is set up, we can define our objective function over the td.SimulationData results.

We first write a function to take a td.SimulationData object and return the projected far field power.

Next, we write a function to

Set up our simulation given our design parameters.
Run the simulation through the adjoint run function.
Compute and return the intensity at the focal point.

[14]:

def measure_focal_power(sim_data: td.SimulationData) -> float:
    """Measures far field power at focal point."""
    projector = td.FieldProjector.from_near_field_monitors(
        sim_data=sim_data,
        near_monitors=[monitor_near],
        normal_dirs=["+"],
        pts_per_wavelength=None,
    )
    projected_fields = projector.project_fields(monitor_far)
    return projected_fields.power.sum().item()


def J(params) -> float:
    """Objective function, returns power at focal point as a function of params."""
    sim = make_sim(params)
    sim_data = web.run(sim, task_name="metalens_invdes", verbose=False)
    return measure_focal_power(sim_data)

Next, we use autograd to get a function returning the objective value and its gradient, given some parameters.

[15]:

dJ = value_and_grad(J)

And try it out.

[16]:

val, grad = dJ(params0)
print(val)
print(grad)

0.004137948605772573
[ 5.11895647e-04  5.33146407e-04  4.51338145e-04  3.78266813e-04
  4.14082619e-05 -2.75581521e-04 -3.57957714e-04 -2.01094870e-04
  1.28572633e-04  2.53150670e-04  4.35762401e-05 -1.69713945e-04
 -6.93649161e-05  1.41367352e-04  1.39177745e-04 -1.36550979e-05
  5.11687451e-04  5.43767108e-04  4.61974605e-04  3.55684877e-04
  2.76939112e-05 -3.12252495e-04 -3.81812351e-04 -1.82201362e-04
  1.70647619e-04  2.67162826e-04  1.13125323e-05 -2.07569785e-04
 -6.70616320e-05  1.57807284e-04  9.16375845e-05 -1.04076254e-04
  4.59495813e-04  5.20080773e-04  4.07560462e-04  2.40743451e-04
 -1.01773430e-04 -3.97350023e-04 -3.63533549e-04 -1.07653128e-04
  2.06410629e-04  2.60857306e-04 -2.34360402e-05 -2.19295420e-04
 -5.98902810e-05  1.74005807e-04  3.67529170e-05  3.66168158e-04
  4.01313066e-04  2.65743359e-04  3.15760936e-05 -2.70193629e-04
 -4.29457923e-04 -2.91438630e-04  3.94641980e-05  2.56080532e-04
  1.93385358e-04 -1.20618470e-04 -1.91944226e-04  2.22685959e-05
  1.87602247e-04  2.47681632e-05  1.79282442e-04  1.44161502e-04
 -3.08129879e-05 -2.39416564e-04 -4.20569719e-04 -4.24345511e-04
 -1.42965921e-04  1.72983758e-04  2.67565448e-04  7.00122451e-05
 -1.65524102e-04 -1.68458249e-04  1.05056688e-04  2.10598642e-04
  4.19693689e-05 -1.43425336e-04 -2.15726418e-04 -3.46832120e-04
 -4.42788578e-04 -4.05000100e-04 -2.56654243e-04  7.08810981e-05
  2.80721313e-04  1.93538288e-04 -6.87028086e-05 -1.95563272e-04
 -3.79865559e-05  1.61159406e-04  1.19542025e-04 -5.97298632e-05
 -4.09557539e-04 -4.63832634e-04 -4.69656562e-04 -4.47231015e-04
 -1.97142922e-04 -3.36718979e-06  2.55013758e-04  3.15523290e-04
  5.00021394e-05 -2.05235922e-04 -1.63954636e-04  1.02845072e-04
  1.85045570e-04  5.86658144e-05 -4.00463963e-04 -3.89002818e-04
 -2.41460843e-04 -1.28548189e-04  9.84135684e-05  3.03855785e-04
  3.15631377e-04  1.63813770e-04 -1.30776471e-04 -2.10686149e-04
  4.07213777e-06  2.14054964e-04  7.70195098e-05 -5.47731759e-05
 -1.29728772e-05  5.33507792e-05  1.93832403e-04  2.99192631e-04
  3.58228100e-04  3.82723435e-04  1.09050866e-04 -1.63773979e-04
 -2.23946550e-04 -7.62676665e-05  1.87114319e-04  1.79212846e-04
  9.10214463e-06  4.28370241e-04  4.38978904e-04  4.10972169e-04
  4.20681447e-04  2.01159916e-04  1.54750206e-05 -2.26615937e-04
 -2.34393035e-04 -5.92384195e-05  9.60685666e-05  1.90048357e-04
 -7.25694947e-05 -4.50823882e-05  2.92954934e-04  2.33124701e-04
  5.25767687e-05 -1.63014004e-04 -2.64638049e-04 -3.09838224e-04
 -2.24889834e-04 -5.76170497e-05  2.47432698e-04  2.20349450e-04
  2.00327951e-05 -1.43222953e-04 -2.63312593e-04 -3.81467766e-04
 -4.13518796e-04 -3.30183207e-04 -2.20230818e-04 -7.39850343e-05
  1.28809820e-04  3.14590184e-04  1.52050533e-04 -1.62826005e-04
 -1.31595042e-04 -1.81015833e-04 -2.12702308e-04 -1.92933246e-04
 -3.78136738e-05  1.91769044e-04  3.83963765e-04  2.29059110e-04
 -1.24757500e-06 -1.10719760e-04 -1.52759352e-04  2.98762368e-04
  2.67059969e-04  2.58281780e-04  3.08718772e-04  2.34635416e-04
  1.02684317e-04 -5.34253014e-05 -1.49312062e-04  2.75734668e-05
  4.84589962e-05  1.40690707e-04  1.07891436e-04 -6.34459932e-05
 -9.59790748e-05 -1.46801321e-04 -1.49245926e-04]

Normalize Objective#

To normalize our objective function value to something more understandable, we first run a simulation with no metalens to compute the focal point intensity in this case. Then, we construct a new objective function value that normalizes the raw intensity by this value, giving us an “intensity enhancement” factor. In this normalization, if our objective is given by “x”, it means that the intensity at the focal point is “x” times stronger with our design than with no structures at all.

[17]:

J_empty = J(None)

def J_normalized(params):
    return J(params) / J_empty

val_normalized = val / J_empty

dJ_normalized = value_and_grad(J_normalized)

print(val_normalized)

0.2322061552812715

Optimization#

With our objective function set up, we can now run the optimization.

As before, we will optax’s “adam” optimization with initial parameters of all zeros (corresponding to cylinders of radius rmax/2).

[18]:

# hyperparameters
num_steps = 10
learning_rate = 1e-1

# initialize adam optimizer with starting parameters
params = np.copy(params0)
optimizer = optax.adam(learning_rate=learning_rate)
opt_state = optimizer.init(params)

# store history
J_history = [val_normalized]
params_history = [params0]

for i in range(num_steps):
    # compute gradient and current objective function value
    value, gradient = dJ_normalized(params)

    # outputs
    print(f"step = {i + 1}")
    print(f"\tJ = {value:.4e}")
    print(f"\tgrad_norm = {np.linalg.norm(gradient):.4e}")

    # compute and apply updates to the optimizer based on gradient (-1 sign to maximize obj_fn)
    updates, opt_state = optimizer.update(-gradient, opt_state, params)
    params[:] = optax.apply_updates(params, updates)

    # save history
    J_history.append(value)
    params_history.append(params.copy())

step = 1
        J = 2.3221e-01
        grad_norm = 1.9278e-01

step = 2
        J = 1.6492e+00
        grad_norm = 5.1011e-01

step = 3
        J = 4.0145e+00
        grad_norm = 8.1819e-01

step = 4
        J = 6.4819e+00
        grad_norm = 1.1493e+00

step = 5
        J = 9.3541e+00
        grad_norm = 1.5721e+00

step = 6
        J = 1.3383e+01
        grad_norm = 2.2682e+00

step = 7
        J = 1.7127e+01
        grad_norm = 2.7479e+00

step = 8
        J = 1.8937e+01
        grad_norm = 2.6465e+00

step = 9
        J = 2.0945e+01
        grad_norm = 2.3762e+00

step = 10
        J = 2.2378e+01
        grad_norm = 3.2033e+00

[19]:

params_after = params_history[-1]

[20]:

plt.plot(J_history)
plt.xlabel("iterations")
plt.ylabel("objective function (focusing intensity enhancement)")
plt.show()

../_images/notebooks_Autograd7Metalens_37_0.png

[21]:

sim_before = make_sim(params0)
sim_after = make_sim(params_after)

[22]:

f, (ax1, ax2) = plt.subplots(1, 2)

sim_before.plot(z=0, ax=ax1)
sim_after.plot(z=0, ax=ax2)

plt.show()

../_images/notebooks_Autograd7Metalens_39_0.png

[23]:

sim_after_mnt_xy = td.FieldMonitor(
    center=(*monitor_near.center[:2], H / 2 + focal_length),
    size=monitor_near.size,
    freqs=monitor_near.freqs,
    fields=("Ex", "Ey", "Ez"),
    name="focal_fields_xy",
)
sim_after_mnt_xz = td.FieldMonitor(
    size=(0, td.inf, td.inf),
    freqs=monitor_near.freqs,
    fields=("Ex", "Ey", "Ez"),
    name="focal_fields_yz",
)
sim_after_mnt = sim_after.updated_copy(
    center=(0, 0, focal_length / 2),
    size=(length_xy, length_xy, focal_length + 4 * buffer_z),
    monitors=[sim_after_mnt_xy, sim_after_mnt_xz],
)
sim_after_mnt.plot(y=0)
plt.show()

../_images/notebooks_Autograd7Metalens_40_0.png

[24]:

sim_data_after_mnt = web.run(sim_after_mnt, task_name="meta_near_field_after")

13:56:14 CEST Created task 'meta_near_field_after' with task_id
              'fdve-1e8606f2-3390-410c-9234-5a45d5a59a91' and task_type 'FDTD'.

              View task using web UI at
              'https://tidy3d.simulation.cloud/workbench?taskId=fdve-1e8606f2-33
              90-410c-9234-5a45d5a59a91'.

              Task folder: 'default'.

13:56:16 CEST Maximum FlexCredit cost: 0.504. Minimum cost depends on task
              execution details. Use 'web.real_cost(task_id)' to get the billed
              FlexCredit cost after a simulation run.

13:56:17 CEST status = queued

              To cancel the simulation, use 'web.abort(task_id)' or
              'web.delete(task_id)' or abort/delete the task in the web UI.
              Terminating the Python script will not stop the job running on the
              cloud.

13:56:29 CEST status = preprocess

13:56:33 CEST starting up solver

              running solver

13:57:13 CEST early shutoff detected at 12%, exiting.

              status = postprocess

13:57:14 CEST status = success

13:57:16 CEST View simulation result at
              'https://tidy3d.simulation.cloud/workbench?taskId=fdve-1e8606f2-33
              90-410c-9234-5a45d5a59a91'.

13:57:18 CEST loading simulation from simulation_data.hdf5

[25]:

fig, (ax1, ax2) = plt.subplots(1, 2, tight_layout=True, figsize=(10, 4))
sim_data_after_mnt.plot_field("focal_fields_xy", field_name="E", val="abs^2", vmax=105, ax=ax1)
sim_data_after_mnt.plot_field("focal_fields_yz", field_name="E", val="abs^2", vmax=180, ax=ax2)
plt.show()

../_images/notebooks_Autograd7Metalens_42_0.png

Conclusions#

We notice that our metalens does quite well at focusing at this high NA! For the purposes of demonstration, this is quite a small device, but the same the same principle can be applied to optimize a much larger metalens.

For more case studies using autograd support in tidy3d, see the