# **Regular Articles**

# Ultralow Latency Optical Logic Operations with an Ultrasmall Silicon Wire $\Psi$ Gate

# Shota Kita, Kengo Nozaki, Kenta Takata, Akihiko Shinya, and Masaya Notomi

# Abstract

Electronic computation circuits are composed of logic gates. However, due to the exponential increase in wiring resistance of miniaturized electronic components, both the increase in latency and energy consumption is becoming a serious problem. To overcome the electronic bottleneck, we developed an ultrasmall silicon-wire multimode interferometer called a  $\Psi$  (*psi*) gate for low-loss and high-speed optical logic operation. With a single  $\Psi$  gate, we can carry out Boolean logic operations (OR, XOR, AND, XNOR, NOR, and NAND) in telecom wavelength (1535–1565 nm) with an ultralow latency of ~30 fs. Such gates are expected to be used in novel photonics-electronics convergence processors for ultralow latency pattern matching and vector operations for photonic neural network applications.

Keywords: optical logic gates, linear optics, silicon photonics

## 1. Introduction

Data processing infrastructures based on complementary metal oxide semiconductors (CMOS) have seen continuous growth owing to the huge progress in semiconductor fabrication technologies. However, increases in leakage current and wiring resistance due to the miniaturization of electronic circuits will put an end to this continuous growth in the near future [1]. Accordingly, we are seeing accelerated exploration of novel technologies in all domains of science and technology towards the next generation of scalable data processing infrastructure. Moreover, latency has been worsening because the increase in wiring resistance also limits the response speed of electronic circuits (known as resister-capacitor time constant), which would limit the future development of applications related to communications security, real-time control, financial transactions, and so on. Note that latency can be reduced by inserting repeaters in electronic wires. This method is not energy efficient and would not help to overcome the traditional trade-off between latency and energy consumption.

It is expected that this latency problem can be drastically mitigated using light for processing not only for communication because an optical signal can propagate through photonic circuits at the speed of light. Additionally, if the elemental photonic devices comprising a photonic circuit become smaller, the total optical pass length in the circuit will become shorter. Thus, further reducing device size lowers latency [2]. Our group developed high-performance micro/nanophotonic devices, e.g., nanolasers [3], nanophotodetectors [4], nanomodulators, and optical transistors [5], which should be fundamental building blocks for fabricating low-latency nanophotonic processors. We also developed the first high-performance ultralow-latency optical logic gate based on lightinterference, called a  $\Psi(psi)$  gate based on its shape (**Fig. 1**(**a**)), as another building block [6].

In this article, we introduce the concept and a brief theoretical background of  $\Psi$  gates and the experimental demonstration of various optical logic operations with a single  $\Psi$  gate. We also present preliminary



Fig. 1. A Si wire  $\Psi$  gate. (a) An illustration of  $\Psi$  gate operation. (b) An example of simulated AND logic operation ( $\lambda = 1.54 \mu m$ ). The horizontal white bar at bottom-right inset indicates a 1- $\mu m$ -long scale bar.

on-chip integration of a  $\Psi$  gate and other photonic components for stable operation. Next, we show how to use  $\Psi$  gates for optical multibit AND circuits and estimate latency, which is 10 times lower than the best case of CMOS electronic circuits. Finally, we conclude with the potential of using  $\Psi$  gates in ultralow-latency photo-electronic-converged accelerators.

### **2.** Logic operations with a silicon wire $\Psi$ gate

Light-interference is a linear phenomenon, so it has been too primitive to carry out various logic operations, and the functions and performance were limited. However, almost all representative Boolean logic operations can be carried out with a single linear gate by introducing the concept of bias light and adjusting the input conditions [7]. Such multiple interference systems can be implemented using silica planer lightwave circuits [8], silicon (Si) photonics [9], and plasmonics [10]. The footprint of the gates is crucial for dense integration. Therefore, if we only need to make it as small as possible, we should choose plasmonics as the platform. However, we need to consider insertion loss to consider the configuration for multibit operations (discussed in Section 5). We found that the most promising platform is Si photonics.

Our  $\Psi$  gate (Fig. 1(a)) has three input ports and one output port (the  $\Psi$  gate is a part of a 3 × 3 interferometer with two extra hidden radiation ports). Two of them are signal ports (A and B), and the center one is a bias port (denoted as Bias). The two sequences of the intensity modulated signals are input into A and B (relative input powers of both signals  $P_A$  and  $P_B$  are varied between  $P_0$  and  $P_1$ ) with a fixed relative phase relationship. In contrast, the intensity of the bias light is fixed at  $P_{\text{Bias}}$ . The optical output signal power  $P_{\text{out}}$  through a  $\Psi$  gate is then given as follows [6];

$$P_{\text{out}} = (\sqrt{P_{\text{A}}T_{\text{A}}} + \sqrt{P_{\text{B}}T_{\text{B}}} \cos\Delta\Phi + \sqrt{P_{\text{Bias}}T_{\text{Bias}}} \cos\Delta\Phi_{\text{Bias}})^{2} + (\sqrt{P_{\text{B}}T_{\text{B}}} \sin\Delta\Phi + \sqrt{P_{\text{Bias}}T_{\text{Bias}}} \sin\Delta\Phi_{\text{Bias}})^{2},$$
(1)

where  $T_X$  is the transmittance from each input port X to the output port (satisfying  $T_A + T_B + T_{\text{Bias}} \le 1$  due to the linearity), and  $\Delta \Phi$  and  $\Delta \Phi_{\text{Bias}}$  are the relative output phases of B and Bias to the output of A, respectively. Within this degree of freedom, for example, we can implement an optical AND logic operation, as shown in **Fig. 1(b)**.

In this situation, we set  $T_A = T_B \sim 0.39$ ,  $T_{Bias} \sim 0.20$  $(T_{\rm A} + T_{\rm B} + T_{\rm Bias} \sim 0.98), \Delta \Phi = 0, \Delta \Phi_{\rm Bias} = \pi, \text{ and } P_{\rm Bias}$ ~ 0.48 for the maximum binary contrast (BC) of 9.54 dB [7]. From the simulated intensity distributions of Fig. 1(b), the operation result appears just after the multiple interference part of the  $\Psi$  gate. This means the AND operation is carried out by just passing the light through the 3- $\mu$ m-long  $\Psi$  gate. Therefore, the physically limited computation latency of a single AND operation is ~30 fs. This latency is more than 100 times lower than that of CMOS electronics (~10 ps). From the  $P_{out}$  of (A, B) = (1, 1), we define signal loss (SL) as  $10 \log_{10} (P_{out}/P_1)$  (SL becomes 0 dB when  $P_{\text{out}} = P_1 = 1$ ). By using the Si photonics platform, we can obtain SL < 0.5 dB. Even if we try to fabricate a similar interferometer based on loss-less half mirrors, SL becomes  $\sim 1.25$  dB. As far as we know, there have been no reports on optical logic gates with such low SL. The required relative bias power  $P_{\text{Bias}}/P_1$  for the maximum BC for an AND operation can be derived from Eq. (1) as follows;

|  | P <sub>A</sub> | P <sub>B</sub> | $\Delta \Phi$               | 0              | π              | 2π/3           | 0                          |                          |                            | 2π/3                     |
|--|----------------|----------------|-----------------------------|----------------|----------------|----------------|----------------------------|--------------------------|----------------------------|--------------------------|
|  |                |                | $P_{\text{Bias}}/P_{1}$     | 0              |                |                | $T_{\rm A}/4T_{\rm Bias}$  | $T_{\rm A}/T_{\rm Bias}$ | $9T_A/4T_{Bias}$           | $T_{\rm A}/T_{\rm Bias}$ |
|  |                |                | $\Delta \Phi_{\text{Bias}}$ | -              |                |                | π                          |                          |                            | -2π/3                    |
|  | 0              | 0              |                             | 0              | 0              | 0              | $T_{\rm A}/4$              | TA                       | 9 <i>T</i> <sub>A</sub> /4 | T <sub>A</sub>           |
|  | 0              | 1              |                             | T <sub>A</sub> | TA             | TA             | $T_{A}/4$                  | 0                        | T <sub>A</sub>             | TA                       |
|  | 1              | 0              |                             | T <sub>A</sub> | T <sub>A</sub> | T <sub>A</sub> | $T_{\rm A}/4$              | 0                        | T <sub>A</sub>             | T <sub>A</sub>           |
|  | 1              | 1              |                             | $4T_A$         | 0              | T <sub>A</sub> | 9 <i>T</i> <sub>A</sub> /4 | T <sub>A</sub>           | T <sub>A</sub>             | 0                        |
|  |                |                |                             |                | XOR            | OR             | AND                        | XNOR                     | NOR                        | NAND                     |
|  |                | ВС             | [dB]                        | 6              | 00             | 00             | 9.5                        | 00                       | 9.5                        | 00                       |

Table 1. Optical input/output table for linear optical logic gate operations in silicon wire  $\Psi$  gates with and without bias input.

$$\frac{P_{\text{Bias}}}{P_1} = \frac{T_A}{4T_{\text{Bias}}} \left(1 + 3\sqrt{\frac{P_0}{P_1}}\right)^2 \text{ (for AND). (2)}$$

Here,  $P_{\text{Bias}}/P_1$  is determined from the ratio of  $T_A$  and  $T_{\text{Bias}}$ . Larger  $T_A$  gives larger  $P_{\text{out}}$  (smaller *SL*); on the other hand,  $P_{\text{Bias}}/P_1$  becomes larger (larger total input power into a  $\Psi$  gate). In the case of Fig. 1(b), we balance  $T_A$  and  $T_{\text{Bias}}$  according to the minimization of net loss [6]. By keeping the same  $\Delta\Phi$  and  $\Delta\Phi_{\text{Bias}}$ ,  $P_{\text{Bias}}/P_1$  for XNOR and NOR operations can be derived as follows;

$$\frac{P_{\text{Bias}}}{P_1} = \frac{T_A}{T_{\text{Bias}}} \left(1 + \sqrt{\frac{P_0}{P_1}}\right)^2 \text{ (for XNOR), (3)}$$
$$\frac{P_{\text{Bias}}}{P_1} = \frac{T_A}{4T_{\text{Bias}}} \left(3 + \sqrt{\frac{P_0}{P_1}}\right)^2 \text{ (for NOR). (4)}$$

This means we can switch AND, XNOR, and NOR logic operations by only adjusting  $P_{\text{Bias}}$ . All the representative logic operations can be implemented, as shown in **Table 1**, by adjusting the input conditions. In terms of cascade connection, it only allows linearly separable combinations (e.g., multibit AND, multibit NOR, etc.). To expand functionality, it is necessary to apply some nonlinearity by inserting optical-electronic-optical conversions through optical transistors [5].

### 3. High-speed optical logic operations

A sample of a silicon wire  $\Psi$  gate was fabricated on a silicon-on-insulator (SOI) substrate with a 220-nmthick top Si layer. An e-beam resist was spin-coated on the substrate, and the resist pattern was formed after e-beam writing and development. The resist pattern was transferred to the Si layer by using dry etching. After the resist was removed, polymer spot size converters [11] were fabricated for the input/output ports by using 2nd e-beam lithography.

Figure 2 illustrates our setup for demonstrating high-speed logic operations. We used several off-chip optical components connected with several fiber patch cords. However, the relative phases of the input signals are always affected by phase fluctuations due to the mechanical vibrations of the fibers. To eliminate such fluctuation, we used fiber stretchers controlled using a digital proportional integral differential (PID) regulator. First, the laser light was split into two using a tunable coupler. One is intensity-modulated input for signal ports A and B, and the other is an invariant input for the bias port. To generate arbitral optical bit patterns, an optical transmitter connected to a pseudo random pulse pattern generator with a radio-frequency signal multiplexer was used (up to 64 Gbit/s). The generated optical bit patterns were separated into two by using a 3-dB coupler. After intensity tuning with a multi-channel variable optical attenuator, the bottom-side signal was delayed using a fiber delay line so that the  $\Psi$  gate under the test effectively had two different pseudo random bit patterns for signal ports A (top-side) and B (bottomside). All the input and output lights were coupled to the SOI chip by using a lens module and lensed fiber. The output light was amplified through an erbiumdoped fiber amplifier. After removing the amplified spontaneous emission noise by using an optical bandpass filter, the output bit patterns were observed using an optical sampling oscilloscope with a bandwidth of 70 GHz. For the optical phase lock loops between signals and bias light, we obtained an individual phase difference between signals and a bias light (the bias channel is set as the phase standard). The phase of the bias light was weakly modulated using the fiber-input phase shifter (PS) with 200 kHz for the dithering. The converted electrical signals through two channels of the photo receiver were input to a



Fig. 2. Experimental setup for verifying high-speed operation of  $\Psi$  gates. The inset shows a scanning electron microscope image of the measured  $\Psi$  gate.

digital PID regulator. This PID regulator includes a high-voltage amplifier for directly driving the fiber stretcher.

Figure 3 summarizes various logic operations obtained from a single  $\Psi$  gate. The output power of the tunable laser diode was set to 8 dBm for the demonstration. Figure 3(a) shows the input and output optical time waveforms of 10-Gbit/s XOR, OR, AND, XNOR, and NOR logic operations with proper input conditioning according to Table 1. We clearly demonstrated the basic concept of the linear optical logic operations. The XOR and OR operations do not require the bias input. In our demonstration, however, a fraction of P<sub>Bias</sub> was required for optical phase locking. The AND, XNOR, and NOR operations exhibit *BC* of ~9.3 dB, > 10 dB, and ~9.0 dB with  $P_{\text{Bias}}/P_1 \sim$ 0.62, 1.6, and 3.1, respectively. The experimental  $P_{\text{Bias}}/P_1$  fits well to the analytical prediction of Eqs. (2)–(4) by taking into account the finite extinction ratio (*ER*) of the input signals (*ER* =  $10 \log_{10}(P_1/P_0)$ ) ~ 13–23 dB). Figure 3(b) shows the case for 40-Gbit/s pseudo random bit sequence AND and NOR operations. Both operations seemed to work with some degradation because of the finite rise/fall time of the input signals (~30 ps). However, the  $\Psi$  gate never became the bottleneck of the operation bandwidth because of the quite flat spectral response over 100 nm in the wavelength [6]. Figure 3(c) illustrates a

10-Gbit/s wavelength insensitive AND operation in telecom wavelength (1535–1565 nm). By calibrating  $P_{\text{Bias}}/P_1$  for each  $\lambda$ , a similar BC was achieved for all the tested input wavelengths. In this demonstration, the input signal bit sequences and operations were the same for all  $\lambda$ . It should be noted, however, that we can simultaneously carry out different operations for each wavelength channel by varying the input conditions (that is, totally independent parallel logic operations are possible up to the number of input channels with a single  $\Psi$  gate). This is unique and benefits our linear gates compared to nonlinear gates. Note that the demonstrated input wavelength range was actually limited by our setup, and the flatness of the transmission spectra and optical damage threshold should be improved by applying further structural optimization of Si wire  $\Psi$  gates. Ultimately, massive-parallel operations are expected to be carried out with more than several dozen wavelength channels in the 1400-1600 nm wavelength range.

## 4. Preliminary on-chip integration with PSs

Towards practical applications, it is important to show the feasibility of the on-chip integration to demonstrate the phase stability without any optical phase lock loops. Therefore, we also fabricated a Si  $\Psi$  gate with several thermo-optic PSs, as shown in **Fig. 4(a)**.



Fig. 3. Experimental linear optical logic operations observed with a single Ψ gate. (a) Time waveforms of optical input signals and outputs demonstrating 10-Gbit/s XOR, OR, AND, XNOR, and NOR logic operations. (b) Time waveforms of optical input signals and outputs demonstrating 40-Gbit/s pseudo random bit sequence AND and NOR logic operations. (c) Time waveforms of optical input signals and outputs demonstrating 10-Gbit/s AND logic operations. Colors denote different input wavelengths (purple: 1535 nm, blue: 1540 nm, light blue: 1545 nm, green: 1550 nm, orange: 1555 nm, red-orange: 1560 nm, red: 1565 nm).

For each arm, a pair of a Mach-Zehnder modulator (MZM) and PS are inserted to generate signals A and B, and  $P_{\text{Bias}}$ ,  $\Delta\Phi$  and  $\Delta\Phi_{\text{Bias}}$  are adjusted. After conditioning all nine heaters, each MZM was operated with two different frequencies (2 and 1 kbit/s). Then nearly ideal AND and NOR operations with a *BC* of ~9.5 dB were demonstrated, as shown in **Fig. 4(b)**. All the phase modulators (PMs) should be replaced with other shorter (10–50 µm) and faster (40–100 GHz) PMs [12] to achieve both much lower latency and higher throughput.

# 5. Multibit AND circuit by cascading $\Psi$ gates

**Figure 5(a)** illustrates an optimized multibit AND circuit (8-bit) in terms of latency. In this configuration, we need to use a specific  $\Psi$  gate with a 1:1:1 splitting (combining) ratio that enables truly loss-less AND operation (if  $P_A = P_B = 1$ ,  $P_{\text{Bias}} = 1$ ,  $\Delta \Phi_{\text{Bias}} = 0$ , then  $P_{\text{out}} = 3$ , which means no radiation loss) with phase-bit operation (when the digital input is "0" or

"1," the corresponding PM modulates the relative phases  $\Delta \Phi_A$  and  $\Delta \Phi_B$  between  $\pi$  (out-of-phase) and 0 (in-phase)). Therefore, all the signal ports connect with PMs directly (no MZMs). However, this phasebit operation is only available for the first stage because the output of the phase-shift keying operation is not phase bit but amplitude bit. For the intermediate in-phase combining, Y gates are used. At the final stage (and the first splitting stage), the  $\Psi$  gates with a 2:1:2 combining (splitting) ratio are optimal for 8-bit operation to minimize insertion loss (note that the optimal ratio for other bit-number operations is different). The combined optical signal is converted by the electrical signal and digitized through a lowlatency threshold operator such as a sense amplifier [13]. It should be noted that the BC of the multibit AND operation degrades with the number of the cascade connection. However, the sense amplifier can operate even with a small BC if the absolute output intensity difference between fully matched (all the digital input are "1") and unmatched (more than one



Fig. 4. Preliminary on-chip integration of a Ψ gate with three MZMs, three PSs, and a Ψ gate. (a) Optical microscope image of the whole circuit. (b) Time waveforms of optical input signals and outputs demonstrating AND and NOR logic operations.



Fig. 5. Multibit AND operations using cascaded Ψ gates. (a) Schematic diagram optimized for 8-bit operation. (b) Estimated latency including photoelectric conversion against a different bit scale. In the simulation, a very small capacitance photodiode was assumed according to K. Nozaki et al. [15]. To estimate the optical pass delay, we assumed the input PM length of 200 µm, Ψ gate length of 5 µm, spacing between input PMs of 10 µm (this number is possible when we use carrier effect PMs, etc.), and bending radius of a Si waveguide of 20 µm.

input is "0" etc.) cases is large enough (typically a difference of 10–100  $\mu$ W is acceptable). Therefore, the *BC* does not directly limit the feasibility of multibit AND operation. A very similar method was also proposed and demonstrated in CMOS logic circuits based on the current-race method [14]. In the current-race method, the currents from the multiple bit channels ("0" or "1" corresponds to zero or nonzero current output for each bit channel, respectively) are combined through electronic wires, and the combined current is digitized through a sense amplifier. In the

case of our photonic method, the signal-combining time can be much shorter than the case of electronics, as we mentioned above. On the other hand, we have an overhead of optoelectric conversion (charging) delay due to the resistance-capacitance time constant of the used photodetector, but this could be drastically mitigated using an ultralow-capacitance (~1 fF) photodetector [15], as discussed in the next paragraph.

The computation latency of multibit AND circuits was estimated, as shown in **Fig. 5(b)**. We compared

the configurations with and without using  $\Psi$  gates and used an ultralow-capacitance photodetector with 1 fF [15]. For instance, the estimated latency for N =128 (practical level) is ~14.6 ps. The additional charging delay of  $w/\Psi$  gates always becomes half of only Y gates thanks to twice the BC (30% reduction under the same input power of 1 mW). This latency is already 10 times lower than that of the fastest CMOS 128-bit AND circuits [16]. We can also apply wavelength division multiplexing by using more than two input lasers with different wavelengths and corresponding wavelength-selective (resonator-type) electro-optic PMs, further decreasing latency. Although it is also possible to use repeaters for latency compensation, it is no longer energy efficient. The example of this multibit AND circuit application indicates that photonic processing would break the traditional trade-off between latency and energy consumption in CMOS electronics.

### 6. Conclusion

We experimentally demonstrated ultrashort, simple, integrable, and wavelength insensitive Si wire  $\Psi$ gates towards ultralow-latency photonic processing. From the gate length, the minimum computation latency of single logic operation is only ~30 fs, which is more than 100 times lower than that of CMOS electronics. The optical signal loss is also much lower than any other optical gates. The operation function can be switched among XOR, OR, AND, XNOR, NOR, and NAND with a single  $\Psi$  gate by tuning input conditions. We also verified the feasibility of on-chip integration for stable operation. Finally, we suggested the original configuration for multibit AND circuits by cascading  $\Psi$  and Y gates. By using this configuration, 128-bit AND operation can be carried out at 10 times lower latency than cutting-edge CMOS electronics. Latency can be further lowered by combining with multibit AND circuits and wavelength division multiplexing with a larger number of wavelength channels and by using smaller PMs based on micro rings.

We gave an example of logic operations based on linear optical gates. However, linear optical gates can perform specific vector-matrix operations (transformations) without consuming energy. The demonstrated digital-like logic operations are just a specific case of these operations. Therefore, it is expected that low-latency, low-power consumption linear-gatebased vector-matrix operations will be more crucial for photo-electronic-converged artificial-neural-network-accelerator applications [17].

### References

- K. Rupp, "42 Years of Microprocessor Trend Data," 2018. https://www.karlrupp.net/2018/02/42-years-of-microprocessortrend-data/
- [2] T. Ishihara, A. Shinya, K. Inoue, K. Nozaki, and M. Notomi, "An Integrated Optical Parallel Adder as a First Step towards Light Speed Data Processing," Proc. of the 13th International SoC Design Conference (ISOCC 2016): Smart SoC of Intelligent Things, pp. 123–124, Jeju, South Korea, Oct. 2016.
- [3] K. Takeda, T. Sato, A. Shinya, K. Nozaki, W. Kobayashi, H. Taniyama, M. Notomi, K. Hasebe, T. Kakitsuka, and S. Matsuo, "Few-fJ/bit Data Transmissions Using Directly Modulated Lambdascale Embedded Active Region Photonic-crystal Lasers," Nat. Photon., Vol. 7, pp. 569–575, 2013.
- [4] K. Nozaki, S. Matsuo, T. Fujii, K. Takeda, M. Ono, A. Shakoor, E. Kuramochi, and M. Notomi, "Photonic-crystal Nano-photodetector with Ultrasmall Capacitance for On-chip Light-to-voltage Conversion without an Amplifier," Optica, Vol. 3, No. 5, pp. 483–492, 2016.
- [5] K. Nozaki, S. Matsuo, T. Fujii, K. Takeda, A. Shinya, E. Kuramochi, and M. Notomi, "Femtofarad Optoelectronic Integration Demonstrating Energy-saving Signal Conversion and Nonlinear Functions," Nat. Photon., Vol. 13, pp. 454–459, 2019.
- [6] S. Kita, K. Nozaki, K. Takata, A. Shinya, and M. Notomi, "Ultrashort Low-loss Ψ Gates for Linear Optical Logic on Si Photonics Platform," Commun. Phys., Vol. 3, 33, 2020.
- [7] C. Peng, J. Li, H. Liao, Z. Li, C. Sun, J. Chen, and Q. Gong, "Universal Linear-optical Logic Gate with Maximal Intensity Contrast Ratios," ACS Photon., Vol. 5, No. 3, pp. 1137–1143, 2018.
- [8] A. Himeno, K. Kato, and T. Miya, "Silica-based Planar Lightwave Circuits," IEEE J. Sel. Top. Quantum Electron., Vol. 4, No. 6, pp. 913–924, 1998.
- [9] R. Soref and J. Larenzo, "All-silicon Active and Passive Guided-wave Components for  $\lambda = 1.3$  and 1.6  $\mu$ m," IEEE J. Quantum Electron., Vol. 22, No. 6, pp. 873–879, 1986.
- [10] M. I. Stockman, "Nanofocusing of Optical Energy in Tapered Plasmonic Waveguides," Phys. Rev. Lett., Vol. 93, No. 13, p. 137404, 2004.
- [11] T. Shoji, T. Tsuchizawa, T. Watanabe, K. Yamada, and H. Morita, "Low Loss Mode Size Converter from 0.3 μm Square Si Wire Waveguides to Singlemode Fibres," Electron. Lett., Vol. 38, No. 25, pp. 1669–1670, 2002.
- [12] T. Hiraki, T. Aihara, K. Hasebe, K. Takeda, T. Fujii, T. Kakitsuka, T. Tsuchizawa, H. Fukuda, and S. Matsuo, "Heterogeneously Integrated III-V/Si MOS Capacitor Mach-Zehnder Modulator," Nat. Photon., Vol. 11, pp. 482–485, 2017.
- [13] B. Wicht, T. Nirschl, and D. Schmitt-Landsiedel, "Yield and Speed Optimization of a Latch-type Voltage Sense Amplifier," IEEE J. Solid-State Circuits, Vol. 39, No. 7, pp. 1148–1158, 2004.
- [14] I. Arsovski and A. Sheikholeslami, "A Mismatch-dependent Power Allocation Technique for Match-line Sensing in Content-addressable Memories," IEEE J. Solid-State Circuits, Vol. 38, No. 11, pp. 1958– 1966, 2003.
- [15] K. Nozaki, S. Matsuo, A. Shinya, and M. Notomi, "Amplifier-free Bias-free Receiver Based on Low-capacitance Nanophotodetector," IEEE J. Sel. Top. Quant., Vol. 24, No. 2, 4900111, 2018.
- [16] A. Agarwal, S. Hsu, S. Mathew, M. Anders, H. Kaul, F. Sheikh, and R. Krishnamurthy, "A 128x128b High-speed Wide-and Match-line Content Addressable Memory in 32nm CMOS," Proc. of the 37th European Solid-State Circuits Conference (ESSCIRC 2011), pp. 83–86, Helsinki, Finland, Sept. 2011.
- [17] Y. Shen, N. Harris, S. Skirlo, M. Prabhu, T. Baehr-Jones, M. Hochberg, X. Sun, S. Zhao, H. Larochelle, D. Englund, and M. Soljačić, "Deep Learning with Coherent Nanophotonic Circuits," Nat. Photon., Vol. 11, pp. 441–446, 2017.



#### Shota Kita

Senior Researcher, Photonic Nanostructure Research Group of NTT Basic Research Laboratories and NTT Nanophotonics Center. He received a B.E., M.E., and Ph.D. in engi-

nering from Yokohama National University in 2007, 2009, and 2012. He was a postdoc researcher in Lončar's laboratory at Harvard University, USA, for 3 years. He returned to Japan and joined Notomi's laboratory at NTT Basic Research Laboratories as a research associate, where he is investigating nanophotonic devices and circuits. His interests are in nanofabrication and packaging technologies. He received the Poster Presentation Award in iNOW 2009 and Young Scientist Presentation Award from the Japan Society of Applied Physics (JSAP) in 2010. He is a member of JSAP, the Institute of Electronics, Information and Communication Engineers (IEICE), and Optical Society (OSA).

#### Kengo Nozaki

Distinguished Researcher, Photonic Nanostructure Research Group of NTT Basic Research Laboratories and NTT Nanophotonics Center. He received a B.E., M.E., and Ph.D. in electri-

The received a B.E., M.E., and Ph.D. In electrical engineering from Yokohama National University in 2003, 2005, and 2007. He joined NTT Basic Research Laboratories in 2008. His current interests are ultralow-power nanophotonic crystals. He received the Best Paper Award at Photonics in Switching 2012, Young Researchers Award from IEICE Lasers and Quantum Electronics in 2014, Best Paper Award from OECC/PS in 2016, and Commendation for Science and Technology by the Minister of Education, Culture, Sports, Science and Technology of Japan (the Young Scientists' Prize) in 2019. He is a member of JSAP.

#### Kenta Takata

Research Scientist, Photonic Nanostructure Research Group, Optical Science Laboratory, NTT Basic Research Laboratories.

He received a B.S. in information and communication engineering in 2010 from the University of Tokyo for studying band structures of the ferromagnetic semiconductor GaMnAs and an M.S. and Ph.D. in information science and technology in 2012 and 2015 from the same university for proposing and developing optical com-puting schemes based on photonic oscillator networks. From 2012 to 2015, he was a research fellow (DC1) at the Japan Society for the Promotion of Science. In 2013, he was a visiting student researcher at Stanford University, CA, USA. He joined NTT Basic Research Laboratories in 2015. Since then, his research interest has been introducing emergent concepts in quantum and condensed matter physics such as exceptional points and topology to nanophotonics and to explore their applications. He received the Young Scientist Presentation Award from JSAP in 2017. He is a member of JSAP, the American Physical Society (APS), OSA, and the Physical Society of Japan.



#### Akihiko Shinya

Group Leader, Senior Research Scientist, Supervisor, Photonic Nanostructure Research Group of NTT Basic Research Laboratories and NTT Nanophotonics Center. He received a B.E., M.E., and Ph.D. in electri-

He received a B.E., M.E., and Ph.D. in electrical engineering from Tokushima University in 1994, 1996, and 1999. In 1999, he joined NTT Basic Research Laboratories, where he has been engaged in research and development of photonic crystal devices. He is a member of JSAP and the Laser Society of Japan.



#### Masaya Notomi

Senior Distinguished Scientist, Photonic Nanostructure Research Group of NTT Basic Research Laboratories. Project leader of NTT Nanophotonics Center.

He received a B.E., M.E., and Ph.D. in applied physics from the University of Tokyo in 1986, 1988, and 1997. He joined NTT in 1988, where his research has focused on controlling the optical properties of materials/devices by using artificial nanostructures (quantum wires/dots and photonic crystals). In addition to his work at NTT, he has also been a professor at the Department of Physics, Tokyo Institute of Technology since 2017. He received the IEEE/LEOS Distinguished Lecturer Award (2006) from the Institute of Electrical and Electronics Engineers/Lasers & Electro-Optics Society, JSPS prize (2009) from the Japan Society for the Promotion of Science, a Japan Academy Medal (2009), and the Commendation for Science and Technology by the Minister of Education, Culture, Sports, Science and Technology of Japan (2010). He is serving as a member of National University Corporation Evaluation Committee in the Japanese government. He is an IEEE Fellow and a member of JSAP, APS, and OSA.

