Reprogrammable Electro-Optic Nonlinear Activation Functions for Optical Neural Networks

Ian A.D. Williamson, Tyler W. Hughes, Momchil Minkov, Ben Bartlett, Sunil Pai, Shanhui Fan

Research output: Contribution to journalArticlepeer-review

Abstract

We introduce an electro-optic hardware platform for nonlinear activation functions in optical neural networks. The optical-to-optical nonlinearity operates by converting a small portion of the input optical signal into an analog electric signal, which is used to intensity -modulate the original optical signal with no reduction in processing speed. Our scheme allows for complete nonlinear ON-OFF contrast in transmission at relatively low optical power thresholds and eliminates the requirement of having additional optical sources between each of the layers of the network Moreover, the activation function is reconfigurable via electrical bias, allowing it to be programmed or trained to synthesize a variety of nonlinear responses. Using numerical simulations, we demonstrate that this activation function significantly improves the expressiveness of optical neural networks, allowing them to perform well on two benchmark machine learning tasks: learning a multi-input exclusive-OR (XOR) logic function and classification of images of handwritten numbers from the MNIST dataset. The addition of the nonlinear activation function improves test accuracy on the MNIST task from 85% to 94%.

Original languageEnglish
Article number7700412
JournalIEEE Journal of Selected Topics in Quantum Electronics
Volume26
Issue number1
DOIs
Publication statusPublished - Jan 2020
Externally publishedYes

Keywords

  • Optical neural networks
  • electro-optic modulators
  • feedforward neural networks
  • intensity modulation
  • machine learning
  • neuromorphic computing
  • nonlinear optics
  • phase modulation
  • photodetectors

Fingerprint

Dive into the research topics of 'Reprogrammable Electro-Optic Nonlinear Activation Functions for Optical Neural Networks'. Together they form a unique fingerprint.

Cite this