Accelerating Artificial Intelligence with Programmable Silicon Photonics

Advances in the fabrication of large-scale integrated silicon photonics have sparked interest in optical systems that process information at high speeds with ultra-low energy consumption. Photonic systems, which have historically been used for optical telecommunications, have recently been demonstra...

Cur síos iomlán

Sonraí bibleagrafaíochta
Príomhchruthaitheoir: Bandyopadhyay, Saumil
Rannpháirtithe: Englund, Dirk R.
Formáid: Tráchtas
Foilsithe / Cruthaithe: Massachusetts Institute of Technology 2023
Rochtain ar líne:https://hdl.handle.net/1721.1/151430
_version_ 1826198365562470400
author Bandyopadhyay, Saumil
author2 Englund, Dirk R.
author_facet Englund, Dirk R.
Bandyopadhyay, Saumil
author_sort Bandyopadhyay, Saumil
collection MIT
description Advances in the fabrication of large-scale integrated silicon photonics have sparked interest in optical systems that process information at high speeds with ultra-low energy consumption. Photonic systems, which have historically been used for optical telecommunications, have recently been demonstrated to accelerate tasks in quantum simulation, artificial intelligence, and combinatorial optimization. This thesis reports work towards the goal of realizing large-scale programmable photonic systems for information processing: 1) we develop deterministic error correction algorithms for programmable photonic systems, whose capabilities are believed to be limited by fabrication error, showing that these systems can be programmed to implement accurate linear matrix processing suitable for deep neural networks at scales of up to hundreds of channels; 2) we describe a new paradigm for coupling large numbers of optical channels to photonic circuits with exceptionally high alignment tolerance, enabling the use of high-volume, low-precision electronic pick-and-place equipment for photonic assembly; and 3) we design, fabricate, and demonstrate the first single-chip, end-to-end photonic processor for deep neural networks. This fully-integrated coherent optical neural network (FICONN), which monolithically integrates multiple optical processor units for matrix algebra and nonlinear activation functions into a single chip, implements single-shot coherent optical processing of a deep neural network with sub-nanosecond latency. On-chip, in situ training of a deep neural network is demonstrated on this system, obtaining high accuracies on a vowel classification task comparable to that of a digital system. Our results open the path towards integrated, large-scale photonic processors for low-latency inference and training of deep neural networks.
first_indexed 2024-09-23T11:03:43Z
format Thesis
id mit-1721.1/151430
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T11:03:43Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1514302023-08-01T03:59:18Z Accelerating Artificial Intelligence with Programmable Silicon Photonics Bandyopadhyay, Saumil Englund, Dirk R. Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Advances in the fabrication of large-scale integrated silicon photonics have sparked interest in optical systems that process information at high speeds with ultra-low energy consumption. Photonic systems, which have historically been used for optical telecommunications, have recently been demonstrated to accelerate tasks in quantum simulation, artificial intelligence, and combinatorial optimization. This thesis reports work towards the goal of realizing large-scale programmable photonic systems for information processing: 1) we develop deterministic error correction algorithms for programmable photonic systems, whose capabilities are believed to be limited by fabrication error, showing that these systems can be programmed to implement accurate linear matrix processing suitable for deep neural networks at scales of up to hundreds of channels; 2) we describe a new paradigm for coupling large numbers of optical channels to photonic circuits with exceptionally high alignment tolerance, enabling the use of high-volume, low-precision electronic pick-and-place equipment for photonic assembly; and 3) we design, fabricate, and demonstrate the first single-chip, end-to-end photonic processor for deep neural networks. This fully-integrated coherent optical neural network (FICONN), which monolithically integrates multiple optical processor units for matrix algebra and nonlinear activation functions into a single chip, implements single-shot coherent optical processing of a deep neural network with sub-nanosecond latency. On-chip, in situ training of a deep neural network is demonstrated on this system, obtaining high accuracies on a vowel classification task comparable to that of a digital system. Our results open the path towards integrated, large-scale photonic processors for low-latency inference and training of deep neural networks. Ph.D. 2023-07-31T19:39:06Z 2023-07-31T19:39:06Z 2023-06 2023-07-13T14:15:24.674Z Thesis https://hdl.handle.net/1721.1/151430 In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Bandyopadhyay, Saumil
Accelerating Artificial Intelligence with Programmable Silicon Photonics
title Accelerating Artificial Intelligence with Programmable Silicon Photonics
title_full Accelerating Artificial Intelligence with Programmable Silicon Photonics
title_fullStr Accelerating Artificial Intelligence with Programmable Silicon Photonics
title_full_unstemmed Accelerating Artificial Intelligence with Programmable Silicon Photonics
title_short Accelerating Artificial Intelligence with Programmable Silicon Photonics
title_sort accelerating artificial intelligence with programmable silicon photonics
url https://hdl.handle.net/1721.1/151430
work_keys_str_mv AT bandyopadhyaysaumil acceleratingartificialintelligencewithprogrammablesiliconphotonics