Scalar Operand Networks: Design, Implementation, and Analysis
The bypass paths and multiported register files in microprocessors serve as an implicit interconnect tocommunicate operand values among pipeline stages and multiple ALUs. Previous superscalar designs implementedthis interconnect using centralized structures that do not scale with increasing ILP dema...
Main Authors: | , , , |
---|---|
Other Authors: | |
Language: | en_US |
Published: |
2005
|
Online Access: | http://hdl.handle.net/1721.1/30477 |
_version_ | 1811091926097592320 |
---|---|
author | Taylor, Michael Bedford Lee, Walter Amarasinghe, Saman Agarwal, Anant |
author2 | Computer Architecture |
author_facet | Computer Architecture Taylor, Michael Bedford Lee, Walter Amarasinghe, Saman Agarwal, Anant |
author_sort | Taylor, Michael Bedford |
collection | MIT |
description | The bypass paths and multiported register files in microprocessors serve as an implicit interconnect tocommunicate operand values among pipeline stages and multiple ALUs. Previous superscalar designs implementedthis interconnect using centralized structures that do not scale with increasing ILP demands. Insearch of scalability, recent microprocessor designs in industry and academia exhibit a trend toward distributedresources such as partitioned register files, banked caches, multiple independent compute pipelines,and even multiple program counters. Some of these partitioned microprocessor designs have begun to implementbypassing and operand transport using point-to-point interconnects. We call interconnects optimizedfor scalar data transport, whether centralized or distributed, scalar operand networks. Although thesenetworks share many of the challenges of multiprocessor networks such as scalability and deadlock avoidance,they have many unique requirements, including ultra-low latencies (a few cycles versus tens of cycles)and ultra-fast operation-operand matching. This paper discusses the unique properties of scalar operandnetworks (SONs), examines alternative ways of implementing them, and introduces the AsTrO taxonomy todistinguish between them. It discusses the design of two alternative networks in the context of the Raw microprocessor,and presents detailed timing, area and energy statistics for a real implementation. The paperalso presents a 5-tuple performance model for SONs and analyzes their performance sensitivity to networkproperties for ILP workloads. |
first_indexed | 2024-09-23T15:10:08Z |
id | mit-1721.1/30477 |
institution | Massachusetts Institute of Technology |
language | en_US |
last_indexed | 2024-09-23T15:10:08Z |
publishDate | 2005 |
record_format | dspace |
spelling | mit-1721.1/304772019-04-12T08:37:49Z Scalar Operand Networks: Design, Implementation, and Analysis Taylor, Michael Bedford Lee, Walter Amarasinghe, Saman Agarwal, Anant Computer Architecture The bypass paths and multiported register files in microprocessors serve as an implicit interconnect tocommunicate operand values among pipeline stages and multiple ALUs. Previous superscalar designs implementedthis interconnect using centralized structures that do not scale with increasing ILP demands. Insearch of scalability, recent microprocessor designs in industry and academia exhibit a trend toward distributedresources such as partitioned register files, banked caches, multiple independent compute pipelines,and even multiple program counters. Some of these partitioned microprocessor designs have begun to implementbypassing and operand transport using point-to-point interconnects. We call interconnects optimizedfor scalar data transport, whether centralized or distributed, scalar operand networks. Although thesenetworks share many of the challenges of multiprocessor networks such as scalability and deadlock avoidance,they have many unique requirements, including ultra-low latencies (a few cycles versus tens of cycles)and ultra-fast operation-operand matching. This paper discusses the unique properties of scalar operandnetworks (SONs), examines alternative ways of implementing them, and introduces the AsTrO taxonomy todistinguish between them. It discusses the design of two alternative networks in the context of the Raw microprocessor,and presents detailed timing, area and energy statistics for a real implementation. The paperalso presents a 5-tuple performance model for SONs and analyzes their performance sensitivity to networkproperties for ILP workloads. 2005-12-22T01:32:06Z 2005-12-22T01:32:06Z 2004-06-08 MIT-CSAIL-TR-2004-038 MIT-LCS-TM-645 http://hdl.handle.net/1721.1/30477 en_US Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory 27 p. 60061887 bytes 2516865 bytes application/postscript application/pdf application/postscript application/pdf |
spellingShingle | Taylor, Michael Bedford Lee, Walter Amarasinghe, Saman Agarwal, Anant Scalar Operand Networks: Design, Implementation, and Analysis |
title | Scalar Operand Networks: Design, Implementation, and Analysis |
title_full | Scalar Operand Networks: Design, Implementation, and Analysis |
title_fullStr | Scalar Operand Networks: Design, Implementation, and Analysis |
title_full_unstemmed | Scalar Operand Networks: Design, Implementation, and Analysis |
title_short | Scalar Operand Networks: Design, Implementation, and Analysis |
title_sort | scalar operand networks design implementation and analysis |
url | http://hdl.handle.net/1721.1/30477 |
work_keys_str_mv | AT taylormichaelbedford scalaroperandnetworksdesignimplementationandanalysis AT leewalter scalaroperandnetworksdesignimplementationandanalysis AT amarasinghesaman scalaroperandnetworksdesignimplementationandanalysis AT agarwalanant scalaroperandnetworksdesignimplementationandanalysis |