High-level synthesis algorithm for the design of reconfigurable constant multiplier
Multiplying a signal by a known constant is an essential operation in digital signal processing algorithms. In many application scenarios, an input or output signal is repeat...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Journal Article |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/80022 http://hdl.handle.net/10220/6230 |
Summary: | Multiplying a signal by a known constant is an essential
operation in digital signal processing algorithms. In many
application scenarios, an input or output signal is repeatedly
multiplied by several predefined constants at different instances.
These temporal redundancies can be exploited for the design of
an efficient reconfigurable constant multiplier (RCM). An RCM
achieves greater hardware savings than the conventional multiple
constant multiplication architecture, limited only by the available
latency of the subsystem. Motivated by a number of lucrative
examples, this paper presents a new high-level design methodology
for RCM. Common subexpressions in the preset constants represented
in minimum signed-digit system are first eliminated to
obtain a minimum depth multiroot directed acyclic graph (DAG).
The DAG is converted into a primitive data flow graph (DFG)
where mobile adders are identified. By scheduling each mobile
adder into a control step within its legitimate time window with
the minimum opportunity cost, mutually exclusive adders can be
merged with significantly reduced adder and multiplexing cost.
The opportunity cost for each scheduling decision is assessed
by the probability displacement and disparity measures of the
scheduled node as well as its predecessors and successors in the
DFG. The algorithm is runtime efficient as exhaustive search for
the best fusion of independently optimized constant multipliers
has been avoided. Simulation results on randomly generated 12-b
constant sets show that the solutions generated by the proposed
algorithm are on average 19% to 25% more area–time efficient
than the best reported solutions. |
---|