ISSN ONLINE(2278-8875) PRINT (2320-3765)

All submissions of the EM system will be redirected to Online Manuscript Submission System. Authors are requested to submit articles directly to Online Manuscript Submission System of respective journal.

Efficient Area Minimization with High Speed and Low Power Multiplier Structural Design for Multirate Filter Design

Ch.D.Vishnupriya 1, K.Neelima2
  1. M.Tech, Asst. Professor, Dept of Electronics & Communication Engineering, Pragati Engineering College, Surampalem, AP, India
  2. M.Tech Student, Dept of Electronics & Communication Engineering, Pragati Engineering College, Surampalem, AP, India
Related article at Pubmed, Scholar Google

Visit for more related articles at International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering

Abstract

In Multi-rate Signal processing studies used in Digital Signal processing systems include sample rate conversion. This technique is used for systems with different input and output sample rates. Interpolation and Decimation is very effective and popular in multi rate signal processing applications. This paper proposes a high speed, area and power efficient VLSI architecture for polyphase decimation filter with decimation factor of three (D=3) using Booth multiplier. By using booth multiplier to multiply signed numbers also. Various key performance metrics such as number of slices, maximum operating frequency, number of LUT’s, input output bonds, power consumption, setup time, hold time, propagation delay between source and destinations are estimated for the filter of length nine (N=9).The power dissipation is reduced in polyphase decimation filter using Booth multiplier which consumes low-power when compared to the conventional multiplier. The speed is improved by using carry look-ahead adder. It was observed that the proposed scheme provides increase in speed, reduction in area and slight reduction in power dissipation when compared to conventional and BFD multiplier and low complexity.

Keywords

Polyphase decimation filter, Booth multiplier, BFD Multiplier architecture, Area, power dissipation, carry look ahead adder, speed.

INTRODUCTION

Multirate (decimation/interpolation) filters are among the essential signal processing components in spaceborne instruments where Finite Impulse Response (FIR) filters are often used to minimize nonlinear group delay and finite-precision effects. Cascaded (multi-stage) designs of Multi-Rate FIR (MRFIR) filters are further used for large rate change ratio, in order to lower the required throughput while simultaneously achieving comparable or better performance than single-stage designs. Traditional representation and implementation of MRFIR employ polyphase decomposition of the original filter structure, whose main purpose is to compute only the needed output at the lowest possible sampling rate.
Recently, there has been rapid progress in the area of multirate digital signal processing. The applications of multirate systems include subband coding of video, audio, and speech signals, fast transforms using digital filter banks, wavelet analysis of all types of signals, and many other fields. In multirate systems, decimation and interpolation filters are the most important building blocks. A great amount of literature deals with the theory and design of decimation and interpolation filters.
However, issues concerning the VLSI implementation scheme for multirate filters have not been investigated thoroughly. Since the speed of processing time and the silicon area are the crucial factors in the VLSI implementation] a scalable implementation scheme to flexibly and efficiently implement the multirate FIR filters is presented in this paper.
The advances in information technology and the increasing requirement of Very Large Scale Integration issues have resulted in a rapid development of several optimization algorithms and techniques. Low power consumption and smaller area are some of the most important criteria for the DSP systems and high performance systems. DSP systems have less sensitivity to component tolerances and environmental changes and the dynamic range of the system can be increased by floating point arithmetic [2]. Multirate processing arises in many fields of digital signal processing.
Multirate signal processing applications includes digital audio tape, transmultiplexers, subband coding, speech processing, Analog voice privacy systems etc [1]. In digital audio, the sampling rate conversions are about 32 kHz to 44.1kHz and 44.1kHz to 48kHz and vice versa. Symbol rate processing, bit rate processing and sample rate processing are some of the signal processing issues in digital communication [3]. Polyphase decomposition is one of the most important techniques used in multirate signal processing. Polyphase structure utilizes FIR filter that leads to very efficient implementation. Because FIR filters are conditionally stable and linear phase filters. Linear phase in the sense that their phase delay and group delay must be a constant. In data communication, if a pulse is smeared then the received signal does not convey the desired information to the intended user. So, linear phase is the most inherent property of FIR filters.
Exact linear phase cannot be achieved with IIR filters. It can only be achieved with FIR filters. Genetic algorithm is used to find the multistage parameter combination for multistage sampling rate conversion FIR filter design [4]. A bit-level optimized algorithm is used for the design of high-speed FIR decimation filters [5]. Efficient FIR filters are implemented by using systolic decomposition [19]. Power dissipation is the factor that changes rapidly and is one of the major challenging issues today. Texas Instruments are making digital circuits which require no power supply.
Particularly for implantable devices like heart artificial transplantation and artificial hearing aids, they are aiming at devices which will work from body temperature. The power dissipation is minimized by reducing the switching activity factor and by minimizing number of operations to be held in the filter structure. The switching activity reduced by adder and counter [8]. The multipliers have slightly less area and power than optimized tree multipliers while keeping similar delay [9]. The reduction of power consumption is obtained by altering multiplicands in software without any hardware modifications [10]. Optimized Wallace tree and pipelining techniques were used to design a power aware booth multiplier [11]. SPST(Spurious Power Suppression Technique) is applied on multipliers for high-speed and lowpower purposes [12]. Low power fixed width multipliers are used to improve the speed, reduce power and area considerably [13]. A low-power structure called bypass zero, feed A directly (BZ-FAD) for shift-andadd multiplier architecture considerably lowers the switching activity [7]. In this paper, FIR filter with power efficient Booth multiplier is preferred here to get reduction in power dissipation.
This paper describes the design of polyphase multiplier with high speed low power Booth multiplier architecture which uses carry look-ahead adder. This provides reduction in power dissipation as well as increase in speed when compared to conventional multiplier.

PROPOSED POLYPHASE DECIMATION FILTER

A filter can be realized with several ways such as direct form, cascade form, linear phase and polyphase realizations. When the transfer function of the filter is decomposed into number of sub branches, the process is called polyphase realization [1] [2] [3].
A. Polyphase FIR filter
Finite Impulse Response (FIR) filter is a zero-phase filter with magnitude equal to unity in the pass band and zero in the stop band [3]. FIR system is described by the difference equation shown in “(1)”.
image (1)
Where k b be the filter coefficients, x(n)& y(n) are input and output sequences. The equivalent system function is given in “(2)”.
image (2)
Furthermore, the unit sample response of the FIR system is identical to the filter coefficients. FIR filters can be designed to provide exact linear phase over the whole frequency range and are always BIBO stable independent of the filter coefficients.
A linear-phase FIR filter of order N is either characterized by a symmetric impulse response given in “(3)”.
h(n) = h(n − N) (3)
or by an asymmetric impulse response of a FIR filter is given in “(4)”
h(n) = −h(n − N) (4)
The symmetry (or asymmetry) property of a linear-phase FIR filter can be exploited to reduce the total number of multipliers into almost half of that in the direct form implementations of the transfer function.
In a general case, the transfer function of L-branch polyphase decomposition of the transfer function of order N is given by “(5)”.
image(5)
FIR filter is realized based on polyphase decomposition which leads a parallel structure. To illustrate this approach, a casual FIR transfer function H(z) of length nine is given in “(6)”, which is a function of filter coefficients h(k),0 ≤ k ≤ 8.
H(z) = h(0) + h(1)z −1 + ........ + h(8)z −8 (6)
The above transfer function can be expressed as a sum of two terms, with one term containing the even-indexed coefficients and the other containing the odd-indexed coefficients which is given by
image
Grouping the same equation differently, the transfer function is re-expressed in the form of sub bands as shown in “(7)”.
image (7)
Equation (7) is represented schematically in fig.1. The structure has delays, multipliers and accumulators.
B. Transposed Polyphase Decimation Filter
The decimator with a decimation factor D, where D is a positive integer develops an output sequence y(n) with a sampling rate is (1/D)th of the input sequence x(n) . This is implemented by keeping every Dth sample of the input sequence and removing D-1 samples between consecutive samples. As a result, all input samples with indices equal to an integer multiple of D are retained at the output and all others are discarded, to generate the output sequence according to the relation given in “(8)”.
y(n) = x(nD) (8)
Decimation results in aliasing, to avoid the aliasing effect ant aliasing filter called low pass filter is used before down sampling [17].
The transposed form of the decimation filter is used in this paper to avoid the shift registers used in tapped delay structure. Here, x(n) comes into the filter at the sample rate, fs, and is applied to all of the tap multipliers at the same time. The number of stages in the transposed filter depends on the value of decimation factor and number of coefficients. The number of stages is equal to the number of coefficients divided by decimation factor. If the coefficients are not a multiple of decimation factor then we have to append zeros.
The structure of proposed transposed form of polyphase filter for decimation factor of three is shown in fig.2. In this structure, during each cycle, input x(n) is passed through the tap multiplier. Initially the input sequence x(n) is given through a parallel input serial output (PISO) shift register.
The output from PISO is given to three sub filter section where the input sequences are processed in parallel form by means of serial input parallel output (SIPO) shift register. Each section, the input is multiplied with the coefficients. Since the processing is done in parallel, this structure improves the speed of operation. The coefficients are cycled through their values at the sample rate, but the indexes of the coefficients at any given time are separated by D, the decimation rate.
image
The coefficient is multiplied with the sample, but the input samples and coefficients changing during each cycle. The function of the accumulator is to accumulate the value of multiplier for every 3(D) cycles. Carry look-ahead adder is used to perform addition in order to increase the speed of operation. The decimated output y(n) is obtained using this polyphase structure.
image

MULTIPLIER ARCHITECTURES

There are several multipliers available to perform multiplications. Shift and add, BFD multiplication, conventional multiplication and spurious power suppression are some of the techniques used to perform multiplication. In this paper booth multiplier is compared with the conventional and BFD multiplier.
A. BOOTH MULTIPLIER:
image
It is a powerful algorithm for signed-number multiplication, which treats both positive and negative numbers uniformly.
For the standard add-shift operation, each multiplier bit generates one multiple of the multiplicand to be added to the partial product. If the multiplier is very large, then a large number of multiplicands have to be added. In this case the delay of multiplier is determined mainly by the number of additions to be performed. If there is a way to reduce the number of the additions, the performance will get better.
Booth algorithm is a method that will reduce the number of multiplicand multiples. For a given range of numbers to be represented, a higher representation radix leads to fewer digits. Since a k-bit binary number can be interpreted as K/2-digit radix-4 number, a K/3-digit radix-8 number, and so on, it can deal with more than one bit of the multiplier in each cycle by using high radix multiplication. This is shown for Radix-4 in the example below.
image
As shown in the figure above, if multiplication is done in radix 4, in each step, the partial product term (Bi+1Bi)2 A needs to be formed and added to the cumulative partial product. Whereas in radix-2 multiplication, each row of dots in the partial products matrix represents 0 or a shifted version of A must be included and added.
Table 1below is used to convert a binary number to radix-4 number. Initially, a “0” is placed to the right most bit of the multiplier. Then 3 bits of the multiplicand is recoded according to table below or according to the following equation:
Zi = -2xi+1 + xi + xi-1
Example:
Multiplier is equal to 0 1 0 1 1 10
Then a 0 is placed to the right most bit which gives 0
1 0 1 1 10 0 0 added
The 3 digits are selected at a time with overlapping left most bit as follows:
image
For example, an unsigned number can be converted into a signed-digit number radix 4:
(10 01 11 01 10 10 11 10)2 = ( –2 2 –1 2 –1 –1 0 –2)4
The Multiplier bit-pair recoding is shown in Table 3.2
image
Here –2*multiplicand is actually the 2s complement of the multiplicand with an equivalent left shift of one bit position. Also, +2 *multiplicand is the multiplicand shifted left one bit position which is equivalent to multiplying by 2. To enter ± 2*multiplicand into the adder, an (n+1)-bit adder is required. In this case, the multiplicand is offset one bit to the left to enter into the adder while for the low-order multiplicand position a 0 is added. Each time the partial product is shifted two bit positions to the right and the sign is extended to the left.During each add-shift cycle, different versions of the multiplicand are added to the new partial product depends on the equation derived from the bit-pair recoding table above.
Let’s see some examples:
image
B. Carry look-ahead adder (CLA)
The carry lookahead adder (CLA) solves the carry delay problem by calculating the carry signals in advance, based on the input signals. It is based on the fact that a carry signal will be generated in two cases: (1) when both bits ai and bi are 1, or (2) when one of the two bits is 1 and the carry-in is 1. Thus, one can write,
image
The above two equations can be written in terms of two new signals Pi and Gi, which are shown in Figure 4:
image
image
Pi and Gi are called the carry generate and carry propagate terms, respectively. Notice that the generate and propagate terms only depend on the input bits and thus will be valid after one and two gate delay, respectively. If one uses the above expression to calculate the carry signals, one does not need to wait for the carry to ripple through all the previous stages to find its proper value. Let’s apply this to a 4-bit adder to make it clear.
Putting I = 0,1,2,3 in Equation 5, we get
image
Implementation Observations:
The implementation of the proposed algorithm is illustrated in the form of various pictorial views obtained during the process of FPGA Implementation. Figure presents the Synthesis report of integrated Interpolator and Decimator module which was constructed from a gate level net list to the model of a circuit described in Verilog HDL. Figure shows the RTL view of proposed algorithm whereas Figure depicts the technical schematic view of targeted FPGA device.

RESULTS AND DISCUSSION

In this paper, we have presented how to make FIR filters applicable for multiple sampling rates with the help of interpolator and decimator. For reducing the complexity we had used the low complexity FIR filter design with the help of programmable shifting method. The proposed technique in this paper is able to reduce the complexity of decimator filter and interpolator filter circuits in DSP processing where multiple sampling rates are required.
The polyphase decimation filter is designed and verified using Xilinx 14.3 and ModelSim 6.3g. The Xilinx simulation results are as follows. The RTL (Register Transfer Logic) schematic layout, technology schematic layout and design summary of Multi rate FIR filter is shown in below. 6. The result is the output of the multiplier is 16 bits wide. The below fig shows the detailed RTL schematic of Multiplier architecture.
image
image
image
image
image

CONCLUSION

In this paper, a polyphase decimation filter with high speed , area and power efficient Booth multiplier architecture has been proposed. Various key performance metrics such as number of slices, maximum operating frequency, number of LUT’s, input- output bonds, power consumption, setup time, hold time, propagation delay between source and destinations are estimated for the filter of length nine (N=9).Power dissipation is reduced slightly in polyphase decimation filter using this low-power multiplier architecture when compared to the conventional shift and add multiplier and BFD multiplier.

Advantages:

The advantages of this work is that it reduced complexity which in turn reduces the area, power dissipation ,gives higher throughput rate, higher processing speed, fast computation, LFSR can rapidly transmit a sequence that indicates high-precision relative time offsets and many more.

References