Novel Approach For Fast Partial Product Compressor Structure Using Quarter Module For 8x8 & Higher Multipliers

International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 3 Issue 2, May 2014
Novel Approach for Fast Partial Product Compressor structure using Quarter
Module for 8x8 & Higher Multipliers
Nilay Nagdeve1, Vishal Moyal2
1
ME IV Sem, Department of ET&T, FET, SSTC, Bhilai

Assoc. Prof. & Head, Department of ET&T, SSITM, SSTC, Bhilai
ABSTRACT
It is very realistic that the FPGA and CPLD devices are
used for Digital Signal Processing and signal testing
purposes in this modern era of Digital Signal Handling.
While realizing or testing any peculiar module, DSP
processors needs a faster multiplication mechanism. The
faster multiplication can be accomplished by using a
faster multiplication algorithm with reflexions for
parameters like Area and Power. Also, in another way,
after generating a partial products using a simple
multiplication process, use of faster compressors which
will transfigures a partial products into a lawful data.
This paper will show the efforts taken to generate a
faster results using a faster code compressors of [3:2],
and [4:2] value with Non-Booth tactic. Also this paper
will throw a focus on designing and implementation of
quarter module which is basically a combinational block
having capability of 16:8 compression and can be used
in symmetric multiplication.
Keywords Multiplier, Compressor, Non Booth,
Spartan, FPGA
I.
INTRODUCTION
Some of Faster multiplication procedures focuses on

generate a faster results using generation of less partial
products and some on generating faster results using
compressors. Multipliers consist of three fundamental
parts: a partial product generator, a partial product
reduction and a final fast adder part [1]-[4]. Generally, to
create a valid data, partial products are simply added
using Full adders and Half adders with a specific
structure suggested like Wallace Tree. Full adder is
fundamental unit in various circuits, especially, in
performing arithmetic operations such as compressors,
comparators, parity checkers, multipliers etc. It is the
nucleus of many other useful operations such as
subtraction, multiplication, division, exponentiation,
address calculation and can significantly influence the
overall achievable performances of the system [5].
The stages of multiplication can be categorized as
follows: In the first stage, the multiplicand and the
multiplier are multiplied bit by bit to generate the partial

products. The second stage is the most important, as it is
the most complicated and determines the speed of the
overall multiplier. The 4-2 compressor has been widely
employed in the high speed multipliers for the
construction of Wallace tree to lower the delay of the
partial product accumulation stage [6], [7].
II.
FUNDAMENTAL APPROACH
As there are so many patterns to design a same thing, in

same way the faster multiplication can be done with
various combinations. A general physicist named
Andrew D. Booth proposed a multiplication algorithm
first time (1950) in order to reduce the partial products
using Booth Encoder and Booth Decoder. This paper
will basically focuses on Conventional methodology
with modern designing for partial product reduction
rather than Booths Algorithm. Again Full adders were
used as a basic building block for generating answer
from the partial product
A. Single bit Full Adder
One bit full adder is nothing but a 3:2 compressor which
takes three input, mainly two signals and one carry to
produce two output bits, mainly sum and one carry
output. While implementing a full adder operation, the
carry generation is a task which take a delay in output.
As only the carry transferred to next consecutive
operation, it must be generated fast in order to make the
next execution to be started and hence an overall fast
execution. The implementation of first adder can be done
through utilization of two simple Half-adders or by
creating a new logic. Applying the knowledge of digital
electronics, the equations are reduced to form a
confident structure of Full Adder
B. Single bit 4:2 compressor
These compressors are basically takes 5 inputs and
process to provide 3 output though the name shows 4
input and 2 output. It is called compressor, since it
compress four partial products into two [8]. The figure 2
shows a simple 4:2 compressor in IN1, IN2, IN3, and
IN4 are input and SUM, and Carry0 as output. As the
www.ijsret.org
198
addition of 5 bits is taking place. The maximum length

of output signal is of 3 bits. 0th bit will be the SUM bit,
1st bit will be the carry0 bit which is to be added to next
addition and 2nd bit is Carry1 bit which is to be added to
one next addition. The thoroughgoing value this
compressor can generate is 101 when all the inputs are
Logic High.
III.
Figure 3 (a) Alternative implementation of [3:2]

compressors using XOR logic
PROPOSED APPROACH
The proposed work is totally based on designing of new

[3:2] and [4:2] compressor and Quarter Module by using
those compressors.
Figure 3 (b) Alternative implementation of [3:2]

compressors using XNOR logic
Figure 3 (c) Alternative implementation of [3:2]

compressors using MUX logic
Figure 1 Propsoed Approach
A. [3:2] Compressor
The implementation of [3:2] can be done by using two
[2:2] compressors with extra ORing for carry output.
The implemented circuit is as shown:
Figure 3 (d) Alternative implementation of [3:2]

compressors using proposed logic
Figure 2 Implemented [3:2] compressor (Design I)
The alternative way by rearranging the calculations that
may generate faster sum and carry outputs are as
follows:
Alternative calculation that may generate faster sum are

shown above in figure 4(a), 4(b), & 4(c). The best way
to generate faster carry is to generate carry outputs in
which the sum output is logically selected by carry
output. The carry output logic is drawn by solving k-map
of truth table. Calculations are so done that the carry will
select the sum. As the carry input is required for next
block to start computing, carry will generate before sum.
The circuit can be realized as shown above in figure
www.ijsret.org
199
2(d). The timing diagram to verify the truth table by all

of the above circuits for [3:2] compressor is as shown in
figure 3. Figure also shows the values at intermediate
nodes.
Figure 5 (c) Alternative implementation of [4:2]

proposed combination
Figure 4 Test bench waveform of [3:2] compressor
B. [4:2] Compressor
Moving towards higher compressors, [4:2] compressor
can be made from lower compressors like [2:2] or [3:2].
From the conclusion drawn from previous section,
design approaches IV and V can be preferred. Some of
the designs for [4:2] compressor will use either IV-IV or
V-V or IV-V design combinations. [4:2] compressor
adds 4 inputs along with carry and produces sum and
carry output. This compressor takes IN1, IN2, IN3, and
IN4 with cin, adds them and produces one bit sum, one
bit carry and one bit carry travel for second next module.
Various possible implementations and proposed
combinations are as shown in figure 4 (a), 4 (b), & 4 (c).
C. DESIGNING of a Quarter Module

While calculating a final answer from a partial products
generated between the calculation, a symmetrical
multiplication (here, symmetrical stands for (2x2, 4x4,
8x8, 12x12...and so on) generates a symmetrical
products in a group that can be formed as shown in dot
figure 3 for 8x8 multiplication.
The quarter module is nothing but a higher order
compressor made up of best of 3:2 and 4:2 compressors.
As the carries generated in 5:2 and higher compressors
are not manageable, the compressors used here are 3:2
and 4:2.
Figure 6 Dot representation for 8x8 multiplication
Figure 5 (a) Alternative implementation of [4:2]

combination I
Figure 5 (b) Alternative implementation of [4:2]

combination II
After implementation of quarter module which is

basically a quarter part of overall partial product
reduction, the dot product with quarter module result can
be realized as follows:
Figure 7 Reduced partial products after QM

implementation
The implemented quarter module technology test bench
waveform is as shown:
www.ijsret.org
200
From above bars and comparison table, following

conclusions can be made
1) Design I can generate fastest cout from cin.
2) Design II and III can generate fastest sum from cin.
3) Design IV can generate fastest sum from any of the
input.
4) Design V can generate faster cout from any one of the
input.
Figure 8 Test bench waveform for QM
After getting the results from the quarter modules, the
values can be added to get the final result again with the
use for compressors discussed above. The multiplication
is done with simple AND technique and the addition is
done with faster compressors.
IV.
Design IV and V are more preferable than other 3

designs while when the carry generation from input is a
prime concern, design V will be more preferable. To
design higher compressors with the help of [3:2]
compressors, design IV and design V is preferable.
COMPARISON & DISCUSSION
The above discussed [3:2] compressors have capabilities

to produce sum and carry from inputs and carry input.
Indeed all the designs have some faster outputs than
others. The designing approach for design II and design
III have same timing values, so mentioned in same
column.
Figure 10 Utilization report of proposed multiplier

Table 2 Result comparison
Table 1 comparison of various [3:2] compressor designs

Design
Destin
Source
Design
II /
Desig Desig
ation
pad
I
Design
n IV
nV
pad
III
In 1
Sum
7.024
7.142
6.785 6.940
In 2
Sum
7.064
6.976
6.578 6.888
Cin
Sum
5.990
5.985
6.411 7.060
In 1
Cout
6.812
6.803
6.945 6.386
In2
Cout
6.852
6.637
6.738 6.334
Cin
Cout
6.260
6.284
6.413 6.256
Figure 11 Result comparison

Figure 9 Comparison of 3:2 compressors various
implementations
www.ijsret.org
201
V.
CONCLUSION
While comparing with the quarter module with and

without fast compressors, the maximum path delay for
the quarter module is getting as 14.7ns and minimum
path delay is 11.0 to generate MSB of the result. The
results are approximate 7% faster than normal
compressor approach. This represents a Novel approach
which is more beneficial than normal.
[9] Nilay Nagdeve, Vishal Moyal, Ms. Archana Fande,

A Simulation Based Evaluation of Different
Compressors for Fast Multiplication, International
Journal of Scientific & Engineering Research, vol.3,
issue 6, June 2012.
During the comparison of delay for producing a MSB

result bit from LSB input bit , it has been found that, the
proposed structure has a smaller delay of (maximum
data path delay) 7.913ns than the Wallace tree multiplier
(with delay of 8.504ns), which is approximately 7%
(6.94%) reduced.
Again the result proves, higher order compressors
increases the speed and reduces a delay.
REFERENCES
[1] K. Prasad, K.K. Parhi, Low power 4-2 and 5-2
compressors, Proceedings of 35th Asilomar Conference
on Signals, Systems and Computers, Vol. 1, pp. 129133, 2001.
[2] Z. Wang, G. A. Jullien, and W. C. Miller, A new
design technique for column compression multipliers,
IEEE Trans. Comput., vol. 44, pp. 962970, Aug. 1995.
[3] S. F. Hsiao, M. R. Jiang, and J. S. Yeh, Design of
high speed low-power 3-2 counter and 4-2 compressor
for fast multipliers, Electron. Lett, vol. 34, no. 4, pp.
341343, 1998.
[4] Kwon, O., Nowka, K., and Swartzlander, E.E. A 16bit x 16-bit MAC design using fast 5:2 compressor,
Proc. IEEE Int. Conf. on Application-Specific Systems,
Architectures, and Processors, pp. 235243, 2000.
[5] Ahmed M. Shams, Tarek K. Darwish and Magdy A.
Bayoumi, Performance Analysis of Low-Power1-Bit
CMOS Full Adder Cells. IEEE Transactions on Very
Large Scale Integration (VLSI) System, Vol. 10, No. 1,
2002, pp. 20-29.
[6] Sreehari Veeramachaneni et al, Novel Architectures
for High-Speed and Low-Power 3-2, 4-2 and 5-2
Compressors, International Conference on VLSI
Design, IEEE, 2007.
[7] Sung Mo Kang and Yusuf Leblebici, CMOS
digital integrated circuits-analysis and design, 2003,
Tata McGraw-Hill, third edition.
[8] Riya Garg, Suman Nehra, B. P. Singh, -Low Power
4-2 Compressor for Arithmetic Circuits International
Journal of Recent Technology and Engineering (IJRTE),
ISSN: 2277-3878, Volume-2, Issue-1, March 2013.
www.ijsret.org
202

Novel Approach For Fast Partial Product Compressor Structure Using Quarter Module For 8x8 & Higher Multipliers

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Novel Approach For Fast Partial Product Compressor Structure Using Quarter Module For 8x8 & Higher Multipliers

Caricato da

Copyright:

Formati disponibili

International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882

Volume 3 Issue 2, May 2014

ME IV Sem, Department of ET&T, FET, SSTC, Bhilai

Some of Faster multiplication procedures focuses on

multiplier are multiplied bit by bit to generate the partial

As there are so many patterns to design a same thing, in

addition of 5 bits is taking place. The maximum length

Figure 3 (a) Alternative implementation of [3:2]

The proposed work is totally based on designing of new

Figure 3 (b) Alternative implementation of [3:2]

Figure 3 (c) Alternative implementation of [3:2]

Figure 3 (d) Alternative implementation of [3:2]

Alternative calculation that may generate faster sum are

2(d). The timing diagram to verify the truth table by all

Figure 5 (c) Alternative implementation of [4:2]

C. DESIGNING of a Quarter Module

Figure 6 Dot representation for 8x8 multiplication

Figure 5 (a) Alternative implementation of [4:2]

Figure 5 (b) Alternative implementation of [4:2]

After implementation of quarter module which is

Figure 7 Reduced partial products after QM

From above bars and comparison table, following

Design IV and V are more preferable than other 3

COMPARISON & DISCUSSION

The above discussed [3:2] compressors have capabilities

Figure 10 Utilization report of proposed multiplier

Table 1 comparison of various [3:2] compressor designs

Figure 11 Result comparison

While comparing with the quarter module with and

[9] Nilay Nagdeve, Vishal Moyal, Ms. Archana Fande,

During the comparison of delay for producing a MSB

Potrebbero piacerti anche