0% found this document useful (0 votes)

127 views7 pages

Thesis Phase 1 Report

The document discusses improving the performance of multiplier-and-accumulator (MAC) architectures used in digital signal processing. It proposes combining the accumulation function with a modified carry-save adder tree to reduce critical path delays. The accumulator is merged into the carry-save adder, and intermediate results are accumulated as sums and carries rather than final adder outputs to enable increased pipelining. Booth's algorithm is used to generate partial products, with radix-4 encoding discussed to reduce the number processed. Initial work implemented a Booth encoder in Verilog for an 8-bit signed number multiplier.

Uploaded by

Dhanya Geethanjali Sasidharan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

127 views7 pages

Thesis Phase 1 Report

Uploaded by

Dhanya Geethanjali Sasidharan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Thesis Phase 1 report

ABSTRACT
The paper , proposes a new architecture of multiplier-and-accumulator (MAC) for high-speed arithmetic. By combining multiplication with accumulation and devising a hybrid type of carry save adder (CSA), the performance can be improved. Accumulator that has the largest delay in MAC is merged into CSA for improving performance . The proposed CSA tree uses 1s-complement-based radix-2 modified Booths algorithm (MBA) and has the modified array for the sign extension in order to increase the bit density of the operands. The CSA propagates the carries to the least significant bits of the partial products and generates the least significant bits in advance to decrease the number of the input bits of the final adder. Also, the proposed MAC accumulates the intermediate results in the type of sum and carry bits instead of the output of the final adder, which make it possible to optimize the pipeline scheme to improve the performance. expect that the proposed MAC can be adapted to various fields requiring high performance such as the signal processing areas.

Thesis Phase 1 report

INTRODUCTION
With the recent rapid advances in multimedia and communication systems, real-time signal processings like audio signal processing, video/image processing, or large-capacity data processing are increasingly being demanded. The multiplier and multiplier-and-accumulator (MAC) are the essential elements of the digital signal processing such as filtering, convolution, and inner products. Most digital signal processing methods use nonlinear functions such as discrete cosine transform (DCT) or discrete wavelet transform (DWT) . Because they are basically accomplished by repetitive application of multiplication and addition, the speed of the multiplication and addition arithmetics determines the execution speed. and performance of the entire calculation. Because the multiplier requires the longest delay among the basic operational blocks in digital system, the critical path is determined by the multiplier, in general. For high-speed multiplication, the modified radix-4 Booths algorithm (MBA) is commonly used.In general, a multiplier uses Booths algorithm and array of full adders (FAs), or Wallace tree instead of the array of FAs., i.e., this multiplier mainly consists of the three parts:Booth encoder, a tree to compress the partial products such asWallace tree, and final adder .The most effective way to increase the speed of a multiplier is to reduce the number of the partial products because multiplication proceeds a series of additions for the partial products. To reduce the number of calculation steps for the partial products, MBA algorithm has been applied mostly where Wallace tree has taken the role of increasing the speed to add the partial products. To increase the speed of the MBA algorithm, many parallel multiplication architectures have been researched . Among them, the architectures based on the BaughWooley algorithm(BWA) have been developed and they have been applied to various digital filtering calculations . One of the most advanced types of MAC for general-purpose digital signal processing has been proposed by Elguibaly . It is an architecture in which accumulation has been combined with the carry save adder (CSA) tree that compresses partial products. In the architecture proposed by Elguibaly , the critical path was reduced by eliminating the adder for accumulation and decreasing the number of input bits in the final adder. While it has a better performance because of the reduced critical path compared to the previous MAC architectures, there is a need to improve the output rate due to the use of the final adder results for accumulation. An architecture to merge the adder block to the accumulator register in the MAC operator was proposed in to provide the possibility of using two separate 2-bit adders instead of one -bit adder to accumulate the bit MAC results

Thesis Phase 1 report

.GENERAL MAC STRUCTURE In this section, we discuss basic MAC operation. Basically, multiplier operation can be divided into three operational steps. The first one is booth encoding to generate the partial products. The second one is adder array or partial product compression and the last one is final addition in which final multiplication result is produced . If the multiplication process is extended to accumulate the multiplied result, then MAC consists of four steps. General hardware architecture for MAC is shown in Figure 1. It executes the multiplication operation by multiplying input multiplier X and input multiplicand Y. After that current multiplication result is added to the previous multiplication result Z as accumulation step

Thesis Phase 1 report

Derivation of MAC Arithmetic

If an operation to multiply two bit numbers and accumulate into a 2 -bit number is considered, the critical path is determined by the 2 -bit accumulation operation. If a pipeline scheme is applied for each step in the standard design of Fig. 1, the delay of the last accumulator must be reduced in order to improve the performance of the MAC. The overall performance of the proposed MAC is improved by eliminating the accumulator itself by combining it with the CSA function. If the accumulator has been eliminated, the critical path is then determined by the final adder in the multiplier. The basic method to improve the performance of the final adder is to decrease the number of input bits. In order to reduce this number of input bits, the multiple partial products are compressed into a sum and a carry by CSA. The number of bits of sums and carries to be transferred to the final adder is reduced by adding the lower bits of sums and carries in advance within the range in which the overall performance will not be degraded. A 2-bit CLA is used to add the lower bits in the CSA. In addition, to increase the output rate when pipelining is applied, the sums and carrys from the CSA are accumulated instead of the outputs from the final adder in the manner that the sum and carry from the CSA in the previous cycle are inputted to CSA. Due to this feedback of both sum and carry, the number of inputs to CSA increases, compared to the standard design and . In order to efficiently solve the increase in the amount of data, a CSA architecture is modified to treat the sign bit

Thesis Phase 1 report

The hardware architecture of the MAC to satisfy the process in Fig. 3 is shown in Fig. 4. The -bitMAC inputs, and , are converted into an -bit partial product by passing through the Booth encoder. In the CSA and accumulator, accumulation is carried out along with the addition of the partial products. As a result, -bit , and (the result from adding the lower bits of the sum and carry) are generated. These three values are fed back and used for the next accumulation. If the final result for the MAC is needed, is generated by adding and in the final adder and combined with that was already generated.

Booths Algorithm
In unsigned multiplication there is no need to take the sign of the number into consideration. However in signed multiplication the same process cannot be applied because the signed number is in a 2s compliment form which would yield an incorrect result if multiplied in a similar fashion to unsigned multiplication. Thats where Booths algorithm comes in. Booths algorithm preserves the sign of the result. The modified Booths algorithm based on a radix-4, generally called Booth-2 is the most popular approachfor implementing fast multipliers using parallel encoding . It uses a digit set {0, 1, 2} to reduce the number of the partial products to n= [(n+1)/ 2]. Radix- 4 encoding start by appending a zero to the right of multiplier LSB. Triplets are taken beginning at position x 1 and continuing to the MSB with one bit overlapping between adjacent tripletsThis recoding scheme applied to a parallel multiplier halves the number of partial products so the multiplication time and the hardware requirements decrease. Radix-8 recoding applies the same algorithm as radix-4, but now in this we take quartets of bits instead of triplets. The Booth-3 scheme is based on a radix-8 encoding to reduce this number to n = [(n+1)/3].

Radix 2 based booth algoritm

For each multiplier bit, also examine bit to its right 00: middle of a run of 0s, do nothing 10: beginning of a run of 1s, subtract multiplicand 11: middle of a run of 1s, do nothing 01: end of a run of 1s, add multiplicand

Thesis Phase 1 report

Example based on radix 2 algorithm:

43 = 00000101011 * 12 = 00000001100 0 = 00000000000 // multiplier bits 00 + 0 = 00000000000 // multiplier bits 00 - 172 = 11101010100 // multiplier bits 10 + 0 = 00000000000 // multiplier bits 11 + 688 = 01010110000 // multiplier bits 01 516 = 01000000100

Radix 4 based booth algorithm

000: middle of run of 0s, do nothing 100: beginning of run of 1s, subtract multiplicand twice 010: singleton 1, add multiplicand once 110: beginning of run of 1s, subtract multiplicand once 001: end of run of 1s, add multiplicand once 101: end of run of 1s, beginning of another, subtract multiplicand once 011: end of a run of 1s, add multiplicand twice 111: middle of run of 1s, do nothing Example based on radix 4 booth algorithm: 43 = 00000101011 * 12 = 00000001100 0 = 00000000000 // multiplier bits 000 - 172 = 11101010100 // multiplier bits 110 + 688 = 01010110000 // multiplier bits 001 516 = 01000000100

Thesis Phase 1 report

WORK DONE SO FAR

Coding for the first section consisting of booth encoder was performed inverilog.Both radix2 and radix - 4 based booth algorithm were programed.Partial products were generated by considering two 8 bit signed numbers.

A New Vlsi Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
No ratings yet
A New Vlsi Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
6 pages
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
No ratings yet
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
8 pages
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
No ratings yet
Implementation of Low Power and High Speed Multiplier-Accumulator Using SPST Adder and Verilog
8 pages
A New VLSI Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
No ratings yet
A New VLSI Architecture of Parallel Multiplier-Accumulator Based On Radix-2 Modified Booth Algorithm
8 pages
Parallel MAC
No ratings yet
Parallel MAC
6 pages
1.5. MAC 1.5.1 Block Diagram of MAC
No ratings yet
1.5. MAC 1.5.1 Block Diagram of MAC
11 pages
Apar 12
No ratings yet
Apar 12
5 pages
An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator
No ratings yet
An Optimized Modified Parallel Implementation Design of Multiplier and Accumulator Operator
39 pages
VLSI Designing of High Speed Parallel Multiplier - Accumulator Based On Radix4 Booths Multiplier
No ratings yet
VLSI Designing of High Speed Parallel Multiplier - Accumulator Based On Radix4 Booths Multiplier
7 pages
Multiplier and Accumulator Unit
80% (5)
Multiplier and Accumulator Unit
4 pages
Implementation of MAC Unit Using Booth Multiplier & Ripple Carry Adder
No ratings yet
Implementation of MAC Unit Using Booth Multiplier & Ripple Carry Adder
3 pages
Efficient Implementation of 16-Bit Multiplier-Accumulator Using Radix-2 Modified Booth Algorithm and SPST Adder Using Verilog
No ratings yet
Efficient Implementation of 16-Bit Multiplier-Accumulator Using Radix-2 Modified Booth Algorithm and SPST Adder Using Verilog
12 pages
A New VLSI Architecture of Parallel Multiplier Accumulator Based On Radix 2 Modified Booth Algorithm
No ratings yet
A New VLSI Architecture of Parallel Multiplier Accumulator Based On Radix 2 Modified Booth Algorithm
9 pages
DigitalMultipliers AReview (220 223) 8aa5905d 5da5 4b50 8b2e F0bd850becb7
No ratings yet
DigitalMultipliers AReview (220 223) 8aa5905d 5da5 4b50 8b2e F0bd850becb7
4 pages
Power-Efficient Multiplier Design
No ratings yet
Power-Efficient Multiplier Design
6 pages
PXC 3878710
No ratings yet
PXC 3878710
4 pages
VLSI Architecture for Engineers
No ratings yet
VLSI Architecture for Engineers
8 pages
Ijarcet Vol 1 Issue 5 346 351
No ratings yet
Ijarcet Vol 1 Issue 5 346 351
6 pages
Implementation and Comparison of Radix-8 Booth Multiplier by Using 32-Bit Parallel Prefix Adders For High Speed Arithmetic Applications
No ratings yet
Implementation and Comparison of Radix-8 Booth Multiplier by Using 32-Bit Parallel Prefix Adders For High Speed Arithmetic Applications
11 pages
Booth Algorithm For The Design of Multiplier: Bhavya Lahari Gundapaneni, JRK Kumar Dabbakuti
No ratings yet
Booth Algorithm For The Design of Multiplier: Bhavya Lahari Gundapaneni, JRK Kumar Dabbakuti
4 pages
Booth Multiplier
No ratings yet
Booth Multiplier
6 pages
PaperID 74S201921
No ratings yet
PaperID 74S201921
7 pages
Radix-4 and Radix-8 Multiplier Using Verilog HDL: (Ijartet) Vol. 1, Issue 1, September 2014
No ratings yet
Radix-4 and Radix-8 Multiplier Using Verilog HDL: (Ijartet) Vol. 1, Issue 1, September 2014
6 pages
Radix-4 and Radix-8 Multiplier Using Verilog HDL
No ratings yet
Radix-4 and Radix-8 Multiplier Using Verilog HDL
6 pages
Paper M
No ratings yet
Paper M
10 pages
Unit 1 Coa
No ratings yet
Unit 1 Coa
52 pages
Implementation of High-Speed Modified Radix-8 Booth Multiplier For Signed and Unsigned Numbers
No ratings yet
Implementation of High-Speed Modified Radix-8 Booth Multiplier For Signed and Unsigned Numbers
9 pages
International Journal of Computational Engineering Research (IJCER)
No ratings yet
International Journal of Computational Engineering Research (IJCER)
11 pages
Booth Algorithm for Tech Enthusiasts
No ratings yet
Booth Algorithm for Tech Enthusiasts
6 pages
Literature Survey: 2.1 Background of The Project
No ratings yet
Literature Survey: 2.1 Background of The Project
5 pages
High-Speed Signed Multiplier Design
No ratings yet
High-Speed Signed Multiplier Design
10 pages
A New Vlsi Architecture For Modi Ed
No ratings yet
A New Vlsi Architecture For Modi Ed
6 pages
Modified Booth Multiplier Performance
No ratings yet
Modified Booth Multiplier Performance
8 pages
A New Vlsi Architecture of Parallel Multiplier Based On Radix-4 Modified Booth Algorithm Using VHDL
No ratings yet
A New Vlsi Architecture of Parallel Multiplier Based On Radix-4 Modified Booth Algorithm Using VHDL
8 pages
Efficient Design of FIR Filter Using Modified Booth Multiplier
No ratings yet
Efficient Design of FIR Filter Using Modified Booth Multiplier
5 pages
Design of Low Power Approximate Radix 8 Booth Multiplier IJERTCONV5IS17004
No ratings yet
Design of Low Power Approximate Radix 8 Booth Multiplier IJERTCONV5IS17004
5 pages
Project8 Team1
No ratings yet
Project8 Team1
23 pages
Booth Algorithm for Engineers
No ratings yet
Booth Algorithm for Engineers
21 pages
Lpvlsi Unit-4
100% (2)
Lpvlsi Unit-4
37 pages
Vlssi Design Project New
No ratings yet
Vlssi Design Project New
4 pages
Design of Modulo 2 - 1 Multiplier Based On Radix-8 Booth Algorithm Using Residue Number System
No ratings yet
Design of Modulo 2 - 1 Multiplier Based On Radix-8 Booth Algorithm Using Residue Number System
8 pages
Adobe Scan 03 Dec 2024
No ratings yet
Adobe Scan 03 Dec 2024
25 pages
Low Power Booth Multiplier Design
No ratings yet
Low Power Booth Multiplier Design
6 pages
A Comparative Study of Different Multiplier Designs PDF
No ratings yet
A Comparative Study of Different Multiplier Designs PDF
4 pages
Mux Implementation of Bec-1 Based Pipelined Vedic Mac Using Han Carlson Accumulator
No ratings yet
Mux Implementation of Bec-1 Based Pipelined Vedic Mac Using Han Carlson Accumulator
94 pages
Design FF Low Power Multiplier Unit Using Wallace Tree Algorithm IJERTV9IS020069
No ratings yet
Design FF Low Power Multiplier Unit Using Wallace Tree Algorithm IJERTV9IS020069
5 pages
Reference 7
No ratings yet
Reference 7
4 pages
Computing Multiplier Analysis
No ratings yet
Computing Multiplier Analysis
4 pages
Project8 Team1
No ratings yet
Project8 Team1
22 pages
4.1 Unsigned Binary Multiplication: Digital Computer Arithmetic Datapath Design
No ratings yet
4.1 Unsigned Binary Multiplication: Digital Computer Arithmetic Datapath Design
5 pages
DSP Notes Unit1 and 2
100% (1)
DSP Notes Unit1 and 2
45 pages
A Review Paper On Different Multipliers Based On Their Different Performance Parameters
No ratings yet
A Review Paper On Different Multipliers Based On Their Different Performance Parameters
4 pages
Study of Combinational and Booth Multiplier: Neha Goyal, Khushboo Gupta, Renu Singla
No ratings yet
Study of Combinational and Booth Multiplier: Neha Goyal, Khushboo Gupta, Renu Singla
4 pages
Computer Organization and Architecture: UNIT-2
No ratings yet
Computer Organization and Architecture: UNIT-2
29 pages
DSP Architecture & Programming
No ratings yet
DSP Architecture & Programming
10 pages
Booth Multiplier On 23 06 10
No ratings yet
Booth Multiplier On 23 06 10
25 pages
Booth Algorithm for Signed Multiplication
100% (1)
Booth Algorithm for Signed Multiplication
5 pages
COA Unit 2
No ratings yet
COA Unit 2
25 pages
Coa M3
No ratings yet
Coa M3
66 pages
Worksheet 1 Distance Time
No ratings yet
Worksheet 1 Distance Time
7 pages
Min-Sum/offset-Min-Sum Algorithm Is Proposed That Supports Both Irreg
No ratings yet
Min-Sum/offset-Min-Sum Algorithm Is Proposed That Supports Both Irreg
5 pages
Personal Data: (Please Complete All Fields Write "N/A" If Not Applicable)
No ratings yet
Personal Data: (Please Complete All Fields Write "N/A" If Not Applicable)
4 pages
Carbon Nanotubes For VLSI: Interconnect and Transistor Applications
No ratings yet
Carbon Nanotubes For VLSI: Interconnect and Transistor Applications
2 pages
Measur Ments
No ratings yet
Measur Ments
7 pages
Analysis of Crosstalk in Single-And Multi-Wall Carbon Nanotube Interconnects and Its Impact On Gate Oxide Reliability
No ratings yet
Analysis of Crosstalk in Single-And Multi-Wall Carbon Nanotube Interconnects and Its Impact On Gate Oxide Reliability
9 pages
Keltron Introduction
No ratings yet
Keltron Introduction
1 page
Csir Key For Dec END2012K. Enjoy The Download Version
No ratings yet
Csir Key For Dec END2012K. Enjoy The Download Version
1 page
197
No ratings yet
197
113 pages
Try To Find Answer For The Following Questions in One Paragraph. The Mark For One Question Is 2 Submission Date Is
No ratings yet
Try To Find Answer For The Following Questions in One Paragraph. The Mark For One Question Is 2 Submission Date Is
1 page
Mems
No ratings yet
Mems
16 pages
Data Types and Variables-: Int Int Int Sizeof
No ratings yet
Data Types and Variables-: Int Int Int Sizeof
114 pages
Samuel Martinez: Professional Summary
No ratings yet
Samuel Martinez: Professional Summary
3 pages
CSC248 Final Report
No ratings yet
CSC248 Final Report
24 pages
Testing Webtestclient
No ratings yet
Testing Webtestclient
7 pages
Chapter 3 Brute Force Student
No ratings yet
Chapter 3 Brute Force Student
11 pages
Xv6 Process and Cpu Scheduling
No ratings yet
Xv6 Process and Cpu Scheduling
31 pages
Sobel 3x3 Image Filter Implementation
No ratings yet
Sobel 3x3 Image Filter Implementation
3 pages
Q Hardware Manual en
No ratings yet
Q Hardware Manual en
364 pages
97 Syllabus B SC Computer Science
No ratings yet
97 Syllabus B SC Computer Science
32 pages
Topic4 ERM
No ratings yet
Topic4 ERM
58 pages
Lastexception 63670259058
No ratings yet
Lastexception 63670259058
25 pages
Java Training Report
No ratings yet
Java Training Report
22 pages
Namma Kalvi 12th Computer Science 1 Mark Question Paper em 216956
No ratings yet
Namma Kalvi 12th Computer Science 1 Mark Question Paper em 216956
14 pages
ISYS6508 Database System: Week 9 Semi-Structured Data and XML
No ratings yet
ISYS6508 Database System: Week 9 Semi-Structured Data and XML
40 pages
L-2.1.3 Swapping, Fragmentation - Compaction
No ratings yet
L-2.1.3 Swapping, Fragmentation - Compaction
13 pages
Software Development Fundamentals: Deskripsi
No ratings yet
Software Development Fundamentals: Deskripsi
2 pages
Coa Unit - 5 Notes
No ratings yet
Coa Unit - 5 Notes
6 pages
Essential Guide To Data Science For Petroleum Engineers
No ratings yet
Essential Guide To Data Science For Petroleum Engineers
150 pages
Event Driven Programming Mock Exam Questions
No ratings yet
Event Driven Programming Mock Exam Questions
5 pages
Python Programing Microproject
No ratings yet
Python Programing Microproject
16 pages
JAVA Environment
No ratings yet
JAVA Environment
4 pages
4.1 Using Libraries in Python
No ratings yet
4.1 Using Libraries in Python
11 pages
Design Patterns Activities
No ratings yet
Design Patterns Activities
6 pages
Time Model Query's
No ratings yet
Time Model Query's
4 pages
4 Formula - Script Editor
No ratings yet
4 Formula - Script Editor
54 pages
8086 Assembly Programs Collection
No ratings yet
8086 Assembly Programs Collection
54 pages
Lampiran Source Code
No ratings yet
Lampiran Source Code
16 pages
PDF
No ratings yet
PDF
6 pages
C++ Data Types & Variables Guide
No ratings yet
C++ Data Types & Variables Guide
16 pages
Clink Settings
No ratings yet
Clink Settings
2 pages

Thesis Phase 1 Report

Uploaded by

Thesis Phase 1 Report

Uploaded by

Thesis Phase 1 report

Thesis Phase 1 report

Thesis Phase 1 report

Thesis Phase 1 report

Derivation of MAC Arithmetic

Thesis Phase 1 report

Radix 2 based booth algoritm

Thesis Phase 1 report

Example based on radix 2 algorithm:

Radix 4 based booth algorithm

Thesis Phase 1 report

WORK DONE SO FAR

You might also like