Fir Da
Fir Da
Low power low area VLSI implementation of adaptive FIR filter using DA for
decision feed back equalizer
K. Vijetha ∗, Rajendra Naik B.
University College of Engineering Osmania University, Hyderabad, Telangana, India
Keywords: In this paper, a low-complexity pipelined adaptive FIR filter is designed using Distributed Arithmetic (DA)
Pipelined architecture for signal processing applications. Generally adaptive filters will occupy more area and power
LMS adaptive FIR filter consumption because of using memories in the filters for partial product (PP) generation. To get rid of this,
Distributed Arithmetic
We use the pipeline concept to reduce the registers in the filters and also to reduce the area further compressor
Compressor adder
adders are used in the adaptive filter architectures instead of using normal adders. With these two concepts the
area and power consumption of the adaptive filters will be reduced. The proposed design is coded in Verilog
HDL language and synthesized in Synapsis design compiler tool with SAED 90 nm technology for finding the
area, power, minimum sampling period, maximum sampling frequency, area delay product (ADP), power delay
product (PDP). By using proposed adaptive filter we can design and implement higher order filters more easily
and also the complexity of the proposed design is very less when compared with the existing designs. When
we observe the synthesis results the proposed design will occupy 30% less area when compared with the two
memories based existing architecture. Also the power consumption is 25% less when compared with the block
based adaptive filters. The ADP and PDP of the proposed design is very less when compared with existing
architectures. The proposed design is well suited for signal processing application designs such as adaptive
decision feed back equalizers for removing the signal noises and inter symbol interference, hearing aids, ECG
signal analysis and software defined radio.
1. Introduction and adders are commonly used in any filters. Multipliers will take up
more space and use more energy.
In signal processing applications such as echo cancellation [1], Multiplier architectures are replaced by multiplier-less architectures
software defined radio, digital communications an adaptive filter, is in adaptive FIR filters to save area and power consumption [3]. To
essential since only a limited amount of a priori knowledge of signal enhance the speed and reduce power, a number of multipliers-less
properties is available. Fig. 1 depicts the fundamental schematic of adaptive FIR filter topologies are available. The Distributed Arithmetic
an adaptive filter. It consist of multipliers, delay elements and adders. model [4,5], is a well-known multiplier-free design for more effi-
When we observe the architecture multipliers will occupy more area cient adaptive filter implementation. Croisier [6] was the first author
and power consumption. The linear combiner, which forms the filter to propose DA concept, and further it was enhanced developed by
output 𝑦, remains at the centre of the architecture. The adaptive filters Peled [7]. Zohar [8] was in charge of the mathematical document
will change the filter coefficients to bring them closer to their ideal work. White [9] proposed the formulation of basic DA. NagaJyothi [10]
values. The filter coefficient can be updated with a variety of adaptive discussed various types of DA-based FIR filters. Allied suggested two
techniques [2]. FIR and IIR are two types of adaptive filters. Because of adaptive DA-based FIR filters based on LUTs [11]. Guo suggested an
their natural stability and ease of computation, adaptive FIR filters are adaptive FIR filter using DA [12], however it is only appropriate for
more advantageous than IIR filters. Adaptive filters come in a variety of FIR filters of lower order. Jyothi presented a DA-based adaptive FIR
shapes and sizes. Least mean square adaptive design, Recursive adap- filter without memory [13]. A traditional multiplier consists of two
tive design, Normalized adaptive filter are the three main algorithms major steps: The key topics of this study effort were multiplier less
present. Because filter coefficients may be quickly adjusted, the least architecture, implications, and solutions to better adaptive FIR filter
mean square adaptive method is a better choice. Multipliers, shifters topologies [14]. A tutorial survey on DA has been proposed by GN
∗ Corresponding author.
E-mail address: vijethakura@gmail.com (K. Vijetha).
https://doi.org/10.1016/j.micpro.2022.104577
Received 5 November 2021; Received in revised form 26 April 2022; Accepted 3 June 2022
Available online 11 June 2022
0141-9331/© 2022 Elsevier B.V. All rights reserved.
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
Jyothi which is useful for the DSP applications. Many multipliers less architectures have a second clock period. The use of two separate clocks
LUT based and LUT less DA architecture has been discussed. Also in this reduces power consumption. When compared to a traditional DA-based
author has explained how multiplier less architecture is more useful design, this architecture reduces the number of adders by half. This
than the multiplier based architecture. architecture also allows for LUT updating in parallel, as well as filtering
Decision feedback equalizers, hearing aids, channel equalizers, noise and coefficient update in parallel. Baghel and Shaik [21] developed a
and echo cancellation, and beam shaping are all examples of signal DA-based solution for implementing a block LMS adaptive filter on an
processing applications that use adaptive filters [15]. FPGA. For the block LMS (BLMS) adaptive filter.
Mohanty et al. [22] developed a DA-based construction. A set of
2. Literature survey
LUTs, input samples, and coefficient updates are used in this structure.
They developed a new LUT update mechanism that does not move the
Allred et al. [16,17] developed LMS adaptive filter design using DA.
LUT content. Left shifting weight vectors over an array of LUTs achieves
The author had taken 2 unique LUTs for filter output & coefficient
column-wise right shift of LUTs from one LUT to the other LUT [22].
update. The architecture is made up of three sections. The DA filter is
This method saves both time and power. LUT optimization is impossible
the first module, and it conducts filtering operations on both the current
with this strategy. Mohanty offer a LUT optimized design for DA based
data sets and the LUT’s existing weights.
Guo and De Brunner [18,19] proposed a useful approach for storing BLMS adaptive filter.
the sums of delayed and scaled signals in the LUT that uses coefficients Tang et al. has been explained FIR filter using booth encoding mul-
as addresses. With this technique, both the updating of the LUT and the tiplier and Wallac tree adder for decreasing the area and power. Chen
execution of the coefficients can be done at the same time. They also and Chiueh discussed CSD based reconfigurable FIR filter where the fil-
devised a method for saving data in the LUT in OBC format rather than ter coefficients can be changed during simulation. The other multipliers
binary. less conversion based methods are the Common sub-expression elimi-
Meher et al. [20] developed a DA-based adaptive filter. Carry save nations (CSE), programmable shift method and constant shift method.
accumulation replaces binary DA’s adder-shift accumulation. It has 2 In this the filter coefficient inputs are mentioned in binary form. But
separate clock periods.CSA has one clock and all other circuits in the these are useful for the lower order filters [23].
2
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
Prakash [24] proposed a new OBC DA-based FIR filter technique. high rate of throughput. Jiang et al. [25] described an energy-efficient
There are two LUTs in the design. The offset binary input samples are FIR adaptive filter based on approximate DA design. To decrease the
kept in one LUT, while the modified filter coefficients are stored in amount of partial products (PP) in the filter, the radix-8 booth multi-
another. The parallel up-dation of input samples and filter coefficients plier is used. To speed up the design, a Wallac tree adder is employed
can be accomplished using two LUTs. As a result, the design has a instead of a regular adder.
3
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
In signal processing applications where only a rudimentary under- Berberidis et al. developed a new block adaptive DFE in [26], which
standing of signal qualities is available, such as noise cancellation, is mathematically identical to the traditional LMS-based sample-by-
channel equalizers, and so on, an adaptive filter is needed. sample DFE but requires significantly less computational effort. Parhi
4
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
∑
𝐿−1
𝑥𝑘 = −𝑏𝑘0 + 𝑏𝑘𝑙 2−𝑙 (6)
𝑙=1
𝐷(𝑘) = desired signal Fig. 4 shows a DA table with 16 delay elements that can create
𝑋(𝑘) = input signal partial results for the adaptive filter construction.
5
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
3.4. Carry save accumulator 4. Proposed compressor adder-based pipelined DA-based adaptive
FIR filter
Shift and accumulation can be used to compute the inner products The inner products of a basic DA-based adaptive FIR filter are
in L clock cycles, followed by LUT-read operations corresponding to L calculated using a DA table with registers and adders. In Fig. 3, the
number of bit slices dkl for 0 l L1. This shift accumulation is done with a DA table’s inner product is shown. The register has an input of length
CSA because the most crucial path is included in the shift accumulation. L and an output of length Y(k). To get the value x(k), x(k) is sent
6
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
The input signal X(k) is passed to different 3 D-FFs and yields the
outputs of 𝑋(𝑘 − 3), 𝑋(𝑘 − 2), 𝑋(𝑘 − 1) signals which are passed to the
weight increment block (WIB) as shown in Fig. 5. It consist of 4 barrel
shift (BS) registers, 4 adder/subtractor, 4 registers and a word parallel
bit converters. The BS will shift the values according to the control
signal t. The control signal shifts the data one or more times according
to the condition.
The BS output will be added or subtracted depending upon the filter
coefficient values. The sign bit will control the adder/subtractor block
by using error bit as controller. If sign bit is ‘0’ the BS output will
be added with current filter coefficient data and if sign bit is ‘1’ then
BS output will be subtracted from the filter coefficient and are these
register values are send to the word parallel bit serial device to get
final output. This output will act as a selection line for the DA table
generated in the 4-point inner product block.
This process will be repeated until the error will get nullify and
which is noise will removed by using adaptive filter. In this process
for adding outputs we use compressor adders such that the number
of gates used for designing of adders will be reduced so that the area
consumption for full adder will be reduced further. In similar manner
for higher order filters the higher order filters are sub divided into
smaller order filters such that the large memory usage will be reduced.
7
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
4 pipelined DA block, 3 proposed compressor adder block. The 4- were listed in Table 1. In this the proposed and existing works are
WIB blocks will give 4 selection lines to the 4 different pipelined DA designed using SAED90 nm technology. The filter taps of 16, 32 and
blocks. The outputs from DA block will give to the 3 different proposed 64 of existing designs and proposed designs are shown in Table 1.
compressor blocks to get final output. This final output is cross verified When we observe the Table 1 the synthesis results of author Debrun-
by the desired signal and is again given to the WIB block. This process ner [11] has high power consumption and it has high complex structure
is repeated until errors will be nullified. when compared with the proposed design. Also the author Guo [12]
proposed architecture has high mathematical calculation and difficult
5. Simulation results to understand and the hardware complexity and area of the design is
high. The area occupied by the design is very high when compared with
The proposed design was programmed in Verilog HDL language and the proposed design. Ref. [25] has occupied more area when compared
synthesized in ASIC in Synapsis design compiler tool, and the results with proposed design. In Ref. [25] they used approximate DA adaptive
8
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
Table 1
ASIC implementation results of proposed and existing results.
Design Order (K) MSP (ns) Speed Area μm2 Power ADP PDP (mwXns)
16 24.05 50.600 143377.333 1.52491 3441048 36.6738
Ref. [11] 32 27.24 46 267146.333 2.90620 7277057 79.1648
64 29.78 40 537053.633 5.3110 15993438 158.16158
16 25.80 77 107520.8 0.7555 2774016 12
Ref. [12] 32 31.80 71 209082 1.4598 6648807 46.42
64 35.80 64 416418 2.7315 14907770.49 97.7877
16 7.09 141.04 88719 2.917 629617.71 20.68153
Ref. [25] 32 8.31 120.33 177460 5.834 1474692.60 48.48054
64 9.6 104.167 364820 11.67 3502272.00 112.032
16 11.8 84.745 41946 0.56 494962.80 15.694
Proposed design 32 12.05 82.987 85262 1.0112 1027407.00 20.42475
64 13.11 76.278 170678 2.0127 2237588.58 42.6704
area = μm2 , ADP = area delay product, power = mW, MSP = Minimum Sampling Period, PDP = Power delay product (mWXns), Speed = MHZ.
Table 2
Hardware complexity of the proposed architectures.
Design Throughput Hardware
Adders Shifters Registers LUT
Alled et.al [11] 1/[c1(𝑇𝑅 + 𝑇𝐴 + 3𝑇𝑀 )] 1+3Q Q Q+K+2 2𝑃 +1 𝑄
Guo et.al [12] 1/c2(𝑇𝑅 + 𝑇𝐴 ) N+1+2Q K 3K+1+2Q 2𝑃 𝑄
Meher [22] 1/L(4𝑇𝑀 + 𝑇𝐹 𝐴 + 𝑇𝐷 ) (3.2P-1+1)Q Q(2P-1) (1+2P-1)3Q+K+1 *
Park [20] 1/L(4𝑇𝑀 + 𝑇𝐹 𝐴 + 𝑇𝐷 + +𝑇𝑋𝑂𝑅 ) (2+2P-1)Q+K K (3+2P)Q+3+2k *
Proposed 1/L(4𝑇𝑀 + 𝑇𝐷 + 𝑇𝐹 𝐴 ) 3Q K (3+2P)Q+3+2K *
𝑇𝑀 = Delay for 2:1 MUX; 𝑇𝑅 = Delay for LUT access and address generation; 𝑇𝐹 𝐴 = Delay for full adder; K = Filter order R = K/S; The no.
of adaptive filter blocks of length S; 𝑇𝑋𝑂𝑅 = Delay for XOR gate; 𝑇𝐷 = Delay for D-FF; 𝑇𝐴 = Delay for the bit adder.
filter which is used for specific application only. Around 50% area is
reduced when compared with Ref. [11] and 75% area is reduced when 𝑁𝑓 −1
∑
compared with the Ref. [12]. When comparing the area delay product ̂ =
𝑥(𝑘) ℎ𝑓 (𝑖)𝑥(𝑘 − 𝑖) (11)
(ADP), power delays product (PDP), and minimum cycle period (MCP) 𝑖=0
𝑁𝑏 −1
of the two DA architectures the proposed design will occupy less power ∑
̂ =
𝑟(𝑘) ℎ𝑏 (𝑗)𝑟(𝑘 − 𝑗) (12)
consumption and is shown in Fig. 9. Fig. 10 shows the area report of the
𝑗=0
present and existing architectures. The hardware architecture occupied
by the proposed design is very less when compared with other existing where
architecture. By using pipeline concept the area occupied by DA table
ℎ𝑇𝑓 = [ℎ0 , ℎ1 , ℎ2 , … ..ℎ𝑁𝑓 −1 ] (13)
is decreased and the area will be reduced further by using compressor
adders. The layout diagram is shown in Fig. 11. Table 2 shows the
hardware complexity of the proposed and existing architectures. The ℎ𝑇𝑏 = [ℎ0 , ℎ1 , ℎ2 , … ..ℎ𝑁𝑏 −1 ] (14)
speed of the proposed design is high when compared with Alled et al. The coefficient of FF and FB filters are ℎ𝑓 and ℎ𝑏 . 𝑁𝑏 and 𝑁𝑓 are
and Guo et al. architectures. The number of adders required is 3Q which number of FB and FF filter coefficients respectively. 𝑥(𝑘) and 𝑟(𝑘) with
is less when compared with other architectures. Also proposed design 𝑊 word length, 2′ s complementary form is:
needs K shift registers.
∑
𝑊 −1
𝑥(𝑘 − 𝑖) = 𝑥𝑖,𝑊 −1−𝑖 2−𝑖 − 𝑥𝑖,𝑊 −1 (15)
6. Application of proposed filter for DFE 𝑤=1
∑
𝑊 −1
In digital communications, transmitted signal prone to distortion 𝑟(𝑘 − 𝑗) = 𝑟𝑗,𝑊 −1−𝑗 2−𝑗 − 𝑟𝑗,𝑊 −1 (16)
due to additive noise in channel and multi-path propagation and makes 𝑤=1
the signal less reliable. The distortion can occur at any instant of time. On substituting Eq. (11) and Eq. (12) in Eq. (9) respectively we get :
To nullify these distortions in transmitted signals we go for adaptive
𝑁𝑓 −1 𝑁𝑏 −1
decision feed back equalizers. ∑ ∑
𝑆(𝑘) = 𝑥(𝑘 − 𝑖)ℎ𝑓 (𝑖) − ℎ𝑏 (𝑗)𝑟(𝑘 − 𝑗) (17)
𝑖=0 𝑗=0
6.1. Mathematical formulation of DFE
𝑁𝑓 −1 𝑁𝑏 −1
∑ ∑
= 𝑥(𝑘 − 𝑖)ℎ𝑓 (𝑖) + −ℎ𝑏 (𝑗)𝑟(𝑘 − 𝑗) (18)
Let us consider, adaptive DFE shown in Fig. 12, with 𝑥(𝑘) as input 𝑖=0 𝑗=0
signal, where 𝑘 ∈ 𝑍 with 𝑁𝑓 number of FF filter coefficients & output 𝑁𝑓 −1 𝑁𝑏 −1
decision 𝑟(𝑘) with 𝑁𝑏 number of FB filter coefficients. 𝑆𝑞𝑘 is the output ∑ ∑
= 𝑥(𝑘 − 𝑖)ℎ𝑓 (𝑖) + ℎ𝑏̄(𝑗)𝑟(𝑘 − 𝑗) (19)
decision for DFE: 𝑖=0 𝑗=0
̂ − 𝑟(𝑘)
𝑆(𝑘) = 𝑥(𝑘) ̂ (9) 𝑆(𝑘) = ℎ𝑇 𝑍 (21)
Where 𝐾 = 𝑁𝑓 + 𝑁𝑏 and
𝑟(𝑘) = 𝑆𝑞 (𝑘 − 1) (10) ̄ (𝑘), … .....ℎ𝑏,𝑁̄ (𝑘)]
ℎ𝑇 = [ℎ𝑓 ,0 (𝑘), … ...ℎ𝑓 ,𝑁𝑓 −1 (𝑘), ℎ𝑏,0 𝑏−1
9
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
𝑒(𝑘) = 𝑠𝑞 (𝑘) − 𝑠(𝑘) (27) [1] Stenger Alexander, Lutz Trautmann, Rudolf Rabenstein, Nonlinear acoustic echo
cancellation with 2nd order adaptive Volterra filters, in: Acoustics, Speech, and
The negative value of ℎ𝑏𝑝 (𝑘 + 1) is Signal Processing, 1999. Proceedings. 1999 IEEE International Conference on.
vol. 2, IEEE, 1999.
ℎ̄𝑏𝑝 (𝑘 + 1) = 𝜇̄𝑏 𝑒(𝑘)𝑟(𝑘 − 𝑝) + ℎ̄𝑏𝑝 (𝑘) + (28) [2] Thakor Nitish V., Y.-S. Zhu, Applications of adaptive filtering to ECG analy-
sisnoise cancellation and arrhythmia detection, IEEE Trans. Biomed. Eng. 38.8
Multiplying Eq. (25) and Eq. (28) with 𝑏𝑚𝑙 and 𝑏𝑝𝑙 respectively and (1991) 785–794.
introducing summation we get [3] Ghamkhari Seyedeh Fatemeh, Mohammad Bagher Ghaznavi-Ghoushchi, A new
𝑁𝑓 −1 𝑁𝑓 −1 𝑁𝑓 −1 low-power architecture design for distributed arithmetic unit in FIR filters
∑ ∑ ∑ implementation, Circuits Systems Signal Process. 33.4 (2014) 1245–1259.
𝑏𝑚𝑙 ℎ𝑓 𝑚 (𝑘 + 1) = 𝑏𝑚𝑙 ℎ𝑓 𝑚 (𝑘) + 𝜇𝑓 𝑒(𝑘) 𝑏𝑚𝑙 𝑥(𝑘 − 𝑚) (29)
[4] NagaJyothi Grande, Sriadibhatla SriDevi, Distributed arithmetic architectures for
𝑚=0 𝑚=0 𝑚=0
fir filters-a comparative review, in: 2017 International Conference on Wireless
𝑁𝑏 −1 𝑁𝑝 −1
∑ ∑ Communications, Signal Processing and Networking, WiSPNET, IEEE, 2017.
𝑏𝑏𝑝 ℎ̄𝑏𝑝 (𝑘 + 1) = 𝑏𝑚𝑙 ℎ̄𝑝𝑙 (𝑘)+ [5] Mohanty Basant K., Pramod Kumar Meher, Sujit K. Patel, LUT optimization for
𝑝=0 𝑝=0 distributed arithmetic-based block least mean square adaptive filter, IEEE Trans.
(30)
𝑁𝑏 −1 Very Large Scale Integr. (VLSI) Syst. 24.5 (2016) 1926–1935.
∑
𝜇̄𝑏 𝑒(𝑘) 𝑏𝑝𝑙 𝑟(𝑘 − 𝑝) [6] Croisier Alain, et al., Digital filter for PCM encoded signals. U.S. Patent No. 3,
𝑝=0
777, 130. 4 Dec. 1973.
10
K. Vijetha and Rajendra Naik B. Microprocessors and Microsystems 93 (2022) 104577
[7] Peled Abraham, Bede Liu, A new approach to the realization of nonrecursive [24] Prakash M. Surya, Rafi Ahamed Shaik, Sagar Koorapati, An efficient distributed
digital filters, IEEE Trans. Audio Electroacoust. 21.6 (1973) 477–484. arithmetic-based realization of the decision feedback equalizer, Circuits Systems
[8] Zohar halhav, A realization of the RAM digital filter, IEEE Trans. Comput. 25.10 Signal Process. 35.2 (2016) 603–618.
(1976) 1048–1052. [25] Jiang Honglan, et al., A high-performance and energy-efficient FIR adaptive filter
[9] White Stanley A., Applications of distributed arithmetic to digital signal using approximate distributed arithmetic circuits, IEEE Trans. Circuits Syst. I.
processing: A tutorial review, IEEE Assp Mag. 6.3 (1989) 4–19. Regul. Pap. 66.1 (2018) 313–326.
[10] NagaJyothi Grande, Sriadibhatla Sridevi, High speed and low area decision feed- [26] Berberidis Kostas, Thanasis A. Rontogiannis, Sergios Theodoridis, Efficient block
back equalizer with novel memory less distributed arithmetic filter, Multimedia implementation of the decision feedback equalizer, IEEE Signal Process. Lett. 5.6
Tools Appl. (2019) 1–15. (1998) 129–131.
[11] Allred Daniel J., et al., LMS adaptive filters using distributed arithmetic for high [27] Parhi Keshab K., Design of multigigabit multiplexer-loop-based decision feedback
throughput, IEEE Trans. Circuits Syst. I. Regul. Pap. 52.7 (2005) 1327–1337. equalizers, IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 13.4 (2005)
[12] Guo Rui, Linda S. DeBrunner, Two high-performance adaptive filter implemen- 489–493.
tation schemes using distributed arithmetic, IEEE Trans. Circuits Syst. II: Express [28] Lin Chi-Shiung, et al., Concurrent digital adaptive decision feedback equalizer
Brief. 58.9 (2011) 600–604. for 10GBase-LX4 ethernet system, in: 2007 IEEE Custom Integrated Circuits
[13] Jyothi Grande Naga, Sridevi Sriadibhatla, Asic implementation of low power, Conference, IEEE, 2007.
area efficient adaptive fir filter using pipelined da, in: Microelectronics, [29] Almogahed Abdullah, et al., Performance improvement of mode division multi-
Electromagnetics and Telecommunications, Springer, Singapore, 2019, pp. plexing free space optical communication system through various atmospheric
385–394. conditions with a decision feedback equalizer, Cogent Eng. 9.1 (2022) 2034268.
[14] Jyothi Grande Naga, Sriadibhatla Sridevi, Low power,low area adaptive finite [30] Han Ruigang, et al., Joint time-frequency domain equalization of MSK signal
impulse response filter based on memory less distributed arithmetic, J. Comput. over underwater acoustic channel, Appl. Acoust. 189 (2022) 108597.
Theor. Nanosci. 15.6-7 (2018) 2003–2008.
[15] Haridas Nisha, Elizabeth Elias, Efficient variable bandwidth filters for digital
hearing aid using farrow structure, J. Adv. Res. 7.2 (2016) 255–262.
[16] D.J. Allred, H. Yoo, V. Krishnan, W. Huang, D.V. Anderson, A novel high K Vijetha received her B.Tech and M.Tech degree in
performance distributed arithmetic adaptive filter implementation on an FPGA, Electronic and Communication Engineering from Jawaharlal
in: Proc. IEEE Int. Conf. Acoust. Speech, Signal Process, May, vol. 5, 2004, pp. Nehru Technology University, Andhra Pradesh, India, in
V–161–V–164. 2007 and 2010. She is working as assistant professor in
[17] D.J. Allred, H. Yoo, V. Krishnan, W. Huang, D.V. Anderson, LMS adaptive filters Electronics and Communication Engineering Department of
using distributed arithmetic for high throughput, IEEE Trans. Circuits Syst. I, Matrusri Engineering College , Hyderabad, Telangana, from
Reg. Pap. 52 (7) (2005) 1327–1337, Jul.. September 2012to till date . Currently, she is working
[18] R. Guo, L.S. DeBrunner, Two high-performance adaptive filter implementation- towards her Ph.D degree in VLSI System Design at Osmania
schemes using distributed arithmetic, IEEE Trans. Circuits Syst. II, Exp. Brief. 58 University, Hyderabad, India. Her research interest includes
(9) (2011) 600–604, Sep.. Low power IC design and VLSI DSP.
[19] R. Guo, L.S. DeBrunner, A novel adaptive filter implementation scheme using
distributed arithmetic, in: Conf. Rec. 45th ASILOMAR, Nov, 2011, pp. 160–164.
Prof. B. Rajendra Naik obtained his Bachelor’s degree
[20] Park Sang Yoon, Pramod Kumar Meher, Low-power, high-throughput, and low-
in Electronics and Communications Engineering from Na-
area adaptive FIR filter based on distributed arithmetic, IEEE Trans. Circuits Syst.
garjuna University, Master’s degree in Digital Systems and
II: Express Brief. 60.6 (2013) 346–350.
Doctorate degree from Osmania University. He has pub-
[21] Baghel Sudhanshu, Rafiahamed Shaik, FPGA implementation of fast block
lished over 60 research papers in National, International
LMS adaptive filter using distributed arithmetic for high throughput, in: 2011
Conferences and Journals. He has 18 years of teaching
International Conference on Communications and Signal Processing, IEEE, 2011.
and 12 years of research experience. Prof. B. Rajendra
[22] Mohanty Basant Kumar, Pramod Kumar Meher, A high-performance FIR filter
Naik joined in Osmania University in the year 2001. He is
architecture for fixed and reconfigurable applications, IEEE Trans. Very Large
Professor in Electronics and Communication Engineering,
Scale Integr. (VLSI) Syst. 24.2 (2015) 444–452.
University College of Engineering, Osmania University. His
[23] Tang Zhangwen, Jie Zhang, Hao Min, A high-speed programmable, CSD
research interests include VLSI signal processing, Signal
coefficient FIR filter, IEEE Trans. Consum. Electron. 48.4 (2002) 834–837.
integrity performance improvement and image processing.
11