0% found this document useful (0 votes)
42 views9 pages

Efficient Reduction of Area and Delay in Null Convention Logic Design Paradigm

The document describes an original algorithm for mapping logic expressions to NULL Convention Logic (NCL) gates. The algorithm begins with a multi-rail sum-of-products (SOP) expression and groups product terms. It selects the largest groups to minimize gates, but this can result in non-optimal grouping. The document then proposes a new methodology to improve the original algorithm and reduce area and delay of the NCL circuit design.

Uploaded by

Pushpa Mohan Raj
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
42 views9 pages

Efficient Reduction of Area and Delay in Null Convention Logic Design Paradigm

The document describes an original algorithm for mapping logic expressions to NULL Convention Logic (NCL) gates. The algorithm begins with a multi-rail sum-of-products (SOP) expression and groups product terms. It selects the largest groups to minimize gates, but this can result in non-optimal grouping. The document then proposes a new methodology to improve the original algorithm and reduce area and delay of the NCL circuit design.

Uploaded by

Pushpa Mohan Raj
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 9

Page 1

www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 406
Efficient Reduction of Area and Delay in
Null Convention Logic Design Paradigm
1. Introduction
With the fabrication process of integrated circuits developing in the deep sub micron, more function modules
can be integrated into a
small chip with higher clock rates. Even though higher clock rates increase logic speed, they also increase power
consumption and
cause a challenging clock distribution issue. Due to the difficult clock management of the whole system chip,
synchronous circuit
design faces many challenges such as clock skew, clock jitter, and power consumption [1], [2]. Therefore,
asynchronous clockless
circuit designs have been proposed as a solution to these clock issues. Asynchronous circuits have merits of low
power, anti-
interference, high robustness, and module reusability due to its clockless nature [3]. Thus, the asynchronous
circuits are very suitable
for systems-onchip (SOC) designs which are sensitive to power consumption and performance. NULL
Convention Logic (NCL) is
one of the promising candidates for asynchronous circuit design paradigms.
NCL circuit will work correctly regardless of when circuit inputs become available [4]; therefore, NCL circuits
are said to be correct-
by-construction. In other words, timing analysis is not required for correct operation. NCL circuits utilize the
dual-rail or quad-rail
logic to achieve delay-insensitivity. A dual-rail signal, D is comprised of two mutually exclusive wires, D0 and
D1, which might
assume any value from the set {Data0; Data1; Null; Invalid}, as shown in Table I. The Data0 state corresponds
to Boolean logic 0, the
Data1 state corresponds to Boolean logic 1, and the Null state corresponds to the empty set, meaning that the
value of D is not yet
available. NCL circuits are comprised of 27 fundamental gates [5], which constitute the set of all the function of
four or a few
variables [5]. Since each rail of an NCL signal is considered as a separate variable or value, a four variable
function is not the same as
a function of four literals, which would normally consist of eight variables or values, assuming dual-rail signals
[6].
D
D0
D1
Data0
1
0
Data1
0
1
Null
0
0
Invalid
1
1
Table I: Dual Rail Encoding
ISSN 2278 0211 (Online)
J. Nerenjana
PG Scholar, I.F.E.T College of Engineering, Villupuram, Tamil Nadu, India
M. Ananthakumar
Assistant Professor, I.F.E.T College of Engineering, Villupuram, Tamil Nadu, India
R. Malar
Associate Professor, I.F.E.T College of Engineering, Villupuram, Tamil Nadu, India
Y. Emilet Kethsiyal
PG Scholar, I.F.E.T College of Engineering, Villupuram, Tamil Nadu, India
Abstract:
Synchronous circuit designs face many challenges such as clock skew, clock jitter and power consumption.
Asynchronous
clockless circuit design is a solution to these clock issues. Asynchronous circuits have merits of low power, anti-
interference, high
robustness, and module reusability due to its clockless nature. NULL Convention Logic (NCL) is one of the
promising candidates
for asynchronous circuit design paradigms. NCL circuits are said to be correct-by-construction and delay
insensitive circuits. In
this paper, a new methodology for mapping multi-rail logic expressions to NULL convention logic (NCL) gate
library is proposed
to reduce area and delay of NCL circuits when compared to the existing algorithm.
Key words: synchronous, asynchronous, NCL circuits, mapping

Page 2
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 407
Figure 1: THmn Threshold Gate
NCL uses threshold gates with hysteresis for its basic elements [5]. The primitive type of threshold gate, shown
in Fig. 1, is the THmn
gate, where 1<= m <= n. Each THmn gate has n inputs, where at least m of n inputs must be asserted before the
output will become
asserted. In a THmn gate, each of the n inputs is connected to the rounded portion of the gates, and the gates
threshold value, m, is
written inside of the gate [8]. Another type of threshold gate is referred to as a weighted threshold gate,
expressed as
THmnWw1w2:::wR, where n is the number of inputs, m is the gates threshold, and w1w2:::wR are the weights
of input1, input2,...,
inputR, respectively. For example, TH34 has 4 inputs and threshold of 3. When any three inputs go high, its
output will be asserted to
high. Only when all inputs are low, the output will be reset to low. For all other input patterns, the output will
remain unchanged, and
they keep the previous state. A weighted gate TH34W2 has 4 inputs are labeled A, B, C, and D, shown in Fig. 2.
The weight of input
A, W(A), is, therefore, 2. Since the gates threshold, m, is 3, this implies that in order for the output to be
asserted, either inputs B, C,
and D must all be asserted, or input A must be asserted along with any other input, B, C, or D.
Figure 2: Th34w2 Threshold Gate: Z = AB+AC+AD+BCD
2. Original Mapping Algorithm
After several synthesis and optimization steps in [9], each combinational module in the initial synchronous RTL
design is finally
expressed as dual-rail SOP expressions describing its outputs in terms of its inputs. Then, each SOP expression
is separately
mapped to NCL gates using the grouping algorithm in [9], as shown in Fig. 3. The algorithm starts with a multi-
rail SOP expression as
input and then groups the product terms. Taking F1 = A0B1C1 + A0B1D1 + A1B1 + B1C0 + B1D0 + A1C1 as
an example, the first
step in the original grouping algorithm is to remove all four-variable terms from the initial SOP expression and
moved to the set of
final groups, because according to the NCL gate library, these terms cannot be grouped with any other terms to
make larger groups
and are always realized using TH44 gates. Since F1 has no four-variable term, no action is needed. Next, the
distinct variables in the
SOP expression are counted; if the number is less than 4, previously grouped terms can be potentially combined
with the SOP
expression to make 4-feasible group.
Table 2: Library of 27 Standard NCL Gates

Page 3
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 408
F1 contains 7 distinct variables, so the algorithm proceeds to the next decision box, where it is determined
whether the SOP
expression is empty or not. The next step is to create a k-feasibility table (Table III). Under each parent term in
the k-feasibility table,
the other terms that can be combined with the parent term such that their combination includes four or fewer
distinct variables are
listed. Now, starting from the larger parent terms and proceeding to the smaller ones, one parent term is picked
at a time, and all the
possible k-feasible groups are found by combining the parent term with the other terms listed under it. Note that
in the original
grouping method, when the terms are combined the largest possible groups are formed, and the temporary small
groups are discarded.
In order to reduce runtime, largest group is selected, and the rest are removed. Also, the selected group is added
to a search list to
avoid finding the same group for the other parent terms. It will be shown that this approach of selecting the
largest group can
potentially result in non-optimal grouping; the new grouping algorithm addresses this limitation.
The original grouping algorithm then proceeds with choosing the largest non-redundant groups from the set of
largest groups found
under each parent term, and moving them to the set of final groups. A group is considered non-redundant if it
does not share a
common term with any other group. Shared terms are not allowed in the set of final groups since they introduce
gate orphans and
therefore impact the delay insensitivity of NCL circuits. This issue arises because, when several gates share a
common term, there is a
possibility that all their outputs transition during the same DATA phase, but in that case only the fastest
transition is acknowledged by
the output and the other slower transitions are not acknowledged. In the original grouping method, non-
redundant groups are found as
follows: starting from the first group in the list, a group is selected and compared with the other groups; if it
shares a common term
with any other group, then the larger group is selected and the smaller group is discarded. This procedure is
continued until all the
smaller redundant groups are removed. The remaining groups, which are large and non-redundant, are moved to
the set of final
groups. The outputs of the current set of final groups are added as single-variable terms to the current SOP
expression at this step. In
the previous example, two groups already exist in the set of final groups. We will call the output of the first
group T1 and the output of
the second group T2. After adding T1 and T2 to the SOP expression, the new expression becomes B1D0+ T1 +
T2. At the end of the
third iteration, B1D0+ T1 + T2 is selected as a final group and is removed from the SOP expression.
A0B1C1
A0B1D1
A1B1
B1C0
B1D0
A1C1
A0B1D1
A0B1C1
B1C0
A1B1
A1B1
A1B1
A1B1
A1B1
B1D0
B1D0
B1C0
B1C0
B1C0
B1C0
A1C1
A1C1
A1C1
B1D0
B1D0
B1D0
A1C1
Table 3: K-Feasibility Table
A0B1C1
A0B1D1
A1B1
A0B1C1+A0B1D1
A0B1D1+ A1B1
A1B1+B1C0+ B1D0
A0B1C1+A1B1+ A1C1
A0B1D1+ B1C0
A1B1+B1C0+ A1C1
A0B1C1+B1C0
A0B1D1+ B1D0
A1B1+ B1D0+ A1C1
A0B1C1+B1D0
Table 4: K-Feasible Groups

Page 4
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 409
Figure 3: Original Mapping Algorithm
Table 5: Final K-Feasible Groups after First Iteration
A0B1C1
A0B1D1
A1B1
A0B1C1+A1B1+
A1C1
A0B1D1+
A1B1
A1B1+B1C0+
B1D0

Page 5
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 410
Figure 4: NCL Implementation
Table 5: Final Groups
3. Proposed Mapping Algorithm
Original mapping algorithm targets only area reduction. And it requires several iterations to complete. It is
referred as Multi-pass
grouping and inefficient grouping. It is computationally expensive and could be alleviated by engaging in less
computation in the main
loop. The original grouping method is also not efficient in terms of finding the largest non-redundant groups.
Finally, the approach of
adding the final groups as single variable terms to the initial SOP expression to make larger groups is not always
helpful. In fact, this
approach can potentially result in a larger circuit or increase the logic levels unnecessarily and consequently
make the final circuit
slower. The main idea of the new grouping algorithm is to sort the list of NCL gates based on a cost function
prior to grouping. At the
time of grouping, the gates occupying a higher position in the sorted list are considered more important and will
have a higher priority
in grouping. Targets both area and delay.
In summary, the algorithm begins with moving all the four variable terms to the set of final groups, just as in the
original grouping
method. Next, all k-feasible groups are found by using a k-feasibility table and combining all parent terms with
the other terms, as
discussed in the original grouping method. The k-feasible groups are then arranged into priority levels according
to Table. Next,
starting from the highest nonempty priority level, the non-redundant groups in each priority level are found,
their terms are removed
from all the other groups, and the priority levels are updated accordingly. This procedure continues until all
priority levels become
empty. For this example, the original grouping method and the proposed one produce the same grouping result,
but this is not always
the case as explained in the next section.
The proposed grouping method always preserves delay insensitivity of the initial dual-rail function. In order to
prove it, one must
prove that the proposed grouping method preserves input-completeness and observability. Several advantages of
the proposed
grouping algorithm, such as the ability to use different cost functions, the ability to map to a subset of NCL
gates, and one-pass
grouping, were pointed out. With the proposed grouping method, the non-redundant groups under a priority
level are selected
efficiently.
FINAL GROUPS
T1 = A0B1C1+A1B1+ A1C1
T2 = A0B1D1+ B1C0
F = B1D0+T1+T2

Page 6
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 411
Figure 5: Proposed Mapping Algorithm

Page 7
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 412
Table 6: Priority Table
Table 7: K-Feasible Groups
A0B1C1
A0B1D1
A1B1
B1C0
B1D0
A1C1
A0B1C1+ A0B1D1
A0B1D1+
A1B1
A1B1+B1C0+ B1D0
B1C0+ B1D0
B1D0
A1C1
A0B1C1+A1B1+A1C1
A0B1D1+
B1C0
A1B1+B1C0+ A1C1
B1C0
A0B1C1+B1C0
A0B1D1+
B1D0
A1B1+B1D0+ A1C1
A0B1C1+B1D0
A0B1D1
A1B1+B1C0
A0B1C1
A1B1+ B1D0
A1B1+A1C1
A1B1

Page 8
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 413
Table 8: K-Feasible Groups with Priority Levels
Table 9: Updating Priority Levels
Figure 6: NCL Implementation
Table 10: Final Groups
8 (Highest
priority)
10
11
12
14
16
19
A0B1C1+A1B1+
A1C1
A1B1+B1C0+
B1D0
A0B1C1+
A0B1D1
A0B1C1+B1C0
A1B1+B1C0
A0B1C1
A1B1
A1B1+B1C0+
A1C1
A0B1C1+B1D0
A1B1+B1D0
A0B1D1
B1C0
A1B1+ B1D0+
A1C1
A0B1D1+ A1B1
A1B1+A1C1
B1D0
A0B1D1+ B1C0
B1C0+B1D0
A1C1
A0B1D1+ B1D0
12
14
16
19
A0B1D1+ B1C0
B1C0+B1D0
A0B1D1
B1C0
A0B1D1+ B1D0
B1D0
FINAL GROUPS
T1 = A0B1C1+A1B1+ A1C1
T2 = A0B1D1+ B1C0
F = B1D0+T1+T2

Page 9
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 414
3. Experimental Results
Related Groups
Area
Delay Run time
1
F = A0B1 + A0C0 + A0D1 + B1C0 + A0D0 + B0C1D1
Original
X = A0B1 + A0C0 + A0D1 + B1C0 (TH44w322)
Y = B0C1D1 (TH33)
F = A0D0 + X + Y (TH24w22)
271
660
13.5
Proposed X = A0B1 + A0C0 + A0D0 + B1C0 (TH44w322)
Y = A0D1 + B0C1D1 (TH54w32)
F = X + Y (TH12)
254
434
13.8
2
F = B1C0D0 + C1D0 + A0B1 + A0C1 + A0D1 + B0C1
Original
X = A0B1 + A0C1 + C1D0 (THand0)
Y = A0D1 + B0C1 (THxor0)
Z = B1C0D0 + X (TH34w3)
F = Y + Z (TH12)
350
734
13.9
Proposed X = A0B1 + A0C1 + A0D1 (TH44w3)
Y = B1C0D0 + C1D0(TH54w32)
Z = B0C1 + X + Y (TH24w22)
293
704
29.7
3
F = A0B1C0 + A0 B0C1 + A1B1C0 + A1B0C1 + B1C0D1
Original
X = A0B1C0 + A1B1C0 (TH54w22)
Y = A0B0C1 + A1B0C1 (TH54w22)
Z = B1C0D1 + X (TH34w3)
F = Y + Z (TH12)
368
739
5.6
Proposed X = A0B1C0 + A1B1C0 (TH54w22)
Y = A0B0C1 + A1B0C1 (TH54w22)
Z = B1C0D1 (TH33)
F = X + Y + Z (TH13)
352
447
11.4
Table 11: Comparison of Original and Proposed Grouping Methods
After processing a sum of product (SOP) expression using the original and proposed mapping algorithm, final
groups are obtained as
discussed earlier. They are implemented using Modelsim 6.4 c and Xilinx ISE 13.2 software. Codes are written
in verilog HDL
language as they are user-friendly.
Figure 7: Proposed Method 1
Figure 8: Proposed Method 2

Page 10
www.ijird.com
February, 2014
Vol 3 Issue 2
INTERNATIONAL JOURNAL OF INNOVATIVE RESEARCH & DEVELOPMENT
Page 415
Figure 9: Proposed Method 3
4. Conclusion
A New mapping algorithm to reduce both area and delay of NULL convention logic circuits is proposed in this
paper, while original
mapping algorithm takes only area as a cost function. Constraints of original mapping algorithm are met by the
proposed algorithm. In
contrast to the original mapping algorithm, the new mapping algorithm can map a multi-rail expression to a
restricted subset of NCL
gates and can target any cost function.
5. References
1. E. Yahya and L. Fesquet,Asynchronous Design: A Promising Paradigm for Electronic Circuits and
Systems, The 16th
IEEE International Conference on Electronics, Circuits and Systems, p.339 (2009).
2. A.J. Martin and M. Nystrom,Asynchronous Techniques for System-on-Chip Design, Proceedings of the
IEEE, 6, p.1089
(2006).
3. A. Kondratyev, L. Sorensen, and A. Streich, Testing of Asynchronous Designs by Inappropriate Means:
Synchronous
Approach, Proc. IEEE Symp. Asynchronous Circuits and Systems (ASYNC 02), 2002, pp. 171-180, doi:
10.1109/ASYNC.2002.1000307.
4. Scott Smith, Jia Di.,Designing Asynchronous Circuits using NULL Convention Logic, Morgan &
Claypool, Aug 15, 2009,
ISBN: 9781598299816.
5. Gerald E. Sobelman and Karl M. Fant,CMOS Circuit Design of Threshold Gates with Hysteresis, IEEE
International
Symposium on Circuits and Systems (II), 1998, pp. 61-65.
6. Sankar, R.; Kadiyala, V.; Bonam, R.; Kumar, S.; Mohan, S.; Kacani, F.; Al-Assadi, W.K.; Smith,
S.C.,Implementation of
Static and Semi- Static Versions of a Bit-Wise Pipelined Dual-Rail NCL 2S Complement Multiplier, Region
5Technical
Conference, 2007 IEEE , vol., no., pp.228,233, 20-22 April 2007 doi: 10.1109/TPSD.2007.4380386.
7. Ho Joon Lee and Yong-Bin Kim,Low Power Null Convention Logic Circuit Design Based on DCVSL,
Department of
Electrical and Computer Engineering.
8. Yancey, S.; Smith, S.C.,A differential design for C-elements and NCL gates, Circuits and Systems
(MWSCAS), 2010 53rd
IEEE International Midwest Symposium on, vol., no., pp.632, 635, 1-4 Aug. 2010 doi:
10.1109/MWSCAS.2010.5548905.
9. B. Bhaskaran, Automated synthesis and cycle reduction optimization for asynchronous NULL convention
circuits using
industry-standard CAD tools, Ph.D. dissertation, Dept. Comp. Eng., Univ. Missouri - Rolla, Rolla, 2007.
10. I. David, R. Ginosar, and M. Yoeli, An efficient implementation of Boolean functions as self-timed
circuits, IEEE
Trans. Comput., vol. 41, no. 1, pp. 211, Jan. 1992.
11. S. K. Bandapati, S. C. Smith, and M. Choi, Design and characterization of convention self-timed
multipliers, IEEE Des.
Test Comput., vol. 20, no. 6, pp. 2636, Nov.Dec. 2003.
12. R. Reese, S. C. Smith, and M. A. Thornton, Unclean RTL approach to asynchronous design, in Proc.
18th IEEE Int.
Symp. Adv. Res. Asynchron. Circuits Syst., May 2012, pp. 6572

You might also like