
Machine Learning Framework for Early Power, Performance, and Area Estimation of RTL

AUTHOR:
Vijay Kumar Sutrakar
Aeronautical Development Establishment,
Defence Research and Development Organisation
Bangalore, India
vks.ade@gov.in

Presented by:

Anindita Chattopadhyay
Dept. of Electronics and Communication Engineering
BMS College of Engineering
Bangalore, India
anindita.lvs21@bmsce.ac.in

Md. Rahat Ahmed Khan
Student ID: 210929
Electronics & Communication Engineering Discipline
PRESENTATION OUTLINE

• Abstract
• Methodology
• Simple Operator Graph
• Time Modelling
• Power Modelling
• Area Modelling
• Dataset
• Result & Discussion
• Conclusion
• References
ABSTRACT

Focus
• Predict Power, Performance, and Area (PPA) at the RTL stage using HDLs like Verilog, before full synthesis.

Problem in Traditional VLSI Design
• Evaluating PPA requires full synthesis and layout of the RTL design.
• This takes hours to days and uses expensive EDA tools (such as Synopsys and Cadence).
• Late-stage issues are hard or costly to fix.
• Not ideal for rapid design iteration or early feedback.

Proposed Solution
• Pre-synthesis ML framework using RTL + library files.
• Key idea: Simple Operator Graph (SOG), a bit-level graph that mimics the post-synthesis design.
ABSTRACT

Results (147 RTL Designs)
• 98% – Worst Negative Slack (WNS)
• 98% – Total Negative Slack (TNS)
• 90% – Power
• Outperforms prior models.
METHODOLOGY

HDL → RTL Representation (H → R):
• The HDL is converted into a Simple Operator Graph (SOG).
• The SOG breaks the RTL down into small building blocks such as AND, OR, and NOT at the bit level.

Apply ML models:
Three machine learning models are used:
• Random Forest and XGBoost for time (performance),
• Graph Convolutional Networks (GCN) for power,
• A tree-based model for area.
All models learn from the SOG version of the design.
• This gives early feedback before synthesis. A toy sketch of the three feature views extracted from an SOG follows below.
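A minimal, self-contained sketch of that idea (the toy graph, operator names, and toggle numbers are invented for illustration, not taken from the paper). It builds a tiny SOG-like graph in Python and derives the three feature views the models would consume:

    import networkx as nx
    from collections import Counter

    # Toy SOG: nodes are bit-level operators; this stands in for the graph Yosys would emit.
    sog = nx.DiGraph()
    sog.add_edges_from([("a", "and1"), ("b", "and1"), ("and1", "not1"),
                        ("not1", "mux1"), ("c", "mux1"), ("sel", "mux1"), ("mux1", "out")])
    op_type = {"and1": "AND", "not1": "NOT", "mux1": "MUX"}

    # Time view: operator counts along each primary-input -> output path.
    paths = list(nx.all_simple_paths(sog, "a", "out"))
    time_features = [Counter(op_type[n] for n in p if n in op_type) for p in paths]

    # Power view: annotate each operator node with a (made-up) switching activity.
    nx.set_node_attributes(sog, {"and1": 0.12, "not1": 0.12, "mux1": 0.30}, "switching_activity")

    # Area view: global operator-type histogram fed to the tree-based area model.
    area_features = Counter(op_type.values())
    print(time_features, area_features)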
SIMPLE OPERATOR GRAPH (SOG)

What is SOG?
• SOG (Simple Operator Graph) is a bit-level representation of RTL code in which all operations are broken down into five fundamental logic operations: AND, OR, XOR, NOT, and 2-to-1 MUX. These operations form a graph, the Simple Operator Graph (SOG) [12-15].

How is SOG Created?
• RTL code → Yosys → Bit-level graph (SOG) [16]
• No full synthesis needed → faster and simpler (a sketch of such a flow follows below)

Why SOG Works Well
• Closer to post-synthesis → better PPA estimation
• Uniform input → works on many types of designs
• No optimization steps needed → saves time
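A minimal sketch of one way to produce such a bit-level graph with Yosys and load it in Python. This is an assumed flow, not the authors' exact script; the module name top, the file names, and the choice of Yosys passes are placeholders:

    import json
    import subprocess
    import networkx as nx

    # Elaborate the RTL and map it to simple gate-level cells; no full synthesis run.
    yosys_script = ("read_verilog top.v; hierarchy -top top; proc; flatten; "
                    "techmap; opt_clean; write_json sog.json")
    subprocess.run(["yosys", "-p", yosys_script], check=True)

    # Build a directed graph: one node per cell, edges follow shared net bits.
    with open("sog.json") as f:
        mod = json.load(f)["modules"]["top"]

    g = nx.DiGraph()
    drivers = {}                                   # net bit -> name of driving cell
    for name, cell in mod["cells"].items():
        g.add_node(name, op=cell["type"])          # e.g. $_AND_, $_OR_, $_XOR_, $_NOT_, $_MUX_
        for port, bits in cell["connections"].items():
            if cell["port_directions"].get(port) == "output":
                for b in bits:
                    drivers[b] = name
    for name, cell in mod["cells"].items():
        for port, bits in cell["connections"].items():
            if cell["port_directions"].get(port) == "input":
                for b in bits:
                    if b in drivers:
                        g.add_edge(drivers[b], name)
    print(g.number_of_nodes(), "operator nodes,", g.number_of_edges(), "edges")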
TIME MODELLING

What is Time Modelling?
• Time modelling in VLSI refers to the process of estimating how long signals take to propagate through a digital circuit.

Two Key Metrics:
❖ WNS (Worst Negative Slack): the worst delay beyond the clock deadline.
If a circuit has a path with:
• Time Constraint = 1 ns
• Actual Delay = 1.5 ns
Then:
Slack = 1 − 1.5 = −0.5 ns ← this is a negative slack
If this is the worst slack in the design, it is called WNS = −0.5 ns.
❖ TNS (Total Negative Slack): the total of all accumulated negative slack (delay violations). A small numeric sketch of both metrics follows below.
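A tiny illustration of slack, WNS, and TNS for a few hypothetical path delays (all numbers are made up):

    clock_period = 1.0                               # ns, the timing constraint
    path_delays = [0.8, 1.5, 1.2]                    # ns, actual delays of three paths

    slacks = [clock_period - d for d in path_delays] # [0.2, -0.5, -0.2]
    wns = min(slacks)                                # worst slack: -0.5 ns
    tns = sum(s for s in slacks if s < 0)            # sum of violations: -0.7 ns
    print(f"WNS = {wns:.1f} ns, TNS = {tns:.1f} ns")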
TIME MODELLING

Time Modelling Flow (SOG + ML):
1. Input: RTL → SOG (bit-level logic graph).
2. Analytical delay: assign a delay to each logic node.
3. Critical path extraction: trace the highest-delay paths using source/sink matching.
4. Delay propagation: compute the cumulative delay per path.
5. Feature extraction: count operations and total delay per path.
6. Random Forest: predict path-level delay.
7. XGBoost: predict global WNS and TNS.
8. Output: accurate timing insight at the RTL stage (WNS and TNS). A sketch of steps 5-7 follows below.
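A self-contained sketch of steps 5-7 on synthetic data. The per-operator delays, the feature layout, and the model hyperparameters are assumptions for illustration, not the paper's values:

    import numpy as np
    from sklearn.ensemble import RandomForestRegressor
    from xgboost import XGBRegressor

    rng = np.random.default_rng(0)
    OP_DELAY = {"AND": 0.10, "OR": 0.10, "XOR": 0.15, "NOT": 0.05, "MUX": 0.20}  # ns, assumed
    OPS = list(OP_DELAY)

    def random_path_features(n_paths):
        """Each path is summarized by its operator counts plus its analytical total delay."""
        feats, delays = [], []
        for _ in range(n_paths):
            counts = rng.integers(0, 8, size=len(OPS))
            analytical = float(np.dot(counts, [OP_DELAY[o] for o in OPS]))
            feats.append(np.append(counts, analytical))
            delays.append(analytical * rng.uniform(0.9, 1.3))   # "true" delay with noise
        return np.array(feats), np.array(delays)

    # Step 6: Random Forest predicts path-level delay from per-path features.
    X_path, y_path = random_path_features(500)
    rf = RandomForestRegressor(n_estimators=100, random_state=0).fit(X_path, y_path)
    print("predicted delays of three paths:", rf.predict(X_path[:3]))

    # Step 7: XGBoost predicts the global WNS from design-level features
    # (here: mean and max of each synthetic design's per-path feature vectors).
    clock = 1.0
    designs, wns_targets = [], []
    for _ in range(200):
        Xd, yd = random_path_features(30)
        designs.append(np.concatenate([Xd.mean(axis=0), Xd.max(axis=0)]))
        wns_targets.append(min(clock - yd.max(), 0.0))
    xgb = XGBRegressor(n_estimators=200, max_depth=4).fit(np.array(designs), np.array(wns_targets))
    print("predicted WNS of first design:", xgb.predict(np.array(designs[:1]))[0])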
POWER MODELLING

What is RTL Power Modelling?
• At the RTL (Register Transfer Level), power modelling means estimating how much power the digital design will consume before it goes through full synthesis or physical implementation.

Work Flow:
• Bit-level RTL representation (SOG)
• Switching activity of each node
• Graph Convolutional Networks (GCN) for learning power consumption directly from the structure
POWER MODELLING

Switching Activity = Dynamic Power Source:
• Power is consumed when signals toggle (0 ↔ 1).
• More toggles = more power used.

Bit-Level Power Annotation in SOG:
• Each node (AND, OR, XOR, etc.) in the SOG is annotated with its switching frequency.
• This enables precise bit-level power tracking (a small numeric sketch follows below).
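A toy illustration of why toggles drive power, using the standard dynamic-power estimate P_dyn ≈ alpha · C · V² · f summed over annotated SOG nodes (the node names and all numbers are invented):

    V = 1.1          # supply voltage (V), assumed
    f = 1e9          # clock frequency (Hz), assumed

    # node -> (switching activity alpha, effective load capacitance in farads)
    sog_nodes = {
        "and1": (0.12, 2e-15),
        "xor3": (0.30, 3e-15),
        "mux7": (0.25, 4e-15),
    }

    p_dyn = sum(alpha * c * V**2 * f for alpha, c in sog_nodes.values())
    print(f"estimated dynamic power: {p_dyn * 1e6:.2f} uW")   # ~2.59 uW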
POWER MODELLING

Power Prediction via GCN (Graph Convolutional Network):
• The SOG is fed into a Graph Convolutional Network.
• The GCN learns how gate types and their connections affect power.
• This accurately models power at the RTL stage. [12,14,15] (A minimal model sketch follows below.)
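A minimal sketch of such a model in PyTorch Geometric. The layer sizes, node features, and toy graph are assumptions for illustration; the paper's exact architecture may differ:

    import torch
    from torch import nn
    from torch_geometric.data import Data
    from torch_geometric.nn import GCNConv, global_mean_pool

    class PowerGCN(nn.Module):
        def __init__(self, in_dim=6, hidden=32):
            super().__init__()
            self.conv1 = GCNConv(in_dim, hidden)
            self.conv2 = GCNConv(hidden, hidden)
            self.head = nn.Linear(hidden, 1)          # graph-level power estimate

        def forward(self, data):
            x = torch.relu(self.conv1(data.x, data.edge_index))
            x = torch.relu(self.conv2(x, data.edge_index))
            x = global_mean_pool(x, data.batch)       # pool node embeddings per design
            return self.head(x).squeeze(-1)

    # Toy graph: 3 nodes (AND, XOR, MUX); features = one-hot of 5 op types + toggle rate.
    x = torch.tensor([[1, 0, 0, 0, 0, 0.12],
                      [0, 0, 1, 0, 0, 0.30],
                      [0, 0, 0, 0, 1, 0.25]], dtype=torch.float)
    edge_index = torch.tensor([[0, 1], [2, 2]])       # edges 0->2 and 1->2
    batch = torch.zeros(3, dtype=torch.long)          # all nodes belong to design 0

    model = PowerGCN()
    pred = model(Data(x=x, edge_index=edge_index, batch=batch))
    print("predicted power (untrained, arbitrary units):", pred.item())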
AREA MODELLING

What is Area?
• The physical silicon space a circuit occupies after fabrication.

Types of Area:
• Sequential logic area (e.g., flip-flops)
• Combinational logic area (e.g., AND, OR, MUX, etc.)

Sequential Area Estimation (Simple + Direct):
• Count the number of D flip-flops in the SOG.
• Get the area per flip-flop from the standard cell library (liberty file).
Formula:
• Sequential Area = Number of Flip-Flops × Area per Flip-Flop
• No machine learning required: just count and multiply (see the short sketch below).
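A short sketch of the sequential-area formula. The flip-flop count and per-cell area are made-up values in the style of a liberty file, not taken from the paper:

    dff_count = 1240            # number of D flip-flops counted in the SOG (assumed)
    dff_area_um2 = 4.522        # area of one DFF cell from the liberty file (assumed)

    sequential_area = dff_count * dff_area_um2
    print(f"sequential area = {sequential_area:.1f} um^2")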
AREA MODELLING

Combinational Area Estimation (ML-based):
• For each logic gate type in the SOG (AND, OR, MUX, ...):
  • Count its occurrences.
  • Multiply by its individual cell area (from the liberty file).
• Extract features such as gate counts, gate types, and SOG structure.
• Use a tree-based ML model (e.g., Random Forest) to predict the total combinational area (a sketch follows below).
• Total Area = Sequential Logic Area + Combinational Logic Area
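A self-contained sketch of such a tree-based area model on synthetic data. The cell areas, feature layout, and the optimization-shrink factor are assumptions, not the paper's values:

    import numpy as np
    from sklearn.ensemble import RandomForestRegressor

    rng = np.random.default_rng(1)
    CELL_AREA = {"AND": 1.33, "OR": 1.33, "XOR": 2.13, "NOT": 0.80, "MUX": 2.66}  # um^2, assumed

    # Features per design: gate counts plus the naive count*area estimate per gate type.
    def make_design():
        counts = rng.integers(100, 5000, size=len(CELL_AREA))
        naive = counts * np.array(list(CELL_AREA.values()))
        # "True" area: the naive sum shrunk by a design-dependent optimization factor.
        true_area = naive.sum() * rng.uniform(0.7, 0.95)
        return np.concatenate([counts, naive]), true_area

    X, y = zip(*(make_design() for _ in range(300)))
    model = RandomForestRegressor(n_estimators=200, random_state=0).fit(np.array(X), np.array(y))
    print("predicted combinational area of first design:", model.predict(np.array(X[:1]))[0])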
RESULT AND DISCUSSION

147 Optimized Circuits:
Different benchmark circuits from ISCAS’89 [18], ITC’99 [19], OpenCores [20], VexRiscv [21], RISC-V [22], NVDLA [23], and Chipyard [24] are used to predict their PPA.

Metric   R (Correlation)   MAPE (Error)
WNS      0.98              12%
TNS      0.98              24%
Power    0.92              <48%
Area     0.99              12%
CONCLUSION

• The framework performs moderately well on unoptimized designs.
• On optimized designs, it achieves high accuracy for WNS, TNS, power, and area.
• It is effective for early-stage RTL evaluation.
• Future work: improve the framework further for more complex designs and better scalability.
REFERENCES

[12] W. Fang et al., "MasterRTL: A Pre-Synthesis PPA Estimation Framework for Any RTL Design," in 2023 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), San Francisco, CA, USA, 2023, pp. 1–9.
[13] NanGate 45nm Open Cell Library. [Online]. Available: https://si2.org/open-cell-library/.
[14] N. Wu, H. Yang, Y. Xie, P. Li, and C. Hao, "High-level synthesis performance prediction using GNNs: Benchmarking, modeling, and advancing," in Proceedings of the 59th ACM/IEEE Design Automation Conference (DAC), 2022, pp. 49–54.
[15] E. Ustun, C. Deng, D. Pal, Z. Li, and Z. Zhang, "Accurate operation delay prediction for FPGA HLS using graph neural networks," in Proceedings of the 39th International Conference on Computer-Aided Design (ICCAD), 2020, pp. 1–9.
[18] F. Brglez, D. Bryan, and K. Kozminski, "Combinational profiles of sequential benchmark circuits," in IEEE International Symposium on Circuits and Systems (ISCAS), 1989, pp. 1929–1934.
[19] F. Corno, M. S. Reorda, and G. Squillero, "RT-level ITC'99 benchmarks and first ATPG results," IEEE Design & Test of Computers, 2000.

REFERENCES

[20] E. Ustun, C. Deng, D. Pal, Z. Li, and Z. Zhang, "Accurate operation delay prediction for FPGA HLS using graph neural networks," in Proceedings of the 39th International Conference on Computer-Aided Design (ICCAD), 2020, pp. 1–9.
[21] VexRiscv, "VexRiscv: A FPGA friendly 32 bit RISC-V CPU implementation," 2022. [Online]. Available: https://github.com/SpinalHDL/VexRiscv.
[22] "A 32-bit RISC-V processor for the mriscv project," 2017. [Online]. Available: https://github.com/onchipuis/mriscvcore.
[23] Nvidia, "NVIDIA Deep Learning Accelerator," 2018. [Online]. Available: http://nvdla.org/primer.html.
[24] A. Amid, D. Biancolin, A. Gonzalez, D. Grubb, S. Karandikar, H. Liew, A. Magyar, H. Mao, A. Ou, N. Pemberton et al., "Chipyard: Integrated design, simulation, and implementation framework for custom SoCs," IEEE Micro, vol. 40, no. 4, pp. 10–21, 2020.

THANK YOU
