0% found this document useful (0 votes)

461 views114 pages

Machine Learning Applications in Physical Design: Recent Results and Directions

The document discusses machine learning applications in physical design. It describes how machine learning can be used for modeling and prediction tasks to improve physical design outcomes and reduce iterations. Specifically, it provides two examples: 1) Using machine learning to predict design rule checking violations to avoid unnecessary design iterations. 2) Developing a machine learning model to predict the impact of local clock tree synthesis moves to reduce optimization runtime. The document advocates that machine learning can help address challenges in physical design related to unpredictability, margins, iterations, and turnaround time.

Uploaded by

Nitin G

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

461 views114 pages

Machine Learning Applications in Physical Design: Recent Results and Directions

Uploaded by

Nitin G

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 114

Machine Learning Applications in

Physical Design: Recent Results

and Directions

Andrew B. Kahng
CSE and ECE Departments
UC San Diego

http://vlsicad.ucsd.edu

A. B. Kahng, 180327 ISPD--2018

Agenda
• Crises…

A. B. Kahng, 180327 ISPD--2018 2

IC Industry Crises: Cost, Quality of Design
• Can’t afford to design chips (tools, people, time, risk)
• Return on investment for new technology is poor
• $$M to move to new node (28nm  14nm 10nm  7nm  …)
• Benefit from new node: ~20% power, speed, area (less, today)
• Design Capability Gap
• Available density
grows at 2x/node
• Realizable density
grows at 1.6x/node
• UCSD / 2013 ITRS

A. B. Kahng, 180327 ISPD--2018 3

IC Design Crises: Unpredictability, Schedule
• Many steps in long “design flow”  can we predict outcome?
• Many chicken-egg loops  convergence point? how to initialize?
• Nearly all problems are NP-hard
• Min-cut hypergraph bisection, Quadratic assignment,
Multicommodity flow, Max-weight independent set, Multi-
vehicle TSP, k-colorability, …
• Huge “n” metaheuristics piled on metaheuristics
• Suboptimality is expensive
• 10% of {power, speed, area} is half of benefit from new node
• Iteration is expensive
• Moore’s Law: 1 week = 1 percent
• Conservatism (“margin”) is expensive
• But: “oops” (didn’t fit, didn’t route, too slow) is unacceptable
A. B. Kahng, 180327 ISPD--2018 4
Unpredictability of Design
• Intractable optimizations  heuristics piled on heuristics
• “Noise” or “Chaos” when EDA tools “try hard”
• Unpredictability  added margin and schedule
14nm PULPino: area = 6% from freq = 10MHz !

Challenges: Schedule, Quality, Cost

A. B. Kahng, 180327 ISPD--2018 5
“The Last Semiconductor Scaling Levers”
• Quality
• Improved design tools and methods
• Reduced margins
• Schedule
• 1 week = 1%
• Cost
• IC design is expensive (engineers, tools, spins, …)

A. B. Kahng, 180327 ISPD--2018 6

Agenda
• Crises…
• … and a Vision

A. B. Kahng, 180327 ISPD--2018 7

Unpredictability of Design
• Intractable optimizations  heuristics piled on heuristics
• “Noise” or “Chaos” when EDA tools “try hard”
• Unpredictability  added margin and schedule
14nm PULPino: area = 6% from freq = 10MHz !

A. B. Kahng, 180327 ISPD--2018 8

Today’s SOC Design

# Partitions  Design Flexibility

Predictability Margins

# Iterations
Achieved
Design Quality
Turnaround Time

A. B. Kahng, 180327 ISPD--2018 9

Vision for Future SOC Design

Design Flexibility
# Partitions 

!
Predictability Margins
!
# Iterations
Achieved
Single-pass
! Design Quality
Turnaround Time

Mindsets
• Tools should not return unexpected results  Quality
• Achieve predictability from the user’s POV
 Schedule
• Use cloud/parallel to recover solution quality
• Focus on reducing design time, design effort
 Cost
Machine Learning will be a key piece of this …
A. B. Kahng, 180327 ISPD--2018 10
Agenda
• Crises…
• … and a Vision
• Machine Learning in PD

A. B. Kahng, 180327 ISPD--2018 11

Machine Learning in Physical Design
Problem types solved with Machine Learning
• Classification
• Regression
• Dimensionality reduction
• Structured prediction
• Anomaly detection
Past ML applications in EDA literature
• Yield modeling (anomaly detection, classification)
• Lithography hotspot detection (classification)
• Identification of datapath-regularity (classification)
• Noise and process-variation modeling (regression)
• Performance modeling for analog circuits (regression)
• Design- and implementation-space exploration (regression)

ML in PD: modeling, prediction, correlation, …

A. B. Kahng, 180327 ISPD--2018 12
Near-Term Opportunities
• Modeling and Prediction
• Predict tool outcome = F(design, constraints, tool config)
• How to run tool “optimally” for given design and design goals?
• Avoid “failed runs”  reduce iterations in design flow
• Dream: one-pass design flow
• Analysis Correlation
• Model analysis errors (crude vs. golden analyses)
• Reduced guardbands and pessimism  better design quality
• Optimization (ML models = objective functions!)
• ML models = objective functions for higher-level optimization
• Better use of resources (tools, schedule, engineers) + better tools
• Project-level prediction, adaptive scheduling
• Later: “Taxonomy and Roadmap”
A. B. Kahng, 180327 ISPD--2018 13
Agenda
• Crises…
• … and a Vision
• Machine Learning in PD
• Modeling and Prediction

A. B. Kahng, 180327 ISPD--2018 14

Example 1: Interface Between Global-Detailed Route
• 7nm P&R: global route (GR) congestion map does not
correlate well with post-route (actual) DRC violations (DRVs)
• Many false-positive overflows in GR congestion map
• False positives do not correspond to actual DRVs

GR Overflows Actual DRVs

GR-based prediction can mislead routability optimizations!!!

A. B. Kahng, 180327 ISPD--2018 15

Too Many Expensive Iterations
Conventional closure
• Iteratively fix design before
TECHNOLOGY
signoff
• Go back to placement or
synthesis or FP if QOR is
DESIGN RULES CONSTRAINTS
hopeless
• Costly iterations and TAT (7-
day P&R runs…)
RTL DESIGN SYNTHESIS
ISPD17: ML-based
PLACEMENT DRV predictor
X X
G/D ROUTING
X Iteration with space
padding,
ANALYZE QOR (AREA, WIRELENGTH, NDR modifications,
TIMING, #DRCs, YIELD) density screens ...

A. B. Kahng, 180327 ISPD--2018 16

Insight From Layout Studies
• Initial prediction from GR overflows and cell/pin density map
• Red DRV-hotspot likely a False Negative due to low cell-pin density
• Larger windows , buried nets (, NDRs, FFs, etc.) added to model inputs

Standard cells
Actual DRV
False-negative

Layout windows

Non-buried net

Sparse pins/cells Dense pins/cells

A. B. Kahng, 180327 ISPD--2018 17
Improved Learning-Based Predictor
• Captures all true-positive clusters
• Maintains low false-positive rate

Learning-based Prediction Actual DRVs

(a) (b) (c)

A. B. Kahng, 180327 ISPD--2018 18
ISPD17: Model-Guided Routability Opt

• New: True-Positive rate = 74%, False-Positive rate = 0.2%

• Previous: True-Positive rate = 24%, False-Positive rate = 0.5%
A. B. Kahng, 180327 ISPD--2018 19
Example 2: Local CTS Optimization Moves
• Iterative local moves to minimize skew variation
across corners
1. Displacement {N, S, E, W, NE, NW, SE, SW} by 10μm x
one-step sizing
2. Displacement by 10μm x one-step sizing on child buffer
3. Reassign to a new driver (i) at the same level, (ii) within
bounding box of 50μm x 50μm
10μm 10μm

... ... ... ...

... ... ...

... ... ... ... ... ...
(1) (2) (3)
 Each move is expensive (legalization, ECO routing, RC extraction, STA)
 Each buffer has many candidate moves
 DAC-15: learning-based model
A. B. Kahng, 180327 ISPD--2018 20
DAC15: CTS Outcome Prediction
• Predict driver-to-fanout latency change due to local moves

Local move 100%

%Buffers identified to
have the best move
80%
Analytical models 60%
Routing: FLUTE, STST
Cell delay: Liberty LUTs 40% Flute+ED
Wire delay: Elmore, D2M Flute+D2M
20% STST+ED
STST+D2M
0% Model
Delta delays 0 2 4 6 8 10 12
#Attempts

 Each attempt is a local move

Learning-based model  114 buffers
 45 candidate moves for each buffer
 Learning-based model identifies best moves
Delta delays for more buffers with less #attempts

A. B. Kahng, 180327 ISPD--2018 21

Example 3: Prediction of Doomed Runs?
• Some P&R runs end up with too many post-route DRVs
• Approach: track and project metrics as time series
• Markov decision process (MDP): terminate “doomed runs” early
• Shown: 4 example progressions of #DRVs (commercial router)
• Stopping red, yellow runs early would save resources and schedule !

A. B. Kahng, 180327 ISPD--2018 22

Markov Decision Process = “Strategy Card”
• State space from Fibonacci binning
• Actions – GO or STOP
• Rewards at each state – e.g., small negative reward for non-stop state, large
positive reward for stop with low #DRVs, etc.
• Automatically trained MDP “strategy card”: Yellow = GO, Purple = STOP

A. B. Kahng, 180327 ISPD--2018 23

Strategy Card “Completion”

A. B. Kahng, 180327 ISPD--2018 24

Promising Initial Studies
• TYPE 1 Prediction Error: MDP STOPs a run that will eventually succeed
• TYPE 2 Prediction Error: MDP predicts GO at each iteration, but run fails
• Training data: 1200 logfiles from PROBE experiments
• Testing data: 3442 logfiles from ARM Cortex M0 floorplan experiments
• Substantial #iterations saved for doomed runs (398 / 3442 cases)

Latest P&R tools have increased #iterations  larger benefit in future ?

Errors Training (Total = 1200) Testing (Total = 3442)

N = 200 Total #TYPE 1 Errors #TYPE 2 Total #TYPE 1 Errors #TYPE 2
Training (wrong STOP Errors Training (wrong STOP Errors
Error prediction) (no STOP) Error prediction) (no STOP)
1 STOP 29.66% 251 99 35.2% 1317 3
2 consecutive 10.5% 27 99 8.3% 307 3
STOPs
3 consecutive 8.5% 3 99 4.2% 154 3
STOPs

A. B. Kahng, 180327 ISPD--2018 25

Agenda
• Crises…
• … and a Vision
• Machine Learning in PD
• Modeling and Prediction
• Analysis Correlation

A. B. Kahng, 180327 ISPD--2018 26

ML Shifts the Accuracy-Cost Tradeoff Curve!

A. B. Kahng, 180327 ISPD--2018 27

Example 4: ML-based Timer Correlation
If
Outliers INCREMENTAL error >
(data points) threshol
DATE-2014 New
d

Designs
(+ SLIP-2015) MODELS
(Path slack, setup
Train Validate time, stage, cell,
Test
wire delays)

Artificial Real
Circuits Designs ONE-TIME

0,1 BEFORE AFTER

T2 Path Slack (ns)

-0,1

-0,2

-0,3 ML 31 ps
Modeling
-0,4 123 ps ~4 reduction
-0,5

-0,6
-0,6 -0,5 -0,4 -0,3 -0,2 -0,1 0 0,1
T1 Path Slack (ns) T1 Path Slack (ns)
A. B. Kahng, 180327 ISPD--2018 28
“SI for Free” with Machine Learning
Timing Reports in SI Timing Reports in
Mode Non-SI Mode • Machine learning of
Create Training, Validation and Testing Sets incremental transition
time, delay due to SI
ANN (2 Hidden Layers, SVM (RBF Kernel, 5-Fold
5-Fold Cross-Validation) Cross-Validation) • Accurate SI-aware
path delays, slacks
HSM (Weighted Predictions from ANN and SVM)

Save Model and Exit

Non-SI Path Slack (ns) ($)

BEFORE AFTER

Predicted Path Delay (ps)

81ps Worst absolute
error = 8.2ps
ML 8.2ps Average absolute
Modeling error = 1.7ps

SI Path Slack (ns) ($$$) Actual PathA. Delay 180327 ISPD--2018 29

B. Kahng,(ps)
Example 5: Predicting PBA from GBA?
• PBA (Path-Based Analysis) is less pessimistic than GBA
(Graph-Based Analysis)
• But, more expensive runtime !
• Question: Can we predict PBA timing from GBA timing?
•  Better optimization in P&R&Opt, less expensive STA

PBA ‐ GBA Slack Gain
GBA Mode
50

PBA Slack – GBA Slack (ps)
40

0
0 5000 10000 15000 20000 25000 30000
Endpoint Index

PBA Mode

A. B. Kahng, 180327 ISPD--2018 30

Costs of GBA vs. PBA Pessimism
GBA Actual PBA Actual Impact
Slack Slack
POSITIVE POSITIVE Power recovery can’t exploit usable
slack
NEGATIVE POSITIVE Schedule, Area, Power wasted fixing
false timing violations
NEGATIVE NEGATIVE Schedule, Area, Power waste from
over-fixing

PBA Actual PBA Predicted Impact

Slack Slack (Model)
HIGH LOW Power recovery can’t exploit all of
usable slack
LOW HIGH Masking of real violations

A. B. Kahng, 180327 ISPD--2018 31

Promising Initial Studies
• Early model with MARS (multiple adaptive regression
splines): 90% of predicted PBA slacks within 5ps
• Also: random forest classifier for 2-stage “bi-grams”
• Testcase: netcard, 28nm FDSOI
# EndPoints (Testing)

9000
8000
7000
6000
5000
4000
3000
2000
1000
0
‐40 ‐20 0 20 40
Error (ps) = Actual ‐ Predicted Bi-gram =2-stage unit in timing path
PBA Slack

A. B. Kahng, 180327 ISPD--2018 32

Example 6: Reduce Corners in STA, Opt !
• Want benefits of STA at N corners, using just M << N corners
• “Missing Corner Prediction” (“matrix completion”) saves runtime, licenses
• Avoids optimistic timing that is caught at detailed signoff, causing iteration

A. B. Kahng, 180327 ISPD--2018 33

Agenda
• Crises…
• … and a Vision
• Machine Learning in PD
• Modeling and Prediction
• Analysis Correlation
• Optimization

A. B. Kahng, 180327 ISPD--2018 34

Example 7: Design Cost Optimization
• Predictive models == Optimization objectives
• Enables schedule, resource optimizations up to enterprise level

A2 A5 A3
(3) (1) (3) Datacenter capacity
Usage (Across Three Projects)

A2 A4 A5
(1) (3) (1) Current servers
A4 A4 A4 A5
(1) (2) (1) (2)
A4 A3 A3 A3 A4 A5
(1) (1) (1) (2) (2) (2)
A3 A2 A3 A2 A3 A4 A4
(1) (1) (2) (2) (3) (3) (3)
A1 A1 A2 A1 A4 A3 A4 A4
(2) (2) (2) (3) (2) (2) (2) (3)
A1 A1 A2 A2 A1 A4 A2 A2 A3 A5 A5
(1) (1) (1) (2) (3) (1) (3) (3) (3) (3) (3)
20 22 24 26 28 30 32 34 36 38 40 42
Work Weeks

• TODAES 2017: Schedule Cost Minimization, Resource Cost

Minimization ILPs
• “How do I pack 12 tapeouts into my design center during Q4?
A. B. Kahng, 180327 ISPD--2018 35
Agenda
• Crises…
• … and a Vision
• Machine Learning in PD
• Modeling and Prediction
• Analysis Correlation
• Optimization
• A Roadmap

A. B. Kahng, 180327 ISPD--2018 36

Four Stages of ML Insertion in IC Design

1. Mechanization and Automation

2. Orchestration of Search and
Optimization
3. Pruning via Predictors and
Models
4. Reinforcement Learning and
Intelligence
Huge space of tool, command,
option trajectories through design
flow

A. B. Kahng, 180327 ISPD--2018 37

1. Mechanization and Automation
• Create “robot IC design engineers”
• Observe and learn from humans
• Search for command sequences in design tools
• Multi-Armed Bandit Problem: Given slot machine
with N arms, maximize reward obtained using T pulls
• Well-studied in context of Reinforcement Learning
• IC Design: “arm” = target frequency; “pull” = run flow
Tool Outcomes (Area,
Power, WNS/TNS)

Arms to
Sample Parallel
SAMPLER
Samples per Tool Runs
Constraints
Arm

Max
Frequency

DAC-18 session: “The Road to No-Human-in-the-

Loop IC Design” (UCSD, Qualcomm, Synopsys) A. B. Kahng, 180327 ISPD--2018 38
1. Mechanization and Automation
• Create “robot IC design engineers”
• Observe and learn from humans
• Search for command sequences in design tools
• Multi-Armed Bandit Problem: Given slot machine
with N arms, maximize reward obtained using T pulls
• Well-studied in context of Reinforcement Learning
• IC Design: “arm” = target frequency; “pull” = run flow

A. B. Kahng, 180327 ISPD--2018 39

2. Orchestration of Search and Optimization
• How to optimally orchestrate N robot
engineers?
• Concurrent search of N flow trajectories
• Explore, identify good flow options efficiently
• Constraint: compute and license resources
• Goal: best QOR within resource, risk limits
• Example strategy: “Go with the winners”
• Launch multiple optimization threads
• Periodically identify promising thread
• Clone promising thread and terminate others

A. B. Kahng, 180327 ISPD--2018 40

Another Example: “Adaptive Multi-Start”
• Optimization cost landscapes often have “big valley”
structures
• Best local minima are central to all other local minima
• Adaptive Multi-Start (AMS)
• Identify promising configurations
in current iteration
• Adaptively choose better
start points for next
optimization iteration

A. B. Kahng, 180327 ISPD--2018 41

3. Pruning via Predictors and Models
• Prediction of tool- and design-specific outcomes over
longer and longer subflows
• Wiggling of longer and longer ropes

A. B. Kahng, 180327 ISPD--2018 42

Example 8: Prediction of SRAM Timing Failure
• Multiphysics effects (IR drop, thermal, etc.) affect
timing closure
• Floorplanning with SRAMs is complicated
• P&R blockages
• Unpredictable post-P&R timing
• Goal: Early prediction of post-P&R slack (“doomed
floorplans”) to save schedule
• But estimating post-P&R timing at floorplan stage is
challenging:
• Wire delay estimate has no spatial embedding information
• Gate delay estimate has no buffering information

A. B. Kahng, 180327 ISPD--2018 43

Multiphysics Analysis is Difficult to Predict
• IR drop, thermal, reliability, crosstalk, etc.
• ASP-DAC 2016 (UCSD, Samsung): Can we predict
“risk map” for embedded memories at floorplan stage?
SRAM Slack (ps)

Implementation Index

A. B. Kahng, 180327 ISPD--2018 44

Floorplan Pathfinding with Machine Learning
• Filter bad floorplans (e.g., embedded memory
placements, power plans) comprehending
downstream PD flow
• Model f estimates combined effects of netlist,
constraints, placement, CTS, routing, optimization,
STA
Gate Netlist Constraints

Floorplan, Powerplan

Placement
Modeling

Extraction,
Scope

Timing
Clock network synthesis

Routing
Costly
Extraction, Timing, Iteration
Verification

Slack (w/, w/o IR) Signoff

A. B. Kahng, 180327 ISPD--2018 45
Modeling Techniques and Flow

Parameters from netlist Parameters from floorplan Slack reports from P&R,
sequential graph context, constraints multiphysics STA

Ground Truth

LASSO with L1 SVM with RBF ANN with 1 input, 2 Boosting with SVM
regularization kernel hidden, 1 output layer as weak learner

Combine using weights

Save model and exit

A. B. Kahng, 180327 ISPD--2018 46

Floorplan Pathfinding Model
• False negatives = 3%
• Pessimistic predictions  floorplan change that is
actually not required
• False positives = 4%
• Model incorrectly deems a floorplan to be good

Actual
Pass Fail
Pass
Predicted

False
584 42
positives
Fail

31 384

False negatives

A. B. Kahng, 180327 ISPD--2018 47

3. Pruning via Predictors and Models
• Prediction of tool- and design-specific outcomes over
longer and longer subflows
• Wiggling of longer and longer ropes
• Prune, terminate  avoid wasted design resources
• Better outcome within given resource budget
• Implicit: improved predictability and modelability
of heuristics and tools

A. B. Kahng, 180327 ISPD--2018 48

4. Reinforcement Learning and “Intelligence”
Many challenges on the road ahead…

• Latency and unpredictability of IC design tools/flows

• Can’t “play the IC design game” 100M times in 3 days
• “Small data” challenge with a big-data problem
• Data points are expensive
• Huge implementation space
• Tool versions, design versions, technology all changing
(pictures of cats and trees don’t change)
• Model parameters come from domain experts today
• Open: bridging real (top-secret!) and artificial (fake!)
• My group: many years of “eye chart” papers

A. B. Kahng, 180327 ISPD--2018 49

Todo List: “Last Mile” Robots
• Automation of manual DRC violation fixing
• P&R tools cannot handle latest rule decks, unavoidable
lack of routing resource in high-utilization block, etc.
• Automation of manual timing closure
• After routing and optimization, several thousand violations
of maxtrans, setup, hold constraints exist
• Engineer fixes 200-300 DRVs by hand, per day
• Placement of memory instances in a P&R block
• Package layout automation
• How to assess post-routed quality (e.g., bump inductances)
of SOC floorplan and die-package pin map?
• Required for: pin map, power delivery optimization
• Requires: automation/estimation of manual package routing

A. B. Kahng, 180327 ISPD--2018 50

Todo List: Improving Analysis Correlation
• Prediction of the worst PBA path
• Prediction of the worst PBA slack per endpoint,
from GBA analysis
• Prediction of timing at “missing corners”
• Predict other impacts (e.g., transition times, ..) of an ECO as
well
• Closing of multi-physics analysis loops
• Early priorities: vectorless dynamic IR drop, power-
temperature loops
• Continued improvement of timing correlation and
estimation !
• Faster and better always helpful !

A. B. Kahng, 180327 ISPD--2018 51

Todo List: Predictive Models of Tools, Designs
• Predict convergence point for P&R, non-uniform PDN
• Estimate PPA response of block to floorplan context
• Estimate useful skew impact on post-route WNS,TNS
• “Auto-magic” determination of netlist constraints for
given performance and power targets
• Key opportunity: exactly ONE netlist is passed into place-
and-route – how to generate this best netlist?
• Predict best “target sequence” of constraints through
layout optimization phases
• Predict “most-optimizable” cells during design closure
• Predict divergence (detouring , timing/slew violations)
between trial/global route and final detailed route
• Predict “doomed runs” at all steps of design flow
A. B. Kahng, 180327 ISPD--2018 52
Todo List: And More…
• Infrastructure for machine learning in IC design
• Standards for model encapsulation, model application, and
IP preservation when models are shared
• Standard ML platform for EDA modeling
• Enablement of design metrics collection, tool/flow model
generation, design-adaptive tool/flow configuration,
prediction of tool/flow outcomes
• This recalls “METRICS” http://vlsicad.ucsd.edu/GSRC/metrics
• Modelable algorithms and tools
• Smoother, less chaotic outcomes than present methods
• Datasets to support ML
• Artificial circuits and “eyecharts”
• Shared training data – e.g., timer correlation, post-route
DRV prediction, optimal sizing
A. B. Kahng, 180327 ISPD--2018 53
Agenda
• Crises…
• … and a Vision
• Machine Learning in PD
• Modeling and Prediction
• Analysis Correlation
• Optimization
• A Roadmap
• Conclusion

A. B. Kahng, 180327 ISPD--2018 54

Conclusion
• Many high-value opportunities for ML in physical
design
• Analysis correlation  less margin, improved design QOR,
faster convergence
• Predictive modeling of tools/flows and designs  fewer
loops, less wasted effort, less pessimism, better design
optimization, better resource management
• Roadmap
• Robots
• Orchestration of robots
• Pruning via predictors and models
• Intelligence + many specific “todos”
• Other facets: enablement, standards, openness,…
• I hope that many of you will join this quest !!!
A. B. Kahng, 180327 ISPD--2018 55
THANK YOU !

Support from NSF, Qualcomm, Samsung, NXP, Mentor

Graphics and the C-DEN center is gratefully acknowledged.

A. B. Kahng, 180327 ISPD--2018 56

[ISQED01]
(This is “METRICS” !)

• METRICS (1999; ISQED01): “Measure to Improve”

• Goal #1: Predict outcome
• Goal #2: Find sweet spot (field of use) of tool, flow
• Goal #3: Dial in design-specific tool, flow knobs
http://vlsicad.ucsd.edu/GSRC/metrics
A. B. Kahng, 180327 ISPD--2018 57
Patterning and Margins for Wires (“BEOL”)
• Self-aligned multiple patterning + Cutmask
• Make a “sea of wires”
• Make “cuts”
• Cut shapes and locations determine dummy wires and
end-of-line extensions of wire segments
• Final layout  Target layout
 Timing and power not the same as originally designed !
 Need more margin !
cut extension

Target layout 1D wires Cut masks Final layoutdummy fill

A. B. Kahng, 180327 ISPD--2018 58

Patterning and Margins for Gates (“FEOL”)
• Neighbor diffusion effect (NDE)
• Diffusion step = neighboring diffusion area height
change
• Transistor drive strength and leakage prop. to
horizontal fin spacing
• 2nd Diffusion Break (DB)
• Vt shift as a function of spacing to the 2nd diffusion
break
• Gate Cut (GC)
• Idsat shifts as a function of gate-cut distance to DUT
• Worst corner has to consider NDE + 2nd DB + GC
 More margin added besides PVT (!)
1st DB 2nd DB
Diffusion
Diffusion
height

Diffusion break
DUT Fin
PC
Gate cut Gate Cut (GC) Effect

A. B. Kahng, 180327 ISPD--2018 59

[ASPDAC16]
Closing Multiphysics Analysis Loops

Sim Results
Tech files, signoff (Dyn.) Activity Functional
Sim vectors
Sim
criteria, corners Factor (Static) Benchmark
RTL
IR Drop Power
Map Thermal
AVS Trace Analysis

Timing / Power Temp

Glitches Analysis Map
Slack

Task
Timing/ Mapping/
P&R + Noise Migration/
Reliability (DVFS)
Optimization
Report

MTTF &
Aging
A. B. Kahng, 180327 ISPD--2018 60
[ASPDAC16]
Closing Multiphysics Analysis Loops

Sim Results
Tech files, signoff Workload-Thermal
(Dyn.) Activity Functional loop
Sim vectors
Sim
criteria, corners Factor (Static) Benchmark
STA-IR loop RTL
IR Drop Power
Map Thermal
AVS Trace Analysis

Timing / Power
STA-Thermal
Analysis
Temp
Slack Glitches Map
loop
Task
Timing/ Mapping/
P&R + Noise Migration/
Reliability (DVFS)
Optimization
Report

STA-Reliability loop MTTF &

Aging
A. B. Kahng, 180327 ISPD--2018 61
BACKUP

A. B. Kahng, 180327 ISPD--2018 62

Many Operating Conditions (“Corners”)
• Chip must work at many (500+) operating conditions (corners)
• Each corner = another run of the timing tool
• GOAL: Run as few timing corners as possible; predict the rest

Predict the hidden slack values!

A. B. Kahng, 180327 ISPD--2018 63

And a Dream … [predicting dynamic voltage drop]
Inexpensive
Static analysis
+ Current map
+

Expensive
Dynamic analyses

A. B. Kahng, 180327 ISPD--2018 64

Some References
Highlighted in the talk from ABKGroup
• [RISKMAP] W.-T. J. Chan, K. Y. Chung, A. B. Kahng, N. D. MacDonald and S. Nath, "Learning-Based Prediction of
Embedded Memory Timing Failures During Initial Floorplan Design", (.pdf), Proc. ASPDAC, 2016.
• [GT1GT2] ] S. S. Han, A. B. Kahng, S. Nath and A. Vydyanathan, "A Deep Learning Methodology to Proliferate
Golden Signoff Timing", (.pdf), Proc. DATE, 2014.
• [GT1GT2] A. B. Kahng, M. Luo and S. Nath, "SI for Free: Machine Learning of Interconnect Coupling Delay and
Transition Effects", (.pdf), Proc. SLIP, 2015.
• [#ML/ROPT] W.-T. J. Chan, Y. Du, A. B. Kahng, S. Nath and K. Samadi, "BEOL Stack-Aware Routability Prediction
from Placement Using Data Mining Techniques", (.pdf), Proc. ICCD, 2016.
• [#ML/ROPT] W.-T. J. Chan, P.-H. Ho, A. B. Kahng and P. Saxena, "Routability Optimization for Industrial Designs at
Sub-14nm Process Nodes Using Machine Learning", (.pdf), Proc. ISPD, 2017.
• [CTS] K. Han, A. B. Kahng, J. Lee, J. Li and S. Nath, "A Global-Local Optimization Framework for Simultaneous
Multi-Mode Multi-Corner Skew Variation Reduction",(.pdf), Proc. DAC, 2015.
Some other machine learning / data mining papers from ABKGroup
• [3DPE] W.-T. J. Chan, Y. Du, A. B. Kahng, S. Nath and K. Samadi, "3D-IC Benefit Estimation and Implementation
Guidance from 2D-IC Implementation", (.pdf), Proc. DAC, 2015.
• [HS] A. B. Kahng, C.-H. Park and X. Xu, "Fast Dual-Graph Based Hotspot Detection” (.pdf), Proc. BACUS, 2006.
• [INT] A. B. Kahng, S. Kang, H. Lee, S. Nath and J. Wadhwani, "Learning-Based Approximation of Interconnect Delay
and Slew in Signoff Timing Tools", (.pdf), Proc. SLIP, 2013.
• [METRICS] S. Fenstermaker, D. George, A. B. Kahng, S. Mantik and B. Thielges, "METRICS: A System Architecture
for Design Process Optimization", (.pdf), Proc. DAC, 2000.
• [METRICS] A. B. Kahng and S. Mantik, "A System for Automatic Recording and Prediction of Design Quality
Metrics", (.pdf), Proc. ISQED, 2001.
• [HSM] A. B. Kahng, B. Lin and S. Nath, "Enhanced Metamodeling Techniques for High-Dimensional IC Design
Estimation Problems", (.pdf), Proc. Design, Automation and Test in Europe, 2013, pp. 1861-1866.
• [HHSM] A. B. Kahng, B. Lin and S. Nath, "High-Dimensional Metamodeling for Prediction of Clock Tree Synthesis
Outcomes", (.pdf), Proc. ACM/IEEE International Workshop on System-Level Interconnect Prediction, 2013.
• [METRICS] GSRC/METRICS: http://vlsicad.ucsd.edu/GSRC/metrics/
See also: Center for Design-Enabled Nanofabrication, http://cden.ucsd.edu 65
A. B. Kahng, 180327 ISPD--2018
Cycles of Margin Implications [ISQED08]

Delays

Optimization Challenge

50% decrease of margin? Driver Sizes

Or 100% increase?
Area (A)

Wirelengths

Parambest Paramworst Yr  e  Ad
Defects (d: defect density)
-100% 0% 100%  r2 2r 
Cost N dies     
 A 2 A 
(r: wafer radius)

A. B. Kahng, 180327 ISPD--2018 66

Benefits from Margin Reduction at 45nm
Technology
(90nm, 65nm, 45nm)
• 40% margin reduction
Cell library margin RC margin • Area: 13% reduction
reduction reduction • Dynamic power: 13% reduction
• Leakage power: 19% reduction
RTL Design
• Wirelength: 12% reduction
Synthesis
(AES, JPEG, SOC1) • Tool runtime (S,P&R): 28% reduction
• #Timing viols.:100% reduction
Placement
 saves iterations and schedule
Experiments • #Good dies per wafer (w/o process
with industry chip Clock tree synthesis enhancement): 4% increase
implementation
flow Routing
• More margin = more cost
Analyze outcomes • Less margin = less cost
(Area, wirelength,
runtime, #violations,
• Cost reduction  must cure
yield) unpredictability of design tools
A. B. Kahng, 180327 ISPD--2018 67
Agenda
• Scaling, Moore’s Law and Crises
• Scaling Prospects
• What’s Left for the Future?

A. B. Kahng, 180327 ISPD--2018 68

“More Than Moore”: 2.5D/3D Integration
Conventional Path Futures
2.5D Interconnect 3D Monolithic Integration
Micro
Interposer- Bump

Sequential
TSV
based

Build-up
C4
Bump

2.5D SoC “Virtual” SoC

Source: LETI
MOCHI
(Marvell) 3D Transfer Printing

Stamp Grab objects off

3D 1
of donor subs
Tier3
TSV-based Tier2 TSV 2

Three Dimensional System

Tier1 Integration, Springer, 2011.
Donor subs
4 Prints objects
3D 3 onto receiver
Bonding-
D2W / D2D
based
Receiver
subs Nature Materials 5, 33 - 38 (2006)
A. B. Kahng, 180327 ISPD--2018 69
New (“Rebooting Computing”) Paradigms
• Approximate Computing
• E.g., cut carry chain in adder to trade off throughput, accuracy

• Stochastic Computing
• Represent numbers by pseudo-random bitstreams
• Tolerant to delay-induced error compared to parallel number
representation
Z = X1×X2
3/8 = 4/8  6/8

• Neuromorphic Computing …
A. B. Kahng, 180327 ISPD--2018 70
BUT: Even If We Had Infinite Dimensions...
 Idea: Infinite dimension gives us a bound on 3DIC benefits
 Infinite dimension: netlist optimization with zero wire
parasitics
 Gap between infinite dimension and 2D  maximum power
benefit from 3DIC = 36% for CORTEX M0, 20% for AES

CORTEX M0 AES
30 60
Pseudo1D 2D
25 3D (2 tier) 3D (3 tier) 50

Power (mW)
Power (mW)

3D (4 tier) infD
infiD
20 40
20%
15
36% 30
10 20
5 10
0,75 0,95 1,15 0,55 0,75 0,95
clock period (ns) clock period (ns)

A. B. Kahng, 180327 ISPD--2018 71

BUT: Even If Frequency Didn’t Matter At All…

 Up to ~65% area difference (usually ~30%) between

minimum clock period constraint (2.08GHz) and
relaxed clock period constraint (28FDSOI, AES)
Area vs. Target Frequency - AES Cipher in 28FDSOI
18000

16000 Timing
Fail
14000

65%
Post Route Area (um2)

12000

10000

8000

6000

4000

2000

0
0 0,5 1 1,5 2 2,5
Target Frequency (GHz)

A. B. Kahng, 180327 ISPD--2018 72

BUT: Even If Wires Were Perfect (No R, C) ...
3 Path Delays (JPEG Encoder) Path Delay (with wires)
Path Delay (without wires)
Min. cycle time = 2.8
2.5

Min. cycle time = 2.25

2
Delay (ns)

1.5

0.5

0
1 501 1001 1501 2001 2501 3001 3501 4001 4501 5001

Path Index
A. B. Kahng, 180327 ISPD--2018 73
Agenda
• Scaling, Moore’s Law and Crises
• Scaling Prospects
• What’s Left for the Future?
• The Last Semiconductor Scaling Levers

A. B. Kahng, 180327 ISPD--2018 74

Andrew B. Kahng
CSE and ECE Departments
UC San Diego

http://vlsicad.ucsd.edu

A. B. Kahng, 180327 ISPD--2018

Agenda
• Scaling, Moore’s Law, and Crises

A. B. Kahng, 180327 ISPD--2018 77

What is “Scaling”?
• ITRS = International Technology Roadmap for
Semiconductors (http://www.itrs2.net/)
• Key metric of (density) progress: half-pitch (F)
• Contacted Poly pitch scales by 0.7
• Mx pitch scales by 0.7

0.7 x 0.7 = 0.49  density doubles

at each “technology node”

A. B. Kahng, 180327 ISPD--2018 78

“Moore’s Law” = Scaling of Cost and Value
• Moore, 1965: “The complexity for minimum component costs
has increased at a rate of roughly a factor of two per year”

Min cost per transistor

• Moore’s Law is a law of cost reduction (1% = 1 week)

• Proxy for cost reduction: “scaling of value”
• Proxies for value: “bits”, “hertz”, “density” (= utility, integration)
A. B. Kahng, 180327 ISPD--2018 79
Today: Bigger Stacks of Margin (“Corners”)
Design margin = stacks of
layers of conservatism

Reliability
Voltage Temperature
Process
Nominal Vdd Signoff
PDF
Static IR drop

Signoff Power grid

IR gradient
Dynamic IR

HCI/NBTI
performance
Signoff
source: Wu 08
A. B. Kahng, 180327 ISPD--2018 80
Corner Explosion Worsens

Corners = Process X RCX X Temperature X Voltage X ...

FF, FFG, C-worst, -40°C, 0°C, 0.7V, 0.8V,

FS, SF, Cc-worst, 80°C, 125°C, 0.9V, 1.0V,
TT, C-best, … 1.1V, …
SSG, SS, Cc-best,
… RC-worst,
RC-best,
…

• Each corner is a new “objective function” and a new set of

constraints!

• Lose design turnaround time (TAT) == schedule

• Non-convergence, “ping-ponging” in timing closure

A. B. Kahng, 180327 ISPD--2018 81

Consequences
• Diminishing ROI from next node
• Typical: Moore’s Law-ish scaling
• Worst-case: Scales, but worse return on investment
• Signoff with excessive margin: gain is wiped out

A. B. Kahng, 180327 ISPD--2018 82

Agenda
• Scaling, Moore’s Law and Crises
• Scaling Prospects…
• Difficult and costly, with limits ahead !

A. B. Kahng, 180327 ISPD--2018 83

Scaling Will Continue (!)
• Lateral scaling in semiconductor manufacturing and
device architecture is still predicted to occur
• Extremely challenging after 5nm/3nm node (i.e., N5/N3)
• Monolithic 3D will drive scaling afterwards
• Beyond this roadmap, new scaling levers are needed

Source: IRDS

A. B. Kahng, 180327 ISPD--2018 84

Lateral (Area) Scaling: MOL and Tracks (1)
• Old technology node layer stack
• OD / Poly – V0 – M1 – V1 – M2
VDD
M1
VDD M2
V1 BEOL
M1
A Z
A Z V0
Mint
Vint Poly MOL
M0G M0A
VSS VSS
Fin
Inverter (old) Inverter (old) Poly

• Advanced node layer stack

• OD – M0A – VINT – MINT – V0 – M1 – V1 – M2
• Poly – M0G – VINT – MINT – V0 – M1 – V1 – M2

A. B. Kahng, 180327 ISPD--2018 85

Lateral (Area) Scaling: MOL and Tracks (2)
• N10/N7/N5 technology nodes
Cells 12T 9T 7.5T 6T 5T/4T/3T
Pins M1 M1 MINT/M1
M1 Bidirectional Unidirectional
MOL N/A Yes: MINT/M0 below M1
VDD/VSS M1 M2 M1/MINT Buried/backside P/G
# M2 routing tracks ~9 ~6 5 6 5/4/3

VDD M0G
VDD
M0A
VDD VDD VDD VDD
VDD MINT
Z
VSS A VSS A Z A M1
Z
M2
VSS VSS VSS VSS Buried
VSS
Inverter (7.5T) Inverter (6T) Inverter (5T)
A. B. Kahng, 180327 ISPD--2018 86
Area Scaling Teardown (CPP x MP)
• 0.5x target area scaling to Gate-Contact Congestion
continue Moore’s Law
• Combines Contacted Poly Pitch
(CPP) scaling and Metal Pitch
(MP) scaling
•  Need new design technology
and device technologies

0.5x area scaling = CPP scaling x metal pitch scaling

[source] M. Badaroglu, “More Moore scaling: opportunities and inflection points”

A. B. Kahng, 180327 ISPD--2018 87
Scaling is Doable, but ...

... it’s getting tough 

A. B. Kahng, 180327 ISPD--2018 88

Machine Learning Gives Us Scaling !
• High-value opportunities in and around EDA
• Modeling and Prediction
• Predict tool outcome = F(design, constraints, tool config)
• How to run tool “optimally” for given design and design goals?
• Avoid “failed runs”  reduce iterations in design flow
• Dream: one-pass design flow
• Model analysis errors (crude vs. golden analyses)
• Reduced guardbands and pessimism  better design quality
• Optimization (ML models = objective functions!)
• Better use of resources (tools, schedule, engineers) + better tools
• Project-level prediction, adaptive scheduling
• Today: the major focus for IC industry
• U.S. DARPA IDEA program: automation, schedule
• 24-hour TAT, “no-human-in-the-loop”
A. B. Kahng, 180327 ISPD--2018 89
What About … “No Human In The Loop”?
• Multi Armed Bandit Problem: Given a slot machine
with N arms, maximize total reward obtained using T
pulls (iterations)
• Well-studied in context of Reinforcement Learning
• IC Design: “arm” = target frequency; “pull” = run of flow
• UCSD scripts available upon request
Tool Outcomes (Area,
Power, WNS/TNS)

Arms to
Sample Parallel
SAMPLER
Tool Runs
Samples per
Constraints
Arm

Max
Frequency
A. B. Kahng, 180327 ISPD--2018 90
Same Quality in Less Time = Scaling
IC Quality (%)
(25, 100)
100% Current (100, 100)
90% #1
#3

Design time (%)

25 100
#1. tool/flow models; design-adaptive, learning-based, one-pass flows
#2. analysis correlation, prediction; reduced margins/corners; correct by construction
#3. cloud-based design to recover global optimization; SP&R improvements

Machine Learning (Data + Intelligence) is essential for this

A. B. Kahng DARPA IDEA workshop 170413
A. B. Kahng, 180327 ISPD--2018 91
[ISQED01]
(This is “METRICS” !)

• METRICS (1999; ISQED01): “Measure to Improve”

A. B. Kahng, 180327 ISPD--2018 93

Agenda
• Scaling, Moore’s Law and Crises
• Scaling Prospects
• What’s Left for the Future?
• The Last Semiconductor Scaling Levers
• Going Forward: Foundation #1 = ML in/around EDA
• Going Forward: Foundation #2

A. B. Kahng, 180327 ISPD--2018 94

Attacking the Design Capability Gap
• Not enough R&D attention on EDA challenges
• ~10,000 worldwide EDA, internal CAD, academic research
headcount
• Long latency of technology transfer
• Latest CAD research technologies unavailable to chip
designers
• 5-7 years from ASP-DAC proceedings to production IC
design flow
•  Opportunity for another form of “scaling”

A. B. Kahng, 180327 ISPD--2018 95

Is It Time for “Linux of EDA”?
• Free open-source software (FOSS) has sparked
rapid innovation in many fields
• Common standards, platforms avoid wasted energy
• Recent U.S. DARPA “IDEA” program solicitation: IC
design that is “no human in the loop” and “24-hour TAT”
• Older efforts
• MARCO GSRC Bookshelf
• Berkeley tools (SPICE, MIS/SIS/ABC, …)
• UCLA/UCSD/UM tools (Capo, MLPart, …)
• OpenAccess and OAGears
• Many recent efforts worldwide
• OpenTimer, Yosys, RSyn, Ophidian, Open Design Flow,
CloudV.io, …
• Will “critical mass” be possible this time around?
A. B. Kahng, 180327 ISPD--2018 96
Agenda
• Scaling, Moore’s Law and Crises
• Scaling Prospects
• What’s Left for the Future?
• The Last Semiconductor Scaling Levers
• Going Forward: Foundation #1 = ML in/around EDA
• Going Forward: Foundation #2 = “Linux of EDA”
• Going Forward: Foundation #3 = partitioning, cloud
• Takeaways

A. B. Kahng, 180327 ISPD--2018 97

Multiphysics Analysis is Difficult to Predict
• IR drop, thermal, reliability, crosstalk, etc.
• Example: Can we predict “risk map” for embedded
memories at floorplan stage ?
SRAM Slack (ps)

29ps
25ps

SRAM #1 SRAM #5

A. B. Kahng, 180327 ISPD--2018 98

Key Challenge: Global-Detailed Route Correlation
• 7nm P&R: global route (GR) congestion map does not
correlate well with post-route (actual) DRC violations
• Many false-positive overflows in GR congestion map
• False-positive  do not correspond to actual DRC violations

GR Overflows Actual DRC

GR-based prediction can mislead routability optimizations!!!

A. B. Kahng, 180327 ISPD--2018 99

If We Know DRC Hotspots before Routing…
• Conventional way to
Technology close designs
• Iteratively fix design before
signoff
Design Rules Constraints
• Go back to placement if
QOR is hopeless
RTL Design Synthesis
• Turnaround time is VERY
challenging (7-day P&R
Placement
runs…)
• Can we do better with
G/D Routing accurateIteration
prediction?
with space
padding,
Analyze QOR (Area, wirelength, NDR modifications,
timing, #DRCs, yield) density screens ...

A. B. Kahng, 180327 ISPD--2018 100

Layout Study
• Initially predict with GR overflows and cell/pin density map
• Red DRC-hotspot likely be rejected due to low cell-pin density
• Larger windows and buried nets metrics to guide prediction

Standard cells
Route-DRC
False-negative

Extraction
windows

Non-buried net

Sparse pins/cells Dense

pins/cells A. B. Kahng, 180327 ISPD--2018 101
DRV Prediction with Machine Learning
• Predictor is used to guide routability optimization
• SVM with weighting to compensate biased training data
Cell density, Parameters
pin density
Remaining
Random GR resources
80%
20% Pin proximity Learning
gcells for
gcells for Model
Cell testing
training connectivity
Net spreading

…
… Prediction of
Route-DRVs Route-DRVs
for training

Initial linear model Non-linear SVM model

With
W/o DRC W/o DRC With DRC
True positive rate = tp / t DRC
False positive rate = tn / n

W/o DRC 98260 350 W/o DRC 98571 117

With DRC 481 111 With DRC 170 344

True positive rate: 24% True positive rate: 74%

False positive rate: 0.5% False positive rate: 0.2%
A. B. Kahng, 180327 ISPD--2018 102
Improved Learning-Based Predictor
• Captures all true-positive clusters
• Maintains low false-positive rate

Learning-based Prediction Actual DRC

(a) (b) (c)

A. B. Kahng, 180327 ISPD--2018 103
Machine Learning Gives Us Scaling !
• High-value opportunities in and around EDA
• Modeling and Prediction
• Predict tool outcome = F(design, constraints, tool config)
• How to run tool “optimally” for given design and design goals?
• Avoid “failed runs”  reduce iterations in design flow
• Dream: one-pass design flow
• Model analysis errors (crude vs. golden analyses)
• Reduced guardbands and pessimism  better design quality
• Optimization (ML models = objective functions!)
• Better use of resources (tools, schedule, engineers) + better tools
• Project-level prediction, adaptive scheduling (=separate talk)
• Today: the major focus for IC industry
• U.S. DARPA IDEA program: automation, schedule

A. B. Kahng, 180327 ISPD--2018 104

Agenda
• Crises…
• … and a Vision
• Machine Learning

A. B. Kahng, 180327 ISPD--2018 105

PREDICTION

A. B. Kahng, 180327 ISPD--2018 106

Agenda
• Scaling, Moore’s Law and Crises
• Scaling Prospects
• What’s Left for the Future?
• The Last Semiconductor Scaling Levers
• Going Forward: Foundation #1

A. B. Kahng, 180327 ISPD--2018 107

Savings due to MDP
Errors Testing (Total = 3442 logs)
N = 200 Number of runs that Number of runs stopped Average number of
need to be stopped correctly out of these iterations saved
1 STOP 398 394 18.9644
2 consecutive STOPs 398 391 17.9309
3 consecutive STOPs 398 380 16.9736

Test data = M0 runs

For one run, #iterations saved = 20 – (iteration number where MDP says
STOP)
Average #iterations saved = Sum(#iterations saved)/398
In almost every one of these 398 cases, the run starts with a huge number of
violations, and the MDP stops it almost immediately. Hence, large avg.
#iterations saved

A. B. Kahng, 180327 ISPD--2018 108

Doomed Runs – Updated Error Criteria
• Prediction is wrong if:
• DR ends with less than N violations and we predict STOP at 3 consecutive
iterations (less stringent)
(where N is the number of violations which a human designer finds it hard to
resolve - usually N ~100-200)
• DR ends with more than N violations and we predict GO at each iteration (already
relaxed, but predictor does not have information about N)
• Training data: 1200 logfiles from PROBE experiments
• Testing data: 3745 logfiles from ARM Cortex M0 floorplan experiments
Errors Training (Total = 1200) Testing (Total = 3442)
N = 200 Total #Errors wrongly #Errors with Total #Errors wrongly #Errors
Training predicted to no STOP Training predicted to with no
Error STOP (TYPE 1) (TYPE 2) Error STOP STOP
1 STOP 29.66% 251 99 35.2% 1317 3
2 consecutive 10.5% 27 99 8.3% 307 3
STOPs
3 consecutive 8.5% 3 99 4.2% 154 3
STOPs

A. B. Kahng, 180327 ISPD--2018 109

A. B. Kahng, 180327 ISPD--2018 110

Example Early Result
• Early model with MARS (multiple adaptive regression
splines): 90% of predicted PBA slacks within 5ps
• Testcase: netcard, 28nm FDSOI
9000
8000
7000
# EndPoints (Testing)

6000
5000
4000
3000
2000
1000
0
‐30 ‐20 ‐10 0 10 20 30
Error (ps) = Actual ‐ Predicted PBA Slack
A. B. Kahng, 180327 ISPD--2018 111
Machine Learning Gives Us Scaling !
• High-value opportunities in and around EDA
• Modeling and Prediction
• Predict tool outcome = F(design, constraints, tool config)
• How to run tool “optimally” for given design and design goals?
• Avoid “failed runs”  reduce iterations in design flow
• Dream: one-pass design flow
• Model analysis errors (crude vs. golden analyses)
• Reduced guardbands and pessimism  better design quality
• Optimization (ML models = objective functions!)
• Better use of resources (tools, schedule, engineers) + better tools
• Project-level prediction, adaptive scheduling
• Today: the major focus for IC industry
• U.S. DARPA IDEA program: automation, schedule

A. B. Kahng, 180327 ISPD--2018 112

Takeaways
• Quality, Schedule, Cost are “the last levers for
semiconductor scaling”
• Accessibility of hardware / semiconductor design
• Continue semiconductor value trajectory (for a while longer)
• Foundation #1: machine learning in, around EDA
• Pervasive ML  Drive down iterations, margins
• Cloud-targeted, large-scale optimizations  drive down TAT
• Foundation #2: open-source EDA
• Will a “Linux of EDA” be possible this time around?
• Foundation #3: partitioning and cloud EDA
• Also part of schedule reduction
• Design Capability Gap is a crisis for the industry
• Need all hands on deck!
A. B. Kahng, 180327 ISPD--2018 113
Conclusions and Futures (2)
• ML+EDA: challenges of technology
• “Small data” problem alongside “big data” problem
• Huge implementation space, difficult parameter identification
• Complicated by tool versions, design versions, technology
changes (pictures of cats and trees don’t change every
year)
• Possibly helpful: EDA folks know what’s in their tools!
• ML in EDA: industry challenges
• EDA {doesn’t like to, doesn’t know how to} model itself
• Dependence on customers and customer data to understand
what is needed
• Open: Will customers or EDA vendors (or foundries)
drive ML into design enablements and production flows?
• METRICS … revisited? (measure, record, model, predict, improve)
A. B. Kahng, 180327 ISPD--2018 114

Floor Plan
No ratings yet
Floor Plan
6 pages
Physical Design in Depth
100% (1)
Physical Design in Depth
147 pages
PDF 3 - Floorplan Lab Manual
No ratings yet
PDF 3 - Floorplan Lab Manual
8 pages
Issues in Physics Design and Fixes
No ratings yet
Issues in Physics Design and Fixes
31 pages
Multiple IO Ring
No ratings yet
Multiple IO Ring
27 pages
Physical Design Flow Challenges at 28nm On Multi-Million Gate Blocks
No ratings yet
Physical Design Flow Challenges at 28nm On Multi-Million Gate Blocks
21 pages
Outline: - What Is Physical Design - Design Methods - Design Styles - Analysis and Verification
No ratings yet
Outline: - What Is Physical Design - Design Methods - Design Styles - Analysis and Verification
43 pages
ICC 201012 LG 06 Finishing
No ratings yet
ICC 201012 LG 06 Finishing
14 pages
PD Flow
No ratings yet
PD Flow
8 pages
Physical Design - PLACEMENT
No ratings yet
Physical Design - PLACEMENT
13 pages
ASIC Physical Design Guide
No ratings yet
ASIC Physical Design Guide
272 pages
Placement Script Analysis
No ratings yet
Placement Script Analysis
12 pages
PD Ques
No ratings yet
PD Ques
32 pages
VLSI Physical Design Essentials
No ratings yet
VLSI Physical Design Essentials
54 pages
Placement - VLSI Guide
No ratings yet
Placement - VLSI Guide
10 pages
SoC Design: Tracks, Cells, and Metrics
No ratings yet
SoC Design: Tracks, Cells, and Metrics
10 pages
3 4 NarenderHanchate
No ratings yet
3 4 NarenderHanchate
14 pages
VLSI Physical Design - Physical Design Interview Question Part 2
No ratings yet
VLSI Physical Design - Physical Design Interview Question Part 2
2 pages
VLSI Routing and Timing Analysis
No ratings yet
VLSI Routing and Timing Analysis
8 pages
Magnet - Placement Analysis
No ratings yet
Magnet - Placement Analysis
9 pages
L16 CTS 4
No ratings yet
L16 CTS 4
13 pages
Physical Design Is Physical Implementation of Design
100% (1)
Physical Design Is Physical Implementation of Design
30 pages
1-Introduction and App Options
No ratings yet
1-Introduction and App Options
63 pages
Plangrouppins
No ratings yet
Plangrouppins
95 pages
IC Routing & Optimization Guide
No ratings yet
IC Routing & Optimization Guide
23 pages
Placement Stage of Physical Design
100% (1)
Placement Stage of Physical Design
10 pages
Vlsi Physical Design PDF
No ratings yet
Vlsi Physical Design PDF
41 pages
Physical Design Implementation of 32bit RISC Processor Using Synopsys ICC Primetime & StarRC XT
No ratings yet
Physical Design Implementation of 32bit RISC Processor Using Synopsys ICC Primetime & StarRC XT
6 pages
Innovus
No ratings yet
Innovus
77 pages
Age, Lda: Lntroduction and Lab
No ratings yet
Age, Lda: Lntroduction and Lab
36 pages
W5 ICC2 Physical Synthesis
No ratings yet
W5 ICC2 Physical Synthesis
29 pages
VLSI Logic Synthesis Part 3
No ratings yet
VLSI Logic Synthesis Part 3
19 pages
IO Pad Insertion
No ratings yet
IO Pad Insertion
7 pages
Timing Paths Static Timing Analysis STA Basic Part 1 VLSI Concepts PDF
No ratings yet
Timing Paths Static Timing Analysis STA Basic Part 1 VLSI Concepts PDF
10 pages
Physical Aware Synthesis and Floorplan Challenges
100% (1)
Physical Aware Synthesis and Floorplan Challenges
12 pages
New PD 1
No ratings yet
New PD 1
78 pages
VLSI Physical Design - STA Interview Question Part 5
No ratings yet
VLSI Physical Design - STA Interview Question Part 5
2 pages
Floorplan Script Analysis
No ratings yet
Floorplan Script Analysis
9 pages
PD Inputs
No ratings yet
PD Inputs
3 pages
ICCII Freeze Silicon ECO Flow
No ratings yet
ICCII Freeze Silicon ECO Flow
17 pages
Mentor Calibre DRC Lvs DFM
No ratings yet
Mentor Calibre DRC Lvs DFM
33 pages
Physical Design for Engineers
No ratings yet
Physical Design for Engineers
15 pages
Starrc Ds
No ratings yet
Starrc Ds
7 pages
PD Inputs
No ratings yet
PD Inputs
40 pages
About DRC and Its Impact in Physical Design
No ratings yet
About DRC and Its Impact in Physical Design
5 pages
Design Rule Violation Fixing in Timing Closure
No ratings yet
Design Rule Violation Fixing in Timing Closure
3 pages
PD Interview Questions
No ratings yet
PD Interview Questions
54 pages
PD Flow Files
No ratings yet
PD Flow Files
2 pages
PD Interview Questions
No ratings yet
PD Interview Questions
16 pages
PhysicalDesignForYou (VLSI) - FloorPlan VLSI
No ratings yet
PhysicalDesignForYou (VLSI) - FloorPlan VLSI
6 pages
Physical Verification
100% (1)
Physical Verification
13 pages
Machine Learning For EDA The Road Ahead
No ratings yet
Machine Learning For EDA The Road Ahead
9 pages
Machine Learning Applications in Physical Design - Recent Results and Directions
No ratings yet
Machine Learning Applications in Physical Design - Recent Results and Directions
6 pages
Goswami and Bhatia - 2023 - Application of Machine Learning in FPGA EDA Tool Development
No ratings yet
Goswami and Bhatia - 2023 - Application of Machine Learning in FPGA EDA Tool Development
17 pages
Machine Learning in FPGA EDA Tools
No ratings yet
Machine Learning in FPGA EDA Tools
18 pages
Kahng Ansys Dacbreakfast Talk Distributed2
No ratings yet
Kahng Ansys Dacbreakfast Talk Distributed2
43 pages
S 11 Keynote
No ratings yet
S 11 Keynote
29 pages
PH4418 Physics in Industry - Semiconductors - Part5
No ratings yet
PH4418 Physics in Industry - Semiconductors - Part5
72 pages
Practical Solutions To Accelerating ASIC Design Development Using Machine Learning
No ratings yet
Practical Solutions To Accelerating ASIC Design Development Using Machine Learning
142 pages
Machine Learning: Design, Development and Augmented Intelligence
No ratings yet
Machine Learning: Design, Development and Augmented Intelligence
25 pages
Lmx24, Lmx24X, Lmx24Xx, Lm2902, Lm2902X, Lm2902Xx, Lm2902Xxx Quadruple Operational Amplifiers
No ratings yet
Lmx24, Lmx24X, Lmx24Xx, Lm2902, Lm2902X, Lm2902Xx, Lm2902Xxx Quadruple Operational Amplifiers
59 pages
VLT 2800 - 3
No ratings yet
VLT 2800 - 3
7 pages
GTU Engineering Exam 2023 Practice
No ratings yet
GTU Engineering Exam 2023 Practice
2 pages
Pic Example
No ratings yet
Pic Example
5 pages
An Introduction To Software Engineering
No ratings yet
An Introduction To Software Engineering
25 pages
02.SAP HANA SP08 Course Content Details
No ratings yet
02.SAP HANA SP08 Course Content Details
3 pages
MCA527 Manual
No ratings yet
MCA527 Manual
71 pages
C# Special Characters
100% (1)
C# Special Characters
7 pages
How To Move All Android Apps To SD Card
0% (2)
How To Move All Android Apps To SD Card
10 pages
Software & Electronics Engineer Resume
No ratings yet
Software & Electronics Engineer Resume
2 pages
C Programming Basics
100% (1)
C Programming Basics
91 pages
Pedro Soares HP Server Foundation Test
50% (4)
Pedro Soares HP Server Foundation Test
22 pages
Guide Reparation
No ratings yet
Guide Reparation
37 pages
Create Arduino CC Projecthub User8523373 Rfid Based Automati
No ratings yet
Create Arduino CC Projecthub User8523373 Rfid Based Automati
18 pages
Automatic Control Systems
No ratings yet
Automatic Control Systems
25 pages
UG APE Syllabus
No ratings yet
UG APE Syllabus
2 pages
Quick Guide Candy Bianca Smart Fi Dual en 170830
No ratings yet
Quick Guide Candy Bianca Smart Fi Dual en 170830
2 pages
LoadRunner Stormrunner-Cloud-Ds
No ratings yet
LoadRunner Stormrunner-Cloud-Ds
4 pages
Exec CC
No ratings yet
Exec CC
57 pages
Mcqs & Short Questions
No ratings yet
Mcqs & Short Questions
3 pages
Vss
No ratings yet
Vss
1 page
RET670 Test Report Differential
50% (2)
RET670 Test Report Differential
3 pages
2.2.1 Packet Tracer - Configure Basic EIGRP With IPv4
No ratings yet
2.2.1 Packet Tracer - Configure Basic EIGRP With IPv4
3 pages
2N3906 PNP 40V 0.2a PDF
No ratings yet
2N3906 PNP 40V 0.2a PDF
6 pages
EE2016 F23 TTRL 1
No ratings yet
EE2016 F23 TTRL 1
3 pages
Distance Protection Schemes Guide
100% (1)
Distance Protection Schemes Guide
9 pages
C++ MCQ Question
No ratings yet
C++ MCQ Question
192 pages
WinCC BasicOptions en-US en-US
No ratings yet
WinCC BasicOptions en-US en-US
376 pages
HT640 Datasheet
No ratings yet
HT640 Datasheet
13 pages