Safari
Safari
ASPDAC 2025
Prashant Seetharaman
Siemens EDA
1
Agenda
2
Introduction on memory testing
3
Memory structure and operation
Key takeaway: Performance considerations b/w access patterns, timing requirements and power optimization
4
RAM bit cell structure
Key Takeaway:
By optimizing transistor strengths and ratios, designers achieve a reliable memory cell capable of high-
speed operations with minimal noise and power overhead
5
Array of bit cells
Data sent to and read from bit cells using differential signaling
over bitlines for higher performance
Data=1 if voltage of BL+ > BL-
Data=0 if voltage of BL+ < BL-
Write operation
Read operation
Key takeaway: Through techniques like differential signaling, precise precharge, and robust sensing, this
design ensures high performance in data storage and retrieval 6
Complete Random Access Memory (RAM) structure
• Address decoders
• IO circuits
• Control block
Source: Memory Testing and Built-In Self-Test" in "CMOS VLSI Design: A Circuits and Systems
Perspective" by Neil H.E. Weste and David Harris
7
Multi-port memories
• Basic RAM is single port with read and write capability 1RW
- Address input, data input and data output
R=Read-only port
W=Write-only port
RW=Read/Write port
Design takes into account that cell can be accessed by more than
one port at the same time
- Access can be direct or indirect
- Direct if row and column address is the same on both ports
- Indirect if row address is the same but not column address
(more frequent)
- Implication on test patterns to be generated
Key takeaway: Optimizations such as single-ended read ports ensure these cells meet the demands of modern
high-performance systems while minimizing overhead 9
Pseudo 2 port memory
CLK
PREn
RE’ Source: “Digital Integrated Circuits: A Design Perspective" by Jan M. Rabaey et al
WE’
Key takeaway: By emulating multi-port behavior in a single-port design, it strikes a balance between performance,
area efficiency, and timing complexity, making it suitable for power-sensitive applications 10
Read only memory (ROM) structure
Key takeaway: Its minimalistic design—featuring just one transistor per bit—ensures low area and power
consumption, perfectly suited for static data applications 11
Memory structure and operation summary
• Single and multi-port SRAMs are the most common embedded memories
• All memories are composed of bit cells arranged in rows and columns
• Column multiplexing, physical address mapping and physical data mapping can be used to optimize the memory
layout, performance and power but they also affect testability
• Architecture of multi-port memories becomes complex due to a mix of R/W/RW ports and time-multiplexed
operations
• ROMs have a single bit cell, but their content can be changed late during the manufacturing process
12
MBIST Fundamentals
13
MBIST Fundamentals
2 MBIST architecture
14
Motivation for MBIST
Access
• MBIST controllers physically close to memories to minimize length of connections
• Low speed serial link used to control MBIST controllers from tester
• Physically impossible to access hundreds or even thousands of memories from pins or embedded CPUs
Timing
• Physical proximity also allows application of at-speed tests
• Not practical to apply tests in the GHz frequency range from tester
Test time
• MBIST controllers can test tens of memories in parallel
• Something not possible with a tester or a CPU based approach
15
MBIST Architecture
Memory Interface
• Intercepts functional memory inputs for
applying algorithm
• Unique to each memory
Source: Memory Testing and Built-In Self-Test" in "CMOS VLSI Design: A Circuits and Systems
Perspective" by Neil H.E. Weste and David Harris
16
MBIST Controller components
Source: "Memory Testing and Design for Test" by M. Bushnell and V. Agrawal 17
MBIST Interface components (RAM)
Source: "VLSI Test Principles and Architectures: Design for Testability" by Wang, Wu, and Wen
18
MBIST Interface components (ROM)
Source: "VLSI Test Principles and Architectures: Design for Testability" by Wang, Wu, and Wen
19
Factors influencing MBIST frequency
• Memory configuration
o Number of memories per controller
o Range of address bus and data bus width
o Complexity and/or differences in physical data and MBIST logic is typically implemented as “soft” IP
address map synthesized together with the functional logic.
o Type and granularity of repair Difficult to predict the maximum achievable
o Memory placement relative to controller frequency given the large number of factors affecting
performance
• Algorithm features
o Number of algorithms and instructions per algorithm
o Address register segmentation
o Data pattern inversion based on address
o Comparison of address registers
20
How to improve MBIST performance
• MBIST controller logic can be very complex, and options must be provided to enable operation at very high frequency (GHz
range)
• Pipelining is required for high fan-out signals or signals transmitted at a large distance from the controller potentially
toggling at the maximum clock rate
• Multi-cycle operations provide an advantage over pipelining for parts of the logic with large combinational depth not
toggling at the maximum clock rate
• Not possible to balance logic depth before and after pipeline flops due to large number of configurations
• Pipeline flops introduce additional delays
21
How to improve MBIST performance (contd.)
22
MBIST summary
• Memory BIST is the only practical method of testing a large number of memories due to access, timing and test time
considerations
• The MBIST logic consists of 3 main design objects: access port, controller and memory interface
• Use of pipelining and multi-cycle paths improves performance, but it is still difficult to predict the maximum frequency when
implemented as “soft” IP
23
MBIST Integration in SoC Design flow
24
Design Integration
• Test planning
25
Chip test architecture
Block A Block B
To other To other
1. Large majority of designs are hierarchical, and block based
MEM
MEM
instruments instruments
BISR
BISR
SIB SIB
- Divide-and-conquer approach beneficial to both design and test
BIST BIST
BAP BAP
BISR
BISR
SIB SIB
MEM
MEM
2. IEEE 1687 is most appropriate to implement test architecture
To other
- Easy integration of all test and non-test instruments instruments
MEM
SIB SIB
BISR
SIB
BIST
Key takeaway: A hierarchical, block-based test architecture with integrated BIST and BISR enhances
scalability, reliability, and test efficiency, making it essential for modern SoC designs 26
Memory BIST and BISR timing interfaces
Block A Block B
Several cross-domain interfaces To other To other
1. BAPBIST
MEM
MEM
instruments instruments
BISR
BISR
SIB SIB
2. BISR registers BIST
BIST BIST
3. BISR registers Memories BAP BAP
BISR
BISR
SIB SIB
MEM
MEM
4. BISR controller Functional
5. BAP Functional (not shown) To other
instruments
3
Three methods used for cross-domain transfers
MEM
BISR
SIB SIB SIB
BIST
2
1. Low speed serial link between controllers 5 1
Clock domains
2. Protocol based transfer TCK SIB SIB
4
i) Source register holds when destination register captures
Functional BISR
Functional clock and control
3. Synchronizers BISR
i) Used for few control signals when other two methods not
applicable
Source: “TMBIST User’s Manual”
27
Test planning (controller assignment rules)
Several partitioning rules used to assign memories to a test controller (Cx rules) and determine memories that can be tested in
parallel within a controller (Px rules)
- Partitioning done within a physical block/layout region
28
Test planning (parallel group assignment rules)
Px rules determine memories that can be tested in parallel within a controller
- Even if all memories in a controller are of a same type, they might have different requirements
P3: Memories should not exceed the maximum average power allowed
29
Optimizing parallel group assignments for test time
30
Optimization of comparators assignment based on placement
32
Tiling architecture including test (IJTAG, MemoryBIST)
33
IP cores with pre-defined test bus interfaces
Advantages
• IP provider can minimize impact of MBIST on core
performance
• “Perfect” at-speed test can be achieved by accessing
memories through functional registers
34
Memory BIST for IP cores with shared test bus
35
Access to memory BIST at system level – method 1
36
Access to memory BIST at system level – method 2
37
Verification checklist for memory test logic
All verification steps used for functional logic apply:
Sign-off verification
Timing verification
38
Design Integration summary
Memory BIST and repair implementation must be well integrated with the functional design and verification flow
Several partitioning rules are used to assign memories to a test controller (Cx rules) and determine memories that can be
tested in parallel within a controller (Px rules)
Reuse of memory BIST and repair functions in-system becoming more frequent
39
Advanced MBIST Techniques
40
Advanced MBIST Techniques
• Functional debug
41
Functional debug
Key takeaway: By leveraging MBIST for functional debugging, engineers can efficiently diagnose, isolate, and
resolve memory-related faults in complex systems 42
Dynamic RAM (DRAM)
Key takeaway: DRAM testing demands unique algorithms and procedures to account for its reliance on
capacitors, burst operations, and pre-charge logic 43
DRAM BIST hardware
Key takeaway: DRAM BIST hardware provides a robust framework for testing external memory interfaces
44
Emerging trends and future directions
49
Emerging trends and future directions
50
Non-destructive memory test
Also known as transparent MBIST
• Testing memory without loosing its initial content
• System can resume operation after test
Possible to take advantage of system idle time to perform some testing and
reduce latency of fault detection
Key takeaway: Non-destructive memory testing combines fault detection with data preservation
51
Testing memories with ECC
4 cases to address for memories with Error Correction Code (ECC) logic
Case 3: ECC assumed to fix hard errors during power-on self-test (POST)
• Applicable to SRAM to improve system availability
• Repair done as usual during manufacturing
53
Non-Volatile Memories (NVMs)
• NVMs come with new defects requiring new fault models and test algorithms
• NVMs also require calibration (trimming) of reference voltage or current for both read and write operations
• ECC logic often required on top of conventional row/column repair to improve yield
Bit cell structure composed of single access transistor and Magnetic Tunnel
Junction (MTJ)
Key takeaway: Its innovative use of spin-transfer torque for switching represents a major leap forward in
memory design, making it a strong candidate for future applications in both consumer and industrial markets 55
BIST for MRAM
MRAM adds two major requirements to BIST in order to improve yield
Use ECC to fix manufacturing defects that can’t be fixed with conventional repair
• ECC typically used in combination with row repair
• Column repair might also be used to repair column logic failures even though
repairable with ECC only
- e.g., failures in bitline, write driver or sense amplifier
56
Case studies and publications Siemens link
57
Scaling in AI
Data
2. Scaling comes in all forms, from data all the way to hardware
Model
3. Inference and test time compute is a new frontier for scaling
S/W
H/W
[3] Constitutional AI : Harmlesness from AI Feedback
Bai et al., Anthropic
At present
Future focus
Key takeaway: AI in EDA is not just about enhancing testing processes—it’s about transforming the entire
design and manufacturing lifecycle 59
Tutorial Takeaways
Thanks!
Acknowledgements:
I would like to thank Dr. Benoit Nadeau-Dostie, Ron Press, Albert Au, Artur
Pogiel and Sebastian Bromberek for feedback and helpful pointers.
60
Q&A and Discussion
61