Byte-Addressable System:
• If block offset = 100 (binary) = 4 (decimal), it means "the 5th byte" inside the block.
• But the CPU typically requests a whole word (4 bytes), not a single byte.
• So the cache controller delivers the entire word that contains that byte (bytes 4–7).
Word-Addressable System:
• If block offset = 100 (binary) = 4 (decimal), it means "the 5th word" inside the block (i.e., bytes 16–19 if words are 4 bytes).
• The cache returns that word directly.
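The byte- vs. word-addressable offset arithmetic above can be sketched as follows (a minimal illustration; the 4-byte word size is the slides' assumption):

```python
WORD_BYTES = 4  # word size assumed in the slides

def word_containing(byte_offset):
    """Byte-addressable: return the byte offsets of the whole word
    that contains byte_offset (the unit the cache delivers)."""
    start = (byte_offset // WORD_BYTES) * WORD_BYTES
    return list(range(start, start + WORD_BYTES))

def word_at(word_offset):
    """Word-addressable: the offset selects a whole word directly."""
    return list(range(word_offset * WORD_BYTES, (word_offset + 1) * WORD_BYTES))

print(word_containing(0b100))  # byte offset 4 -> bytes [4, 5, 6, 7]
print(word_at(0b100))          # word offset 4 -> bytes [16, 17, 18, 19]
```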
Caches
Dr. Sonali Chouhan
Embedded Systems (EE312)
© Sonali Chouhan
Many types of caches
• Examples
– H/W: L1, L2 CPU caches, translation lookaside buffers (TLBs), ...
– S/W: virtual memory, FS (filesystem) buffers, web browser caches, ...
Caches and CPUs
[Figure: the CPU sends an address to the cache controller; on a hit the cache returns data to the CPU, on a miss the controller passes the address to main memory and the returned data fills the cache.]
Caches
• Many common design issues
– Each cached item has a "tag" (an ID) plus contents
– Need a mechanism to efficiently determine whether a given item is cached
• combinations of indices and constraints on valid locations
– On a miss, usually need to pick something to replace with the new item
• called a "replacement policy"
– On writes, need to either propagate the change or mark the item as "dirty"
• Write-through (L1 + main) vs. write-back (L1)
• Different solutions for different caches
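The write-through vs. write-back bullet can be illustrated by counting main-memory writes for the same write stream (a sketch with a made-up trace; block numbers are hypothetical):

```python
def write_through(writes):
    # Every CPU write is propagated to main memory immediately.
    return len(writes)

def write_back(writes):
    # A write only marks the cached block dirty; main memory is updated
    # once per dirty block, when that block is eventually evicted.
    return len(set(writes))

trace = [0x10, 0x10, 0x10, 0x20]   # three writes to block 0x10, one to 0x20
print(write_through(trace))        # 4 memory writes
print(write_back(trace))           # 2 memory writes (one per dirty block)
```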
Terms
• Cache hit: the required location is in the cache.
• Cache miss: the required location is not in the cache.
• Working set: the set of memory locations/addresses used by a program in a given time interval.
Inserting an L1 Cache Between the CPU and Main Memory
• The tiny, very fast CPU register file has room for four 4-byte words.
• The transfer unit between the CPU register file and the cache is a 4-byte word.
• The small, fast L1 cache has room for two 4-word blocks (lines 0 and 1).
• The transfer unit between the cache and main memory is a 4-word block (16 bytes).
• The big, slow main memory has room for many 4-word blocks (e.g., block 10 "abcd", block 21 "pqrs", block 30 "wxyz").
General Organization of a Cache
• The cache is an array of S = 2^s sets.
• Each set contains one or more lines: E lines per set.
• Each line holds a block of data, with 1 valid bit and t tag bits per line, and B = 2^b bytes per cache block.
[Figure: sets 0 ... S-1, each containing E lines of the form | valid | tag | byte 0 ... byte B-1 |]
Cache size: C = B x E x S data bytes
Addressing Caches
Address A (m bits): <tag: t bits> <set index: s bits> <block offset: b bits>
• The word at address A is in the cache if the tag bits in one of the valid lines in set <set index> match <tag>.
• The word's contents begin at offset <block offset> bytes from the beginning of the block.
These fields are not separate entities: the CPU simply generates addresses, and the cache controller interprets each address as a tag, set index, and block offset.
Addressing Caches
Address A (m bits): <tag: t bits> <set index: s bits> <block offset: b bits>
1. Locate the set based on <set index>.
2. Locate the line in the set based on <tag>.
3. Check that the line is valid.
4. Locate the data in the line based on <block offset>.
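Step 1 of this lookup starts by splitting the address into its three fields. A minimal sketch, with assumed widths b = 4 (16-byte blocks), s = 8 (256 sets), and a 32-bit address:

```python
B_BITS, S_BITS = 4, 8   # assumed field widths, not from the slides

def split_address(addr):
    """Split an address into (tag, set index, block offset)."""
    offset = addr & ((1 << B_BITS) - 1)
    set_index = (addr >> B_BITS) & ((1 << S_BITS) - 1)
    tag = addr >> (B_BITS + S_BITS)
    return tag, set_index, offset

tag, set_index, offset = split_address(0x12345678)
print(hex(tag), hex(set_index), hex(offset))  # 0x12345 0x67 0x8
```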
Example: Direct-Mapped Cache
• The simplest kind of cache, easy to build (only 1 tag compare required per access).
• Characterized by exactly one line per set (E = 1).
[Figure: sets 0 ... S-1, each with a single | valid | tag | cache block | line]
Cache size: C = B x S data bytes
Accessing Direct-Mapped Caches
Set selection
– Use the set index bits to determine the set of interest.
[Figure: the set index bits (e.g., 00001) of the address <tag | set index | block offset> select one of sets 0 ... S-1.]
Accessing Direct-Mapped Caches
Line matching and word selection
– Line matching: find a valid line in the selected set with a matching tag.
– Word selection: then extract the word.
(1) The valid bit must be set.
(2) The tag bits in the cache line must match the tag bits in the address.
If (1) and (2), then cache hit.
[Figure: selected set (i) holds | valid = 1 | tag = 0110 | bytes 0-7 |, with the word b0 b1 b2 b3 in bytes 4-7; the address is <tag = 0110 | set index = i | block offset = 100>.]
Accessing Direct-Mapped Caches
Line matching and word selection (continued)
(3) If cache hit, the block offset selects the starting byte.
[Figure: block offset 100 = 4 selects byte b0 of the word b0 b1 b2 b3 stored at bytes 4-7 of the selected line.]
Cache Memory: Placement Policy
• There are three commonly used methods to translate main memory addresses to cache memory addresses:
– Associative Mapped Cache
– Direct-Mapped Cache
– Set-Associative Mapped Cache
• The choice of cache mapping scheme affects cost and performance; there is no single best method that is appropriate for all situations.
Associative Mapped Cache
• A block in main memory can be mapped to any available (not already occupied) block in cache memory.
• Advantage: flexibility. A main memory block can be mapped anywhere in cache memory.
• Disadvantage: slow or expensive. A search through all the cache memory blocks is needed to check whether the address can be matched to any of the tags.
Main Memory - Cache Structure
[Figure: a 4-line cache (lines 00-11, each holding a tag and a block of k words) alongside a 16-block main memory (blocks 0000-1111, each a block of k words).]
Associative Mapping - Example
[Figure: any of the 16 main memory blocks (0000..-1111..) may be placed in any cache line; each line holds a tag and a k-word block.]
Direct-Mapped Cache
• To avoid the search through all CM blocks needed by associative mapping, this method allows each main memory block to map to only one cache block; as a result,

    (# blocks in main memory) / (# blocks in cache memory)

main memory blocks are mapped to each cache memory block.
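The placement rule above can be sketched with the example sizes used in the slides (16 main memory blocks, 4 cache blocks):

```python
MM_BLOCKS, CM_BLOCKS = 16, 4

def cache_block_for(mm_block):
    # Direct mapping: each MM block may go to exactly one CM block.
    return mm_block % CM_BLOCKS

# MM_BLOCKS // CM_BLOCKS = 4 memory blocks share each cache block.
sharers = [b for b in range(MM_BLOCKS) if cache_block_for(b) == 0]
print(sharers)  # [0, 4, 8, 12]
```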
Direct Cache Mapping - Example
[Figure: a 4-line cache (sets 00-11, each with a tag and a block) and a 16-block main memory (0000..-1111..); 16/4 = 4 blocks of MM are mapped to each CM block.]
Direct-Mapped Cache Advantages
• The tag memory is much smaller than in an associative mapped cache.
• No need for an associative search, since the set field directs the comparison to a single line.
Direct-Mapped Cache Disadvantage
• It lacks mapping flexibility. For example, if two MM blocks mapped to the same CM block are needed repeatedly (e.g., in a loop), they will keep replacing each other, even though all other CM blocks may be available.
Direct Cache Mapping - Disadvantage (Example)
[Figure: a(i) lives in MM block 0000.. and b(i) in MM block 0100..; both map to cache set 00, so in a loop computing a(i) + b(i) the two blocks keep replacing each other.]
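The thrashing in the figure can be reproduced with a tiny direct-mapped simulation (a sketch; block numbers follow the figure, with a(i) in block 0000 and b(i) in block 0100):

```python
CM_BLOCKS = 4

def run_direct_mapped(accesses):
    """Count misses for a direct-mapped cache of CM_BLOCKS lines."""
    lines = {}        # cache line -> memory block currently held
    misses = 0
    for block in accesses:
        line = block % CM_BLOCKS
        if lines.get(line) != block:
            misses += 1
            lines[line] = block   # evict whatever was there
    return misses

trace = [0b0000, 0b0100] * 4      # loop alternating a(i), b(i)
print(run_direct_mapped(trace))   # 8 misses out of 8 accesses
```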
Set-Associative Mapping
• This is a trade-off between associative and direct mappings.
• The cache is broken into sets, where each set contains "N" cache lines, say 4. Each memory address is assigned a set and can be cached in any one of those 4 locations within the set it is assigned to. In other words, within each set the cache is associative, hence the name.
Set Associative Mapping - Example
[Figure: an 8-line cache organized as 4 sets (00-11) of 2 lines each; each of the 16 main memory blocks (0000..-1111..) maps to one set but may occupy either line within it.]
Example: Set Associative Cache
• Characterized by more than one line per set (here, E = 2 lines per set).
[Figure: sets 0 ... S-1, each with two | valid | tag | cache block | lines]
• Called an E-way associative cache.
Accessing Set Associative Caches
Set selection
– Identical to a direct-mapped cache.
[Figure: the set index bits (e.g., 0001) of the address <tag | set index | block offset> select one set; each set contains two lines.]
Accessing Set Associative Caches
Line matching and word selection
– Must compare the tag in each valid line in the selected set.
(1) The valid bit must be set.
(2) The tag bits in one of the cache lines must match the tag bits in the address.
If (1) and (2), then cache hit.
[Figure: selected set (i) holds two valid lines with tags 1001 and 0110; address tag 0110 matches the second line, whose bytes 4-7 hold the word b0 b1 b2 b3.]
Accessing Set Associative Caches
Line matching and word selection (continued)
– Word selection is the same as in a direct-mapped cache.
(3) If cache hit, the block offset selects the starting byte.
[Figure: block offset 100 = 4 selects byte b0 of the word b0 b1 b2 b3 in the matching line.]
Cache Line's Tag Size
• Depends on 3 factors:
– Size of cache memory;
– Associativity of cache memory;
– Cacheable range of operating memory.
• Stag = log2(Smemory x A / Scache)
Here,
Stag: size of cache tag, in bits;
Smemory: cacheable range of operating memory, in bytes;
Scache: size of cache memory, in bytes;
A: associativity of cache memory, in ways.
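From the definitions above, the tag size works out to Stag = log2(Smemory x A / Scache). A quick check with assumed numbers (4 GiB cacheable memory, 64 KiB cache, 4-way):

```python
from math import log2

def tag_bits(s_memory, s_cache, ways):
    # Stag = log2(Smemory * A / Scache); exact when all sizes are powers of two.
    return int(log2(s_memory * ways / s_cache))

print(tag_bits(4 * 2**30, 64 * 2**10, 4))  # 32 + 2 - 16 = 18 tag bits
```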
Memory system performance
Cache operation
• Many main memory locations are
mapped onto one cache entry.
• May have caches for:
– instructions;
– data;
– data + instructions (unified).
• Memory access time is no longer
deterministic.
Cache Performance Metrics
• Miss Rate
– Fraction of memory references not found
in cache (misses / accesses)
• 1 – hit rate
– Typical numbers (in percentages):
• 3-10% for L1
• can be quite small (e.g., < 1%) for L2,
depending on size, etc.
Cache Performance Metrics
• Hit Time
– Time to deliver a line in the cache to the processor
• includes time to determine whether the line is in the cache
– Typical numbers:
• 1-2 clock cycles for L1
• 5-20 clock cycles for L2
• Miss Penalty
– Additional time required because of a miss
• typically 50-200 cycles for main memory (Trend: increasing!)
Let's think about those numbers
• Huge difference between a hit and a miss
– 100X, if just L1 and main memory
• Would you believe 99% hits is twice as good as 97%?
– Consider these numbers: cache hit time of 1 cycle, miss penalty of 100 cycles.
So, the average access time is:
97% hits: 1 cycle + 0.03 x 100 cycles = 4 cycles
99% hits: 1 cycle + 0.01 x 100 cycles = 2 cycles
This is why "miss rate" is used instead of "hit rate".
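The arithmetic above as a small helper (hit time and miss penalty default to the slide's 1 and 100 cycles):

```python
def avg_access_cycles(hit_rate, hit_time=1, miss_penalty=100):
    """Average access time = hit time + miss rate x miss penalty."""
    return hit_time + (1 - hit_rate) * miss_penalty

print(avg_access_cycles(0.97))  # ~4 cycles
print(avg_access_cycles(0.99))  # ~2 cycles
```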
Memory system performance
• h = cache hit rate.
• t_cache = cache access time, t_main = main memory access time.
• Average memory access time (AMAT):
– AMAT = hit rate_L1 x access time_L1 + miss rate_L1 x miss penalty_L1
– t_av = h x t_cache + (1 - h) x t_main
Reference
Computer Organization and Design: The Hardware/Software Interface,
by David A. Patterson and John L. Hennessy