CS3501 Compiler Design Unit-4
Computer Science and Engineering (Jeppiaar Engineering College)
UNIT IV RUN-TIME ENVIRONMENT AND CODE GENERATION
Storage Organization, Stack Allocation Space, Access to Non-local Data on the Stack, Heap
Management - Issues in Code Generation - Design of a simple Code Generator.
STORAGE ORGANISATION
The executing target program runs in its own logical address space in which each program
value has a location.
The management and organization of this logical address space is shared between the compiler,
the operating system and the target machine. The operating system maps logical addresses into
physical addresses, which are usually spread throughout memory.
Typical subdivision of run-time memory:
- Code
- Static Data
- Stack
- Free Memory
- Heap
Run-time storage comes in blocks, where a byte is the smallest unit of addressable memory.
Four bytes form a machine word. Multibyte objects are stored in consecutive bytes and given
the address of the first byte.
The storage layout for data objects is strongly influenced by the addressing constraints of the target
machine.
A character array of length 10 needs only enough bytes to hold 10 characters, but a compiler
may allocate 12 bytes to satisfy alignment constraints, leaving 2 bytes unused.
This unused space due to alignment considerations is referred to as padding.
The size of some program objects may be known at compile time; these may be placed in an area
called static.
The dynamic areas used to maximize the utilization of space at run time are stack and heap.
Activation records:
Procedure calls and returns are usually managed by a run time stack called the control stack.
Each live activation has an activation record on the control stack, with the record for the root
of the activation tree at the bottom; the record for the activation currently executing is at
the top of the stack.
The contents of the activation record vary with the language being implemented. A general
activation record contains the following fields:
- Temporaries
- Local data
- Saved machine status
- Control link
- Access link
- Actual parameters
- Return value
Temporary values such as those arising from the evaluation of expressions.
Local data belonging to the procedure whose activation record this is.
A saved machine status, with information about the state of the machine just before the call to
the procedure.
An access link may be needed to locate data needed by the called procedure but found
elsewhere.
A control link pointing to the activation record of the caller.
Space for the return value of the called function, if any. Not all called procedures return a
value, and if one does, we may prefer to place that value in a register for efficiency.
The actual parameters used by the calling procedure. These are often placed in registers rather
than in the activation record, when possible, for greater efficiency.
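To make the layout concrete, here is a minimal sketch of a general activation record as a C struct. The field names, types and sizes are illustrative assumptions only; the real layout is fixed by the compiler and target machine, and several of these fields often live in registers instead.

```c
#include <stdio.h>

/* Hypothetical activation-record layout; field sizes are arbitrary. */
struct activation_record {
    int   return_value;      /* space for the callee's return value           */
    int   actual_params[4];  /* actual parameters (often passed in registers) */
    void *access_link;       /* locates non-local data, if needed             */
    void *control_link;      /* points to the caller's activation record      */
    long  saved_status[8];   /* saved machine status (PC, registers)          */
    int   local_data[16];    /* locals of the procedure being activated       */
    int   temporaries[8];    /* temporaries from expression evaluation        */
};

int main(void) {
    printf("frame size: %zu bytes\n", sizeof(struct activation_record));
    return 0;
}
```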
STORAGE ALLOCATION STRATEGIES
The different storage allocation strategies are:
1. Static allocation – lays out storage for all data objects at compile time
2. Stack allocation – manages the run-time storage as a stack.
3. Heap allocation – allocates and deallocates storage as needed at run time from a data area known as
heap.
Static allocation
In static allocation, names are bound to storage as the program is compiled, so there is no need for
a run-time support package.
Since the bindings do not change at run time, every time a procedure is activated, its names
are bound to the same storage locations.
Therefore values of local names are retained across activations of a procedure. That is, when
control returns to a procedure the values of the locals are the same as they were when control left
the last time.
From the type of a name, the compiler decides the amount of storage for the name and decides
where the activation records go. At compile time, we can fill in the addresses at which the target
code can find the data it operates on.
Stack allocation
All compilers for languages that use procedures, functions or methods as units of user-defined
actions manage at least part of their run-time memory as a stack.
Each time a procedure is called, space for its local variables is pushed onto a stack, and when
the procedure terminates, that space is popped off the stack.
Calling sequences:
Procedure calls are implemented by what is known as a calling sequence, which consists of code
that allocates an activation record on the stack and enters information into its fields.
A return sequence is similar code that restores the state of the machine so the calling
procedure can continue its execution after the call.
The code in calling sequence is often divided between the calling procedure (caller) and the
procedure it calls (callee).
When designing calling sequences and the layout of activation records, the following principles
are helpful:
Values communicated between caller and callee are generally placed at the beginning
of the callee’s activation record, so they are as close as possible to the caller’s
activation record.
Fixed length items are generally placed in the middle. Such items typically include the control
link, the access link, and the machine status fields.
Items whose size may not be known early enough are placed at the end of the activation
record. The most common example is a dynamically sized array, where the value of one of the
callee's parameters determines the length of the array.
We must locate the top-of-stack pointer judiciously. A common approach is to have it point to the
end of fixed-length fields in the activation record. Fixed-length data can then be accessed by fixed
offsets, known to the intermediate-code generator, relative to the top-of-stack pointer.
[Figure: the caller's activation record (parameters and returned values, control link, links and
saved status, temporaries and local data) sits below the callee's record. In the callee's record,
the parameters and returned values, the control link, and the links and saved status are the
caller's responsibility; the temporaries and local data are the callee's responsibility. The
pointer top_sp marks the end of the callee's fixed-length status fields.]
Division of tasks between caller and callee
The calling sequence and its division between caller and callee are as follows.
The caller evaluates the actual parameters.
The caller stores a return address and the old value of top_sp into the callee's activation
record. The caller then increments top_sp past its own local data and temporaries and the
callee's parameter and status fields.
The callee saves the register values and other status information.
The callee initializes its local data and begins execution.
A suitable, corresponding return sequence is:
The callee places the return value next to the parameters.
Using the information in the machine-status field, the callee restores top_sp and other
registers, and then branches to the return address that the caller placed in the status field.
Although top_sp has been decremented, the caller knows where the return value is, relative to the
current value of top_sp; the caller therefore may use that value.
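The following C sketch simulates this division of labor on a software-managed stack: the caller plants the return address and the old top_sp in the callee's record and advances top_sp, and the return sequence restores top_sp from the control link. The names (stack_mem, enter, leave) and the two-word status layout are assumptions made for illustration; a real compiler emits this logic as target-machine code.

```c
#include <stdio.h>

#define STACK_WORDS 1024
static long stack_mem[STACK_WORDS];
static long top_sp = 0;                         /* start of the current activation record */

/* Caller side of the calling sequence. */
static void enter(long return_addr, long caller_frame_words) {
    long callee = top_sp + caller_frame_words;  /* callee record begins past caller's frame */
    stack_mem[callee]     = return_addr;        /* saved machine status: return address    */
    stack_mem[callee + 1] = top_sp;             /* control link: caller's top_sp           */
    top_sp = callee;                            /* increment top_sp                        */
}

/* Callee side of the return sequence. */
static long leave(void) {
    long return_addr = stack_mem[top_sp];
    top_sp = stack_mem[top_sp + 1];             /* restore top_sp; the record is popped */
    return return_addr;
}

int main(void) {
    enter(100, 8);                        /* main calls p; p's record starts at word 8 */
    enter(200, 8);                        /* p calls q; q's record starts at word 16   */
    printf("return to %ld\n", leave());   /* 200: back in p    */
    printf("return to %ld\n", leave());   /* 100: back in main */
    return 0;
}
```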
Variable length data on stack:
The run-time memory-management system must frequently deal with the allocation of space for
objects whose sizes are not known at compile time, but which are local to a procedure and thus
may be allocated on the stack.
The reason to prefer placing objects on the stack is that we avoid the expense of garbage collecting
their space.
The same scheme works for objects of any type if they are local to the procedure called and have a
size that depends on the parameters of the call.
[Figure: the activation record for p contains a control link and, among its fixed-length fields,
pointers to arrays A, B and C; the arrays of p are stored on the stack just above the record.
Above them sits the activation record for a procedure q called by p, with top_sp pointing to the
end of its fixed-length fields, followed by the arrays of q, with top marking the actual top of
stack.]
Access to dynamically allocated arrays
Procedure p has three local arrays, whose sizes cannot be determined at compile time. The
storage for these arrays is not part of the activation record for p.
Access to the data is through two pointers, top and top_sp. Here top marks the actual top of
stack; it points to the position at which the next activation record will begin.
The second pointer, top_sp, is used to find local, fixed-length fields of the top activation record.
The code to reposition top and top-sp can be generated at compile time, in terms of sizes that will
become known at run time.
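A minimal sketch of that idea, again as a C simulation: the fixed-length part of the record stores a pointer to the array, the array data is carved out between top_sp and top, and top is repositioned by an amount known only at run time. Names and offsets are assumptions for illustration.

```c
#include <stdio.h>

#define STACK_WORDS 4096
static long stack_mem[STACK_WORDS];
static long top_sp = 0;   /* fixed-length fields of the top record start here */
static long top    = 2;   /* actual top of stack: next free word              */

int main(void) {
    long n = 10;                      /* array length known only at run time     */
    stack_mem[top_sp + 1] = top;      /* fixed slot holds a pointer to the array */
    top += n;                         /* reposition top past the array data      */

    long *a = &stack_mem[stack_mem[top_sp + 1]];
    for (long i = 0; i < n; i++) a[i] = i * i;
    printf("a[3] = %ld; next record would begin at word %ld\n", a[3], top);
    return 0;
}
```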
Heap allocation
The stack allocation strategy cannot be used if either of the following is possible:
1. The values of local names must be retained when an activation ends.
2. A called activation outlives the caller.
Heap allocation parcels out pieces of contiguous storage, as needed for activation records or
other objects.
Pieces may be deallocated in any order, so over time the heap will consist of alternating
areas that are free and in use.
[Figure: in the activation tree, s calls r and later calls q(1,9). The heap holds the activation
record for s, the retained activation record for r, and the record for q(1,9), each with its
control link.]
The record for an activation of procedure r is retained when the activation ends.
Therefore, the record for the new activation q(1 , 9) cannot follow that for s physically.
If the retained activation record for r is deallocated, there will be free space in the heap
between the activation records for s and q.
ISSUES IN THE DESIGN OF A CODE GENERATOR
The following issues arise during the code generation phase:
1. Input to code generator
2. Target program
3. Memory management
4. Instruction selection
5. Register allocation
6. Evaluation order
1. Input to code generator:
The input to the code generator consists of the intermediate representation of the source program
produced by the front end, together with information in the symbol table that is used to determine
the run-time addresses of the data objects denoted by the names in the intermediate representation.
The intermediate representation can be:
a. Linear representation such as postfix notation
b. Three-address representation such as quadruples
c. Virtual machine representation such as stack machine code
d. Graphical representations such as syntax trees and DAGs.
Prior to code generation, the source program must have been scanned, parsed and translated into
intermediate representation, along with necessary type checking. Therefore, the input to the
code generator is assumed to be error-free.
2. Target program:
The output of the code generator is the target program. The output may be:
a. Absolute machine language
- It can be placed in a fixed memory location and can be executed immediately.
b. Relocatable machine language
- It allows subprograms to be compiled separately.
c. Assembly language
- Code generation is made easier.
3. Memory management:
Names in the source program are mapped to addresses of data objects in run-time memory by
the front end and code generator.
It makes use of the symbol table; that is, a name in a three-address statement refers to a
symbol-table entry for the name.
Labels in three-address statements have to be converted to addresses of instructions. For
example, the statement j: goto i generates a jump instruction as follows:
If i < j, the jump is backward, and a jump instruction with target address equal to the
location of the code for quadruple i is generated.
If i > j, the jump is forward. We must store on a list for quadruple i the location of the
first machine instruction generated for quadruple j. When i is processed, the machine
locations for all instructions that jump forward to i are filled in, as sketched below.
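The C sketch below illustrates this backpatching idea under simplifying assumptions (one pending jump per quadruple; the arrays code and pending are hypothetical names):

```c
#include <stdio.h>

#define MAXQ 100
static int code[MAXQ];      /* emitted instructions; a jump's target word lives here */
static int pending[MAXQ];   /* pending[i] = location of a jump waiting on quadruple i,
                               or -1; a real generator keeps a list per quadruple     */

int main(void) {
    for (int i = 0; i < MAXQ; i++) pending[i] = -1;
    int loc = 0;

    pending[5] = loc;       /* forward jump "goto 5": target not yet known   */
    code[loc++] = -1;       /* emit the jump with a hole for the address     */
    code[loc++] = 0;        /* ...instructions for intervening quadruples... */

    int addr_of_quad5 = loc;              /* quadruple 5 is reached here */
    if (pending[5] != -1)
        code[pending[5]] = addr_of_quad5; /* backpatch the waiting jump  */

    printf("jump at location 0 now targets %d\n", code[0]);
    return 0;
}
```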
4. Instruction selection:
The instruction set of the target machine should be complete and uniform.
Instruction speeds and machine idioms are important factors when efficiency of target program
is considered.
The quality of the generated code is determined by its speed and size.
For example, every three-address statement of the form x := y + z, where x, y and z are
statically allocated, can be translated into the code sequence shown below.
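The translation below follows the standard textbook treatment; the register name is illustrative:

MOV y, R0
ADD z, R0
MOV R0, x

Translating statement by statement in this way often produces poor code; for instance, the pair a := b + c and d := a + e would yield a redundant MOV a, R0 immediately after MOV R0, a, which a better code generator avoids.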
5. Register allocation
Instructions involving register operands are shorter and faster than those involving operands in
memory.
The use of registers is subdivided into two subproblems:
Register allocation – the set of variables that will reside in registers at a point in the program is selected.
Register assignment – the specific register that a variable will reside in is picked.
Certain machines require even/odd register pairs for some operands and results. For
example, consider a division instruction of the form:
D x, y
where x, the dividend, occupies the even register of an even/odd register pair, and y is the
divisor. After division, the even register holds the remainder and the odd register holds the
quotient.
6. Evaluation order
The order in which the computations are performed can affect the efficiency of the target code.
Some computation orders require fewer registers to hold intermediate results than others.
A SIMPLE CODE GENERATOR
A code generator generates target code for a sequence of three-address statements and effectively
uses registers to store the operands of the statements.
Register and Address Descriptors:
A register descriptor is used to keep track of what is currently in each register. The register
descriptors show that initially all the registers are empty.
An address descriptor stores the location where the current value of the name can be found at run
time.
A code-generation algorithm:
The algorithm takes as input a sequence of three-address statements constituting a basic block. For
each three-address statement of the form x := y op z, perform the following actions:
1. Invoke a function getreg to determine the location L where the result of the computation y op z should
be stored.
2. Consult the address descriptor for y to determine y', the current location of y. Prefer the register
for y' if the value of y is currently both in memory and a register. If the value of y is not already in
L, generate the instruction MOV y', L to place a copy of y in L.
3. Generate the instruction OP z', L where z' is a current location of z. Prefer a register to a
memory location if z is in both. Update the address descriptor of x to indicate that x is in location
L. If L is a register, update its descriptor to indicate that it contains the value of x, and remove x
from all other register descriptors.
4. If the current values of y or z have no next uses, are not live on exit from the block, and are in
registers, alter the register descriptor to indicate that, after execution of x : = y op z , those
registers will no longer contain y or z.
The getreg function can proceed as follows (a simplified sketch in C follows the list):
1. If the value of x is already in a register, that register is returned.
2. If (1) fails, an empty register is returned, if one is available.
3. If (2) fails and the operation needs a special register, the value in that register is temporarily
moved to memory and the register is returned.
4. If (3) fails, a memory location is returned.
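A simplified C sketch of the register descriptors and the first two steps of getreg; the data layout, NREGS, and the omission of spill handling are assumptions for illustration, not the full algorithm:

```c
#include <stdio.h>
#include <string.h>

#define NREGS 2
static char reg_desc[NREGS][8];   /* reg_desc[r] = name whose value register r holds
                                     ("" means the register is empty)               */

/* Return a register for the result of x := y op z. */
static int getreg(const char *x) {
    for (int r = 0; r < NREGS; r++)           /* step 1: reuse x's register */
        if (strcmp(reg_desc[r], x) == 0) return r;
    for (int r = 0; r < NREGS; r++)           /* step 2: any empty register */
        if (reg_desc[r][0] == '\0') return r;
    return 0;  /* steps 3-4 (spill to memory / return a memory location) omitted */
}

int main(void) {
    int r = getreg("t");                      /* t := a - b                     */
    strcpy(reg_desc[r], "t");                 /* update the register descriptor */
    printf("t assigned to R%d\n", r);
    return 0;
}
```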
Generating Code for Assignment Statements:
The assignment d := (a-b) + (a-c) + (a-c) might be translated into the following three-address
code sequence:
t := a - b
u := a - c
v := t + u
d := v + u
with d live at the end.
Code sequence for the example is:

Statements    Code Generated    Register descriptor    Address descriptor
                                Registers empty
t := a - b    MOV a, R0         R0 contains t          t in R0
              SUB b, R0
u := a - c    MOV a, R1         R0 contains t          t in R0
              SUB c, R1         R1 contains u          u in R1
v := t + u    ADD R1, R0        R0 contains v          u in R1
                                R1 contains u          v in R0
d := v + u    ADD R1, R0        R0 contains d          d in R0
              MOV R0, d                                d in R0 and memory
Generating Code for Indexed Assignments
The table shows the code sequences generated for the indexed assignment statements a := b[i]
and a[i] := b, where Ri is the register holding i:

Statements    Code Generated
a := b[i]     MOV b(Ri), R
a[i] := b     MOV b, a(Ri)
Generating Code for Pointer Assignments
The table shows the code sequences generated for the pointer assignments a := *p and *p := a,
where Rp is the register holding p:

Statements    Code Generated
a := *p       MOV *Rp, a
*p := a       MOV a, *Rp
Generating Code for Conditional Statements
Statement                        Code Generated
if x < y goto z                  CMP x, y
                                 CJ< z   /* jump to z if condition code is negative */
x := y + z; if x < 0 goto z      MOV y, R0
                                 ADD z, R0
                                 MOV R0, x
                                 CJ< z
Runtime Environments:
A runtime environment, often referred to as a runtime system or runtime library,
is a crucial component of a programming language's execution model. It provides
the necessary infrastructure to run programs written in that language. The
runtime environment includes various components and services that facilitate
the execution of code. Here are the key components and functions of a runtime
environment:
1. **Memory Management:**
- The runtime environment manages memory allocation and deallocation
for variables, objects, and data structures.
- It handles memory leaks, freeing up memory that is no longer in use.
2. **Execution Stack:**
- The runtime environment maintains an execution stack, often referred to as
the call stack, which keeps track of function calls and their local variables.
- It handles the allocation and deallocation of stack frames for function calls.
3. **Garbage Collection:**
- In languages with automatic memory management, the runtime
environment includes a garbage collector responsible for identifying and
reclaiming memory that is no longer reachable.
4. **Type System:**
- The runtime environment enforces the type system of the language,
ensuring that operations are performed on compatible data types.
- It may perform type checking and type coercion during runtime.
5. **Exception Handling:**
- The runtime environment provides mechanisms for handling
exceptions, including try-catch blocks or exception tables.
- It ensures that exceptions are caught, and appropriate actions are
taken, such as jumping to an exception handler.
6. **Dynamic Dispatch (Polymorphism):**
- In object-oriented languages, the runtime environment supports dynamic
method dispatch, allowing method calls on objects to be resolved at runtime
based on their actual types.
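In C, which has no built-in dispatch, the mechanism can be sketched with a per-type table of function pointers (a "vtable"); the names below are illustrative:

```c
#include <stdio.h>

struct shape;
struct shape_vtable { double (*area)(const struct shape *); };
struct shape { const struct shape_vtable *vt; double w, h; };

static double rect_area(const struct shape *s) { return s->w * s->h; }
static double tri_area(const struct shape *s)  { return 0.5 * s->w * s->h; }

static const struct shape_vtable RECT_VT = { rect_area };
static const struct shape_vtable TRI_VT  = { tri_area  };

int main(void) {
    struct shape r = { &RECT_VT, 3, 4 }, t = { &TRI_VT, 3, 4 };
    const struct shape *objs[] = { &r, &t };
    for (int i = 0; i < 2; i++)     /* the call is resolved at run time */
        printf("area = %.1f\n", objs[i]->vt->area(objs[i]));
    return 0;
}
```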
7. **I/O Operations:**
- The runtime environment provides functions or libraries for input and
output operations, including reading from and writing to files, console, and
network.
8. **Concurrency and Multithreading:**
- In languages that support concurrency and multithreading, the runtime
environment manages threads, synchronization, and thread-specific data.
9. **Dynamic Linking and Loading:**
- The runtime environment may handle dynamic linking and loading of libraries
or modules at runtime.
- It resolves external references and links them to the running program.
10. **Environment Variables and Configuration:**
- The runtime environment allows access to environment variables and
system configuration settings, which can be useful for program configuration
and customization.
11. **Standard Library:**
- Most runtime environments include a standard library that provides
common utility functions, data structures, and algorithms.
- Programmers can leverage these libraries to perform various tasks without
reinventing the wheel.
12. **Profiling and Debugging Tools:**
- Some runtime environments come with profiling and debugging
tools that help developers identify performance bottlenecks, memory
issues, and logical errors.
13. **Security Features:**
- The runtime environment may include security mechanisms to protect
against common vulnerabilities like buffer overflows, unauthorized access, and
code injection attacks.
14. **Platform Abstraction:**
- The runtime environment abstracts the underlying hardware and
operating system, providing a consistent interface for program execution
across different platforms.
15. **Resource Management:**
- It manages system resources such as file handles, network sockets,
and threads, ensuring they are properly allocated and released.
The specifics of a runtime environment can vary significantly depending on the
programming language and the execution model. High-level languages like Java,
Python, and C# have robust runtime environments that provide many of these
features, while lower-level languages like C and C++ may have more minimal
runtime environments with fewer abstractions. Understanding the runtime
environment is crucial for both application developers and system programmers,
as it impacts program behavior, performance, and resource usage.
Source Language Issues:
Source language issues refer to various considerations, challenges, and design
decisions that arise when developing or working with a programming language.
These issues encompass language design, syntax, semantics, and pragmatics.
Here are some common source language issues:
1. **Syntax Design:**
- Syntax design involves defining the rules for writing valid programs in the
language.
- Decisions include the choice of symbols, keywords, punctuation, and the
structure of statements and expressions.
- Striking a balance between readability, expressiveness, and simplicity is
critical.
2. **Semantics:**
- Semantics define the meaning of program constructs in a language.
- Designers need to consider how different language features behave, such
as variable scoping, data types, and operations.
- Ensuring consistent and predictable behavior is essential.
3. **Type System:**
- The type system determines how data types are defined, manipulated, and
used in the language.
- Issues include type safety, type compatibility, type inference, and the
presence of primitive and user-defined types.
4. **Control Flow:**
- Designers must decide on the control flow constructs available in the
language, including conditionals (if-else), loops (for, while), and branching (goto
or labeled break/continue).
- Consideration of structured programming principles and readability is
important.
5. **Concurrency and Parallelism:**
- Languages need to address the challenges of concurrent and parallel
programming.
- Issues include support for threads, synchronization mechanisms, and
avoiding race conditions.
6. **Error Handling:**
- The language should provide mechanisms for error handling, such as
exceptions, error codes, or runtime checks.
- Designing a robust error-handling system can greatly improve program
reliability.
7. **Memory Management:**
- Memory management issues include manual memory allocation and
deallocation, garbage collection, and ownership models (e.g., reference
counting).
- Decisions impact both performance and safety.
8. **Standard Library:**
- Designing a comprehensive standard library with well-documented
functions and modules is essential.
- The library should cover essential tasks like file I/O, networking, data
structures, and algorithms.
9. **Interoperability:**
- Languages often need to interact with other languages or libraries.
Designers must consider how to facilitate inter-language communication.
- Issues include foreign function interfaces (FFIs), calling conventions,
and data marshaling.
10. **Extensibility and Modularity:**
- Languages should support modular programming through mechanisms
like modules, namespaces, and packages.
- Extensibility via libraries or plugins is also important for language ecosystems.
11. **Backward Compatibility:**
- Maintaining backward compatibility is crucial when evolving a language.
- Decisions about how to handle deprecated features and versioning
impact the user base.
12. **Tooling and Development Environment:**
- Designing a language includes considering the availability of development
tools, such as compilers, debuggers, and IDE support.
- A robust development ecosystem can greatly aid programmers.
13. **Documentation and Learning Curve:**
- The language should have clear and accessible documentation for
users, including tutorials, reference manuals, and examples.
- Minimizing the learning curve for new programmers is essential for language
adoption.
14. **Performance Optimization:**
- Language designers must consider performance optimization
techniques, including compiler optimizations, runtime profiling, and the
ability to write high-performance code.
15. **Community and Ecosystem:**
- Building and nurturing a community around the language is vital for
its long-term success.
- Supporting package managers, code repositories, and forums fosters
collaboration and growth.
16. **Platform and Architecture Support:**
- Designers must decide which platforms and architectures the language will
support.
- Cross-platform compatibility is often a desirable feature.
Source language issues vary depending on the goals and target audience of the
language. Language designers need to carefully weigh these considerations to
create a language that is expressive, efficient, and user-friendly. Additionally,
language evolution and community feedback play significant roles in addressing
and refining these issues over time.
Dynamic Storage Allocation:
Dynamic storage allocation refers to the process of allocating and managing
memory at runtime, as opposed to static storage allocation, where memory is
allocated at compile time. Dynamic allocation allows programs to work with data
structures whose sizes or lifetimes are not known until the program is running.
It's a critical aspect of memory management in
modern programming languages. Here are some key concepts and methods
related to dynamic storage allocation:
**1. Heap Memory:**
- Dynamic storage is typically allocated from a region of memory known as the
heap.
- The heap is a region of memory separate from the program's stack, and it
is used for dynamic allocation because it can grow and shrink as needed
during program execution.
**2. Allocation and Deallocation:**
- Allocation refers to the process of reserving a portion of memory from the
heap to store data dynamically.
- Deallocation is the process of releasing memory that is no longer needed to
be used by the program. This helps prevent memory leaks.
**3. Dynamic Memory Allocation Functions:**
- Most programming languages provide functions or mechanisms to
allocate memory dynamically, such as `malloc`, `calloc`, `realloc`, and
`free` in C and C++.
- Languages like Python, Java, and C# have built-in memory management
systems that automatically handle dynamic memory allocation and
deallocation.
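A short example using the standard C functions just mentioned:

```c
#include <stdio.h>
#include <stdlib.h>

int main(void) {
    int n = 5;
    int *a = malloc(n * sizeof *a);          /* allocate from the heap */
    if (a == NULL) return 1;                 /* allocation can fail    */
    for (int i = 0; i < n; i++) a[i] = i;

    int *b = realloc(a, 2 * n * sizeof *a);  /* grow the block         */
    if (b == NULL) { free(a); return 1; }
    a = b;

    printf("a[4] = %d\n", a[4]);
    free(a);                                 /* deallocate to avoid a leak */
    return 0;
}
```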
**4. Memory Leaks:**
- Memory leaks occur when memory is allocated dynamically, but it is
not properly deallocated when it is no longer needed.
- Over time, memory leaks can lead to the exhaustion of available
memory, causing a program or system to become unstable or crash.
**5. Dangling Pointers:**
- Dangling pointers are pointers that reference memory locations that have
already been deallocated.
- Accessing a dangling pointer can lead to unpredictable behavior, including
crashes or data corruption.
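A minimal illustration of the hazard:

```c
#include <stdio.h>
#include <stdlib.h>

int main(void) {
    int *p = malloc(sizeof *p);
    if (p == NULL) return 1;
    *p = 42;
    free(p);          /* p now dangles: its target has been deallocated     */
    /* printf("%d\n", *p);  <- using p here would be undefined behavior     */
    p = NULL;         /* common defensive practice after free               */
    return 0;
}
```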
**6. Fragmentation:**
- Fragmentation occurs when free memory is divided into small, non-
contiguous blocks, making it challenging to allocate large contiguous blocks
even when sufficient free memory is available.
- This can lead to inefficient memory utilization.
**7. Garbage Collection:**
- Some languages, like Java, JavaScript, and C#, use automatic garbage
collection to manage dynamic memory.
- Garbage collectors automatically identify and reclaim memory that is no
longer reachable or referenced by the program.
**8. Reference Counting:**
- Reference counting is a technique where each dynamically allocated object
keeps track of the number of references to it.
- When the reference count reaches zero, meaning there are no more
references to the object, the memory is automatically deallocated.
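A minimal reference-counting sketch in C (the names are illustrative):

```c
#include <stdio.h>
#include <stdlib.h>

struct rc_obj { int refcount; int payload; };

static struct rc_obj *rc_new(int v) {
    struct rc_obj *o = malloc(sizeof *o);
    if (o) { o->refcount = 1; o->payload = v; }   /* one initial reference */
    return o;
}
static void rc_retain(struct rc_obj *o) { o->refcount++; }
static void rc_release(struct rc_obj *o) {
    if (--o->refcount == 0) {                     /* no references remain */
        free(o);
        printf("object freed\n");
    }
}

int main(void) {
    struct rc_obj *o = rc_new(7);
    rc_retain(o);      /* a second reference appears    */
    rc_release(o);     /* count drops to 1              */
    rc_release(o);     /* count drops to 0: deallocated */
    return 0;
}
```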
**9. Memory Pools and Custom Allocators:**
- In some scenarios, custom memory allocation strategies, like memory
pools or custom allocators, are employed to optimize memory allocation for
specific use cases.
**10. Thread-Safety:**
- In multi-threaded programs, dynamic memory allocation and
deallocation should be thread-safe to avoid race conditions and data
corruption.
- Thread-safe memory management often requires synchronization
mechanisms.
**11. Memory Overhead:**
- Dynamic memory allocation may come with memory overhead due to
bookkeeping information that tracks allocated blocks.
- Efficient management of memory overhead is important for minimizing
wasted memory.
Dynamic storage allocation is a powerful feature in programming that enables
flexible data structures and efficient memory utilization. However, it also requires
careful attention to memory management practices to avoid common pitfalls like
memory leaks and dangling pointers. Different programming languages and
environments offer various tools and techniques for managing dynamic memory
effectively.
Optimal Code Generation for Expressions:
Optimal code generation for expressions is a critical aspect of compiler design and
code optimization. The goal is to generate machine code or intermediate code
that executes expressions efficiently while minimizing runtime computation and
memory usage. Here are some techniques and strategies for achieving optimal
code generation for expressions:
**1. Use Efficient Data Structures:**
- Choose appropriate data structures to represent expressions during
compilation, such as abstract syntax trees (ASTs) or intermediate
representations (IRs).
- These data structures should facilitate easy traversal and manipulation of the
expression tree.
**2. Constant Folding:**
- Identify constant subexpressions within an expression and compute
their values at compile time.
- Replace the constant subexpressions with their computed values to
reduce runtime computation.
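A tiny C sketch of constant folding over an expression tree; the node representation is an assumption for illustration:

```c
#include <stdio.h>

/* op '+' is an addition node; op 'k' is a constant leaf. */
struct node { char op; int value; struct node *l, *r; };

static void fold(struct node *n) {
    if (n == NULL || n->op == 'k') return;
    fold(n->l);
    fold(n->r);
    if (n->op == '+' && n->l->op == 'k' && n->r->op == 'k') {
        n->value = n->l->value + n->r->value;   /* computed at compile time */
        n->op = 'k';                            /* node becomes a constant  */
    }
}

int main(void) {
    struct node a = {'k', 2, NULL, NULL}, b = {'k', 3, NULL, NULL};
    struct node sum = {'+', 0, &a, &b};         /* represents 2 + 3 */
    fold(&sum);
    printf("folded: op=%c value=%d\n", sum.op, sum.value);  /* k 5 */
    return 0;
}
```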
**3. Algebraic Simplification:**
- Apply algebraic simplification rules to reduce the complexity of expressions.
- For example, simplify `x + 0` to `x`, `x * 1` to `x`, or `x - x` to `0`.
**4. Strength Reduction:**
- Replace expensive operations with less expensive equivalents.
- For example, replace multiplication with shifts or divisions with
multiplications by reciprocals.
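For example, a multiplication by a power of two can be emitted as a shift; for unsigned operands the two forms are equivalent:

```c
#include <stdio.h>

int main(void) {
    unsigned x = 5;
    printf("%u %u\n", x * 8, x << 3);   /* both print 40 */
    return 0;
}
```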
**5. Common Subexpression Elimination (CSE):**
- Identify and eliminate redundant calculations of the same
subexpression within an expression.
- Replace repeated calculations with a single computation and reuse the result.
**6. Register Allocation:**
- Allocate registers efficiently for intermediate values used in expressions.
- Minimize the need to store intermediate results in memory by keeping them in
registers.
**7. Instruction Selection:**
- Choose the most efficient machine instructions for performing
operations in an expression.
- Use SIMD (Single Instruction, Multiple Data) instructions where applicable for
vectorized operations.
**8. Optimize Conditional Branching:**
- Optimize branching conditions in expressions to reduce the number of
branches and improve branch prediction.
- Use techniques like branch folding to simplify conditional expressions.
**9. Loop-Invariant Code Motion:**
- Identify expressions within loops that do not change during loop iterations
(loop-invariant expressions).
- Move these expressions outside the loop to avoid redundant computations.
**10. Inline Function Calls:**
- For small, frequently called functions or expressions, consider inlining them
to eliminate the function call overhead.
**11. Compiler Optimizations:**
- Take advantage of compiler optimizations, such as loop
optimization, constant propagation, and dead code elimination, to
improve code quality for expressions.
**12. Target-Specific Optimization:**
- Consider target-specific optimizations that take advantage of the
architecture's features, such as instruction pipelining or parallel execution units.
**13. Profiling and Feedback-Directed Optimization:**
- Use profiling data to identify hotspots in the code, especially in
frequently executed expressions.
- Apply optimization techniques selectively to focus on areas of the code
where they have the most impact.
**14. Compiler Flags and Options:**
- Utilize compiler flags and options to enable specific optimizations or
control the optimization level for expressions.
**15. Benchmarking and Testing:**
- Benchmark the generated code to measure its performance against expected
goals.
- Thoroughly test the optimized code to ensure correctness.
Optimizing code generation for expressions is a balance between code size,
execution speed, and memory usage. It often requires trade-offs and
considerations specific to the target architecture and compiler. Compiler writers
use a combination of the above techniques to generate efficient code for
expressions in a way that benefits the overall performance of programs.
Dynamic Programming Code Generation:
Dynamic programming is a technique used to solve optimization problems by
breaking them down into smaller subproblems and solving each subproblem
only once, storing the results in a table or cache. This approach can be used in
code generation to efficiently generate code for complex computations or
transformations. Here's how dynamic programming can be applied to code
generation:
**1. Define the Problem:**
- Clearly define the code generation problem you want to solve. This could be
optimizing code generation for mathematical expressions, parsing, or any
other task.
**2. Identify Subproblems:**
- Break down the main code generation problem into smaller, overlapping
subproblems. Each subproblem should be a simpler version of the main
problem.
**3. Define Recurrence Relations:**
- Determine how the solutions to subproblems relate to each other. This is
typically done through recurrence relations or equations.
- Define the base cases, which are the simplest subproblems that can be solved
directly.
**4. Create a Table or Cache:**
- Create a data structure, often a table or cache, to store the
results of solved subproblems. This is used to avoid redundant
computations.
- The dimensions of the table should correspond to the parameters or variables
that define the subproblems.
**5. Bottom-Up or Top-Down Approach:**
- Dynamic programming can be implemented using a bottom-up approach,
where you solve the smallest subproblems first and build up to the main
problem, or a top-down approach, where you start with the main problem and
recursively solve smaller subproblems.
- The choice depends on the problem and the structure of the recurrence
relations.
**6. Memoization (Top-Down Approach):**
- In the top-down approach, use memoization to cache the results of
subproblems as they are solved.
- Before solving a subproblem, check if its solution is already in the cache. If
so, retrieve the cached result instead of recomputing it.
**7. Table Filling (Bottom-Up Approach):**
- In the bottom-up approach, iteratively fill in the table from the base cases
to the main problem.
- Each cell in the table represents the solution to a subproblem.
- Use previously computed solutions to calculate solutions for larger
subproblems.
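A small C illustration of the table-filling style: each entry is computed once from previously filled entries, which is the same discipline a dynamic-programming code generator uses when tabulating minimal-cost code for subexpressions (here a Fibonacci-style recurrence stands in for the cost recurrence):

```c
#include <stdio.h>

int main(void) {
    long table[21];
    table[0] = 0;                                  /* base cases           */
    table[1] = 1;
    for (int i = 2; i <= 20; i++)
        table[i] = table[i - 1] + table[i - 2];    /* reuse stored results */
    printf("entry 20 = %ld\n", table[20]);
    return 0;
}
```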
**8. Optimize and Store Code:**
- As you solve subproblems and populate the table or cache, you can
simultaneously generate and optimize code.
- The code generated for each subproblem should contribute to the final code
for the main problem.
**9. Retrieve Final Solution:**
- Once the main problem's solution is computed and stored in the table or
cache, retrieve it to obtain the final generated code.
**10. Handle Specific Code Generation Logic:**
- Depending on the code generation problem, you may need to implement
specific logic for generating code snippets or performing transformations within
the dynamic programming framework.
**11. Analyze Complexity:**
- Analyze the time and space complexity of your dynamic programming-
based code generation approach. Ensure it meets performance
requirements.
Dynamic programming-based code generation is particularly useful for problems
involving complex recursive computations, optimization tasks, or problems with
overlapping subproblems. By breaking down the problem into manageable parts
and efficiently storing and reusing results, dynamic programming can lead to
more efficient and optimized code generation processes.