Database Transactions Explained

Uploaded by

zaqmlp1017

SQL: Transactions

Introduction to Databases
CompSci 316 Spring 2020
So far: one query/update, one machine

• Multiple queries/updates, one machine: transactions
• One query/update, multiple machines: parallel query processing (Map-Reduce, Spark, ..), distributed query processing
• Multiple queries/updates, multiple machines: distributed transactions, Two-Phase Commit protocol, .. (not covered)
Why should we care about running
multiple queries/updates/programs on a
machine concurrently?
Motivation: Concurrent Execution
• Concurrent execution of user programs is essential
for good DBMS performance.
• Disk accesses are frequent, and relatively slow
• it is important to keep the CPU busy by working on several
user programs concurrently
• short transactions may finish early if interleaved with long
ones
• May increase system throughput (avg. #transactions
per unit time)
• May decrease response time (avg. time to complete a
transaction)
Transactions
T1: BEGIN A=A+100, B=B-100 END
T2: BEGIN A=1.06*A, B=1.06*B END

• A transaction is the DBMS’s abstract view of a user program
• a sequence of reads and writes
• DBMS only cares about R/W of “elements” (tuples, tables, etc.)
• the same program executed multiple times would be considered as different transactions
Example
• Consider two transactions:

T1: BEGIN A=A+100, B=B-100 END

T2: BEGIN A=1.06*A, B=1.06*B END

• Intuitively, the first transaction is transferring $100 from B’s account to A’s account. The second is crediting both accounts with a 6% interest payment
• There is no guarantee that T1 will execute before T2 or vice-versa, if
both are submitted together.
• However, the net effect must be equivalent to these two transactions
running serially in some order
Are these interleavings (schedules) good?
T1: BEGIN A=A+100, B=B-100 END
T2: BEGIN A=1.06*A, B=1.06*B END

• Schedule 1:
T1: A=A+100, B=B-100
T2: A=1.06*A, B=1.06*B

• Schedule 2:
T1: A=A+100, B=B-100
T2: A=1.06*A, B=1.06*B

• Schedule 3:
T1: A=A+100, B=B-100
T2: A=1.06*A, B=1.06*B
Example: View of DBMS
T1: BEGIN A=A+100, B=B-100 END
T2: BEGIN A=1.06*A, B=1.06*B END

• Schedule 2:
T1: A=A+100,                      B=B-100
T2:           A=1.06*A, B=1.06*B

• The DBMS’s view:
T1: R(A), W(A),                          R(B), W(B)
T2:              R(A), W(A), R(B), W(B)

R1(A), W1(A), R2(A), W2(A), R2(B), W2(B), R1(B), W1(B)

• Two possible representations of schedules
• No message passing
• Fixed set of objects (for now)
• C1 = “Commit” by transaction T1 (next slide)
• A1 = “Abort” by transaction T1
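
To see why Schedule 2 is a problem, the sketch below replays it next to the two serial orders. It is only an illustration: the starting balances of 1000, the `STEPS` table, and the `run` helper are assumptions, and each assignment such as A=A+100 is treated as one atomic read-then-write step. Exact fractions are used so that the 6% interest does not pick up floating-point noise.

```python
from fractions import Fraction

RATE = Fraction(106, 100)  # 6% interest, kept exact

# What each transaction does to each object (hypothetical encoding of T1 and T2).
STEPS = {
    ("T1", "A"): lambda v: v + 100,
    ("T1", "B"): lambda v: v - 100,
    ("T2", "A"): lambda v: v * RATE,
    ("T2", "B"): lambda v: v * RATE,
}

def run(schedule, a=1000, b=1000):
    """Apply the steps in schedule order and return the final (A, B)."""
    db = {"A": Fraction(a), "B": Fraction(b)}
    for txn, obj in schedule:
        db[obj] = STEPS[(txn, obj)](db[obj])
    return db["A"], db["B"]

serial_t1_t2 = [("T1", "A"), ("T1", "B"), ("T2", "A"), ("T2", "B")]
serial_t2_t1 = [("T2", "A"), ("T2", "B"), ("T1", "A"), ("T1", "B")]
# R1(A), W1(A), R2(A), W2(A), R2(B), W2(B), R1(B), W1(B)
schedule_2 = [("T1", "A"), ("T2", "A"), ("T2", "B"), ("T1", "B")]

for name, s in [("T1;T2", serial_t1_t2), ("T2;T1", serial_t2_t1), ("Schedule 2", schedule_2)]:
    a, b = run(s)
    print(name, "A =", a, "B =", b, "total =", a + b)
```

Under these assumptions both serial orders end with a total of 2120, while Schedule 2 ends with 2126, so its effect matches neither serial order.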
Commit and Abort
• A transaction might commit after completing all its actions
• or it could abort (or be aborted by the DBMS) after executing some actions
Concurrency Control and Recovery
• Concurrency Control
• (Multiple) users submit (multiple) transactions
• Concurrency is achieved by the DBMS, which interleaves actions
(reads/writes of DB objects) of various transactions
• user should think of each transaction as executing by itself one-at-a-time
• The DBMS needs to handle concurrent executions

• Recovery
• Due to crashes, there can be partial transactions
• DBMS needs to ensure that they are not visible to other transactions
ACID Properties

• Atomicity
• Consistency
• Isolation
• Durability
Atomicity
• A user can think of a transaction as always executing all its actions in one step, or not executing any actions at all
• Users do not have to worry about the effect of incomplete transactions

A transaction can be aborted (terminated) by the DBMS or by itself

• the DBMS may abort it because of some anomaly during execution (and then restart it)
• the system may crash (say, a power failure)
• the transaction may decide to abort itself on encountering an unexpected situation, e.g. it reads an unexpected data value or is unable to access disks

Ensured by recovery methods using “Logs”, by “undo”-ing incomplete transactions
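
The “undo” idea can be sketched with a toy in-memory store. This is a simplified illustration, not how a real DBMS lays out its log: the `UndoLogDB` class and its methods are hypothetical, there is a single active transaction, and nothing is written to disk.

```python
class UndoLogDB:
    """Toy store that records old values so an incomplete transaction can be undone."""

    def __init__(self, data):
        self.data = dict(data)
        self.undo_log = []  # (key, old_value) records of the active transaction

    def write(self, key, value):
        # Log the old value before overwriting it.
        self.undo_log.append((key, self.data[key]))
        self.data[key] = value

    def commit(self):
        # Committed changes are final; the undo records are no longer needed.
        self.undo_log.clear()

    def abort(self):
        # Undo in reverse order so each item ends at its oldest logged value.
        while self.undo_log:
            key, old = self.undo_log.pop()
            self.data[key] = old

db = UndoLogDB({"A": 1000, "B": 1000})

db.write("A", db.data["A"] + 100)  # first half of T1
db.abort()                         # T1 stops before touching B
state_after_abort = dict(db.data)

db.write("A", db.data["A"] + 100)  # T1 restarted and run to completion
db.write("B", db.data["B"] - 100)
db.commit()
state_after_commit = dict(db.data)
```

After the abort the store is back at A=1000, B=1000, so the half-finished transfer is never visible; after the commit it holds A=1100, B=900.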

Consistency
• Each transaction, when run by itself with no concurrent execution of other actions, must preserve the consistency of the database
• e.g. if you transfer money from the savings account to the checking account, the total amount still remains the same

Responsibility of the programmer’s code, and ensured by the DBMS through the other properties
Isolation
• A user should be able to understand a transaction without considering the effect of any other concurrently running transaction
• even if the DBMS interleaves their actions
• transactions are “isolated or protected” from other transactions

Often ensured by “Locks” and other concurrency control approaches
Durability
• Once the DBMS informs the user that a transaction has been successfully completed, its effect should persist
• even if the system crashes before all its changes are reflected on disk

Ensured by recovery methods using “Logs”, by “redo”-ing complete/committed transactions
Schedule
• An actual or potential sequence for executing actions as seen by the DBMS
• A list of actions from a set of transactions
• includes READ, WRITE, ABORT, COMMIT
• Two actions from the same transaction T MUST appear in the schedule in the same order that they appear in T
• cannot reorder actions from a given transaction
Scheduling Transactions
• Serial schedule: Schedule that does not interleave the actions of different transactions
• Equivalent schedules: For any database state, the effect (on the set of objects in the database) of executing the first schedule is identical to the effect of executing the second schedule
• Serializable schedule: A schedule that is equivalent to some serial execution of the committed transactions
• Note: If each transaction preserves consistency, every serializable schedule preserves consistency
Serial Schedule
• If the actions of different transactions are not interleaved
• transactions are executed from start to finish one by one
• Simple, but advantages of concurrent execution lost

T1          T2
R(A)
W(A)
R(B)
W(B)
COMMIT
            R(A)
            W(A)
            R(B)
            W(B)
            COMMIT
Serializable Schedule
• Equivalent to “some” serial schedule
• However, no guarantee on T1 -> T2 or T2 -> T1

Serial schedule:
R1(A), W1(A), R1(B), W1(B), C1, R2(A), W2(A), R2(B), W2(B), C2

Serializable schedules (each followed by the COMMITs of both transactions):
R1(A), W1(A), R2(A), W2(A), R1(B), W1(B), R2(B), W2(B)
R1(A), W1(A), R2(A), R1(B), W1(B), W2(A), R2(B), W2(B)

(Later, how to check for serializability)
Anomalies with Interleaved Execution
• Conflicts may arise if one transaction wants to write to a data item that another transaction reads/writes
• Write-Read (WR) – reading uncommitted or “dirty” data
• Read-Write (RW) – unrepeatable reads
• Write-Write (WW) – overwriting uncommitted data or “lost updates”
• No conflict with RR if no write is involved

SQL transactions
• A transaction is automatically started when a user
executes an SQL statement
• Subsequent statements in the same session are
executed as part of this transaction
• Statements see changes made by earlier ones in the
same transaction
• Statements in other concurrently running transactions
do not
• COMMIT command commits the transaction
• Its effects are made final and visible to subsequent
transactions
• ROLLBACK command aborts the transaction
• Its effects are undone
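
The COMMIT/ROLLBACK behavior above can be tried with Python's built-in `sqlite3` module. This is a minimal sketch under stated assumptions: the `account` table and its rows are made up for illustration, the database is in-memory, and the connection uses the module's default transaction handling, which implicitly begins a transaction before an INSERT or UPDATE.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (name TEXT PRIMARY KEY, balance INTEGER)")
conn.execute("INSERT INTO account VALUES ('A', 1000), ('B', 1000)")
conn.commit()

def balances():
    """Return the current balances as a dict, as seen by this session."""
    rows = conn.execute("SELECT name, balance FROM account ORDER BY name")
    return dict(rows.fetchall())

# The UPDATE starts a transaction; later statements in the same
# session see its effect even though it is not committed yet.
conn.execute("UPDATE account SET balance = balance + 100 WHERE name = 'A'")
seen_inside = balances()

# ROLLBACK aborts the transaction and undoes its effects.
conn.rollback()
after_rollback = balances()

# Run the whole transfer and COMMIT to make it final.
conn.execute("UPDATE account SET balance = balance + 100 WHERE name = 'A'")
conn.execute("UPDATE account SET balance = balance - 100 WHERE name = 'B'")
conn.commit()
after_commit = balances()
```

Here `seen_inside` shows A at 1100, `after_rollback` shows both accounts back at 1000, and `after_commit` shows the completed transfer.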
Fine print
• In many DBMS, schema operations (e.g., CREATE TABLE) implicitly commit the current transaction
• Many DBMS support an AUTOCOMMIT feature, which automatically commits every single statement
• You can turn it on/off through the API
SQL isolation levels
• Strongest isolation level: SERIALIZABLE
• Complete isolation
• Weaker isolation levels:
• REPEATABLE READ,
• READ COMMITTED,
• READ UNCOMMITTED
• Increase performance by eliminating overhead and
allowing higher degrees of concurrency
• Trade-off: sometimes you get the “wrong” answer
READ UNCOMMITTED
• Can read “dirty” data (WR conflict)
• A data item is dirty if it is written by an uncommitted
transaction
• Problem: What if the transaction that wrote the
dirty data eventually aborts?
• Example: wrong average
• -- T1:                    -- T2:
  UPDATE User
  SET pop = 0.99
  WHERE uid = 142;
                            SELECT AVG(pop)
                            FROM User;
  ROLLBACK;
                            COMMIT;
READ COMMITTED
• No dirty reads, but non-repeatable reads possible
(RW conflicts)
• Reading the same data item twice can produce different
results
• Example: different averages
• -- T1:                    -- T2:
                            SELECT AVG(pop)
                            FROM User;
  UPDATE User
  SET pop = 0.99
  WHERE uid = 142;
  COMMIT;
                            SELECT AVG(pop)
                            FROM User;
                            COMMIT;
REPEATABLE READ
• Reads are repeatable, but may see phantoms
• Example: different average (still!)
• -- T1:                    -- T2:
                            SELECT AVG(pop)
                            FROM User;
  INSERT INTO User
  VALUES(789, 'Nelson',
  10, 0.1);
  COMMIT;
                            SELECT AVG(pop)
                            FROM User;
                            COMMIT;
Summary of SQL isolation levels
Isolation level/anomaly   Dirty reads   Non-repeatable reads   Phantoms
READ UNCOMMITTED          Possible      Possible               Possible
READ COMMITTED            Impossible    Possible               Possible
REPEATABLE READ           Impossible    Impossible             Possible
SERIALIZABLE              Impossible    Impossible             Impossible

• Syntax: At the beginning of a transaction,
SET TRANSACTION ISOLATION LEVEL isolation_level
[READ ONLY | READ WRITE];
• READ UNCOMMITTED can only be READ ONLY
• PostgreSQL defaults to READ COMMITTED

Bottom line
• Group reads and dependent writes into a transaction in your applications
• E.g., enrolling in a class, booking a ticket
• Anything less than SERIALIZABLE is potentially very dangerous
• Use only when performance is critical
• READ ONLY makes weaker isolation levels a bit safer

Conflicting operations
• Two operations on the same data item conflict if at
least one of the operations is a write
• r(X) and w(X) conflict
• w(X) and r(X) conflict
• w(X) and w(X) conflict
• r(X) and r(X) do not conflict
• r/w(X) and r/w(Y) do not conflict

• Order of conflicting operations matters

• E.g., if T1.r(A) precedes T2.w(A), then conceptually, T1 should precede T2

Precedence graph
• A node for each transaction
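• A directed edge from T1 to T2 if an operation of T1 precedes and conflicts with an operation of T2 in the schedule

[Figure: two example schedules over T1 and T2, each with its precedence graph.
Left, operations in order: r(A), w(A), r(A), w(A), r(B), r(C), w(B), w(C). Good: no cycle.
Right, operations in order: r(A), r(A), w(A), w(A), r(B), r(C), w(B), w(C). Bad: cycle.]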

Conflict-serializable schedule
• A schedule is conflict-serializable iff its precedence
graph has no cycles

• A conflict-serializable schedule is equivalent to some serial schedule (and therefore is “good”)
• In that serial schedule, transactions are executed in the “topological order” of the precedence graph
• You can get to that serial schedule by repeatedly swapping adjacent, non-conflicting operations from different transactions
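
The precedence-graph test can be sketched in a few lines. The code below is an illustration only: the function names and the two sample schedules are hypothetical (in particular, which transaction issues each operation is an assumption made for the example), and `graphlib` requires Python 3.9 or later.

```python
from graphlib import TopologicalSorter, CycleError

def precedence_edges(schedule):
    """schedule: list of (txn, action, item), action is 'r' or 'w'.
    Returns the set of edges (Ti, Tj) of the precedence graph."""
    edges = set()
    for i, (t1, a1, x1) in enumerate(schedule):
        for t2, a2, x2 in schedule[i + 1:]:
            # Conflict: different transactions, same item, at least one write.
            if t1 != t2 and x1 == x2 and "w" in (a1, a2):
                edges.add((t1, t2))
    return edges

def serial_order(schedule):
    """Return an equivalent serial order, or None if the graph has a cycle."""
    graph = {t: set() for t, _, _ in schedule}
    for before, after in precedence_edges(schedule):
        graph[after].add(before)  # TopologicalSorter maps node -> predecessors
    try:
        return list(TopologicalSorter(graph).static_order())
    except CycleError:
        return None

# Hypothetical schedules for illustration.
good = [("T1", "r", "A"), ("T1", "w", "A"), ("T2", "r", "A"), ("T2", "w", "A"),
        ("T1", "r", "B"), ("T2", "r", "C"), ("T1", "w", "B"), ("T2", "w", "C")]
bad = [("T1", "r", "A"), ("T2", "r", "A"), ("T1", "w", "A"), ("T2", "w", "A"),
       ("T1", "r", "B"), ("T2", "r", "C"), ("T1", "w", "B"), ("T2", "w", "C")]

print(serial_order(good))  # a serial order exists
print(serial_order(bad))   # None: the graph has a cycle
```

For `good` the only edge is T1 to T2, so T1 followed by T2 is an equivalent serial schedule. For `bad` the graph has edges in both directions, so the schedule is not conflict-serializable.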

Locking (for Concurrency Control)

• Rules
• If a transaction wants to read an object, it must first request a shared lock (S mode) on that object
• If a transaction wants to modify an object, it must first request an exclusive lock (X mode) on that object
• Allow one exclusive lock, or multiple shared locks

Compatibility matrix (grant the lock?)

Mode of lock(s) currently held     Mode of the lock requested
by other transactions              S        X
S                                  Yes      No
X                                  No       No
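
The compatibility matrix translates directly into a lookup table. The sketch below is a toy lock table, not a real lock manager: the `LockTable` class is hypothetical, a denied request simply returns False instead of waiting in a queue, and deadlocks are not handled.

```python
# (mode held by another transaction, mode requested) -> grant?
COMPATIBLE = {
    ("S", "S"): True,
    ("S", "X"): False,
    ("X", "S"): False,
    ("X", "X"): False,
}

class LockTable:
    def __init__(self):
        self.held = {}  # object -> {txn: mode}

    def request(self, txn, obj, mode):
        """Grant and record the lock if it is compatible with every lock
        held on obj by other transactions; otherwise return False."""
        holders = self.held.setdefault(obj, {})
        for other, held_mode in holders.items():
            if other != txn and not COMPATIBLE[(held_mode, mode)]:
                return False
        if holders.get(txn) != "X":  # never downgrade an X lock to S
            holders[txn] = mode
        return True

    def release(self, txn, obj):
        self.held.get(obj, {}).pop(txn, None)

locks = LockTable()
results = [
    locks.request("T1", "A", "S"),  # granted
    locks.request("T2", "A", "S"),  # granted: shared locks coexist
    locks.request("T3", "A", "X"),  # denied: S locks are held by others
]
locks.release("T1", "A")
locks.release("T2", "A")
results.append(locks.request("T3", "A", "X"))  # granted: no other holders
results.append(locks.request("T1", "A", "S"))  # denied: T3 holds X
```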

Basic locking is not enough

T1: add 1 to both A and B (preserves A=B)
T2: multiply both A and B by 2 (preserves A=B)

                 T1           T2
                 lock-X(A)
Read 100         r(A)
Write 100+1      w(A)
                 unlock(A)
                              lock-X(A)
                              r(A)         Read 101
                              w(A)         Write 101*2
                              unlock(A)
                              lock-X(B)
                              r(B)         Read 100
                              w(B)         Write 100*2
                              unlock(B)
                 lock-X(B)
Read 200         r(B)
Write 200+1      w(B)                      A≠B!
                 unlock(B)

Possible schedule under locking, but still not conflict-serializable! (The precedence graph has edges T1 → T2 and T2 → T1.)

Two-phase locking (2PL)

• All lock requests precede all unlock requests
• Phase 1: obtain locks, phase 2: release locks
• 2PL guarantees a conflict-serializable schedule

T1           T2
lock-X(A)
r(A)
w(A)
lock-X(B)
unlock(A)
             lock-X(A)
             r(A)
             w(A)
             lock-X(B)     Cannot obtain the lock on B until T1 unlocks
r(B)
w(B)
unlock(B)
             r(B)
             w(B)

Resulting schedule: r1(A), w1(A), r2(A), w2(A), r1(B), w1(B), r2(B), w2(B)
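
The 2PL rule is easy to check mechanically for a single transaction. The sketch below uses a hypothetical `is_two_phase` helper and compares T1's action sequence from the basic-locking example with its sequence under 2PL; the tuple encoding of actions is an assumption for illustration.

```python
def is_two_phase(actions):
    """actions: one transaction's list of (op, item) in order.
    True if no lock request comes after an unlock."""
    unlocked = False
    for op, _item in actions:
        if op == "unlock":
            unlocked = True
        elif op.startswith("lock") and unlocked:
            return False  # a lock after an unlock breaks the two phases
    return True

# T1 under basic locking: unlocks A before locking B.
t1_basic = [("lock-X", "A"), ("r", "A"), ("w", "A"), ("unlock", "A"),
            ("lock-X", "B"), ("r", "B"), ("w", "B"), ("unlock", "B")]

# T1 under 2PL: acquires the lock on B before releasing A.
t1_2pl = [("lock-X", "A"), ("r", "A"), ("w", "A"), ("lock-X", "B"),
          ("unlock", "A"), ("r", "B"), ("w", "B"), ("unlock", "B")]

print(is_two_phase(t1_basic))
print(is_two_phase(t1_2pl))
```

Only the second sequence passes, which is why T2 can no longer slip in between T1's work on A and its work on B.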

Remaining problems of 2PL

T1          T2
r(A)
w(A)
            r(A)
            w(A)
r(B)
w(B)
            r(B)
            w(B)
Abort!

• T2 has read uncommitted data written by T1
• If T1 aborts, then T2 must abort as well
• Cascading aborts possible if other transactions have read data written by T2
• Even worse, what if T2 commits before T1?
• Schedule is not recoverable if the system crashes right after T2 commits
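
Cascading aborts can be computed by following "reads-from" dependencies. The sketch below is a simplified illustration: the `cascading_aborts` function is hypothetical, it assumes no transaction in the schedule has committed yet, and the transaction T3 is added only to show the cascade going one step further.

```python
def cascading_aborts(schedule, aborted):
    """schedule: list of (txn, action, item), action is 'r' or 'w'.
    Returns every transaction that must abort if `aborted` aborts,
    i.e. anything that read data written by a doomed transaction."""
    doomed = {aborted}
    changed = True
    while changed:  # repeat until no new transaction is added
        changed = False
        last_writer = {}
        for txn, action, item in schedule:
            if action == "w":
                last_writer[item] = txn
            elif last_writer.get(item) in doomed and txn not in doomed:
                doomed.add(txn)
                changed = True
    return doomed

# The schedule from the slide: T2 reads A and B after T1 wrote them.
slide = [("T1", "r", "A"), ("T1", "w", "A"), ("T2", "r", "A"), ("T2", "w", "A"),
         ("T1", "r", "B"), ("T1", "w", "B"), ("T2", "r", "B"), ("T2", "w", "B")]

# Hypothetical extension: T3 reads the value of A written by T2.
with_t3 = slide + [("T3", "r", "A")]

print(sorted(cascading_aborts(slide, "T1")))
print(sorted(cascading_aborts(with_t3, "T1")))
```

If T1 aborts, T2 must abort too, and in the extended schedule so must T3. Aborting T2 alone does not affect T1, because T1 never reads anything T2 wrote.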

Strict 2PL
• Only release locks at commit/abort time
• A writer will block all other readers until the writer
commits or aborts

• Used in many commercial DBMS

• Oracle is a notable exception
Isolation levels not based on locks?
Snapshot isolation in Oracle
• Based on multiversion concurrency control
• Used in Oracle, PostgreSQL, MS SQL Server, etc.
• Intuition: uses a “private snapshot” or “local copy”
• If there is no conflict, make the changes global; otherwise abort
• More efficient than locks, but may lead to aborts
