0% found this document useful (0 votes)

37 views23 pages

Distributed Algorithms Explained

This document discusses distributed algorithms and the challenges of designing them for distributed systems. It summarizes that distributed algorithms are sensitive to the interaction model (synchronous vs asynchronous), types of failures like crashes or Byzantine faults, and timing issues. It notes there are many impossibility results, but solutions attempt to control timing using techniques like timeouts and assume partial synchrony or guaranteed message delivery.

Uploaded by

wahyudisyam11

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views23 pages

Distributed Algorithms Explained

Uploaded by

wahyudisyam11

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

06-06798 Distributed Systems

Lecture 9: Distributed Algorithms

11 February, 2002

Overview
Distributed algorithms
achieve co-ordination, agreement, etc

Examine sources of difficulties

timing interaction model failures

and effect on distributed algorithms

impossibility results increase in complexity and sophistication
11 February, 2002 2

Distributed algorithms
Sequential algorithm
sequence of steps to be taken (by a single process) to arrive at a solution

Distributed systems
multiple processes, each with own variables communication by exchanging messages form a graph, some topology (ring, arbitrary)

Distributed algorithm
sequence of steps to be taken by each process, including transmission of messages, to arrive at a solution
11 February, 2002 3

Why difficult?
In sequential algorithms
steps taken in strict sequence rate of execution immaterial

In distributed systems
no global time processes execute at different, unpredictable rates communication latency and delays failures must be dealt with processes have local state: true global state of the system difficult to observe
4

11 February, 2002

Examine combined effect of:

Timing
clocks, local/global time time used in: timestamp, event ordering

Interaction model
synchronous/asynchronous

Failures
benign (omission, timing) Byzantine

11 February, 2002

Clocks and timing

Internal clocks
record local time count at different rate
clock drift = relative amount of time by which clock differs from a perfect clock

different time values if read at the same time

Problems
local time unreliable when used as timestamp correction must be applied (clock synchronisation) event ordering difficult (logical time)
11 February, 2002 6

Event ordering
Scenario: group of email users X, Y, Z, A
X sends message to group Y, Z reply to group

In real-time
X sends message first; Y reads it & replies Z reads both & replies

What can happen

A sees messages in this order: from Z, X, Y

Solution [Lamport78]
record logical time
11 February, 2002 7

Logical time
Now known as Message Sequence Charts Each process
has local time axis records own events in linear order

Communication
represented by arrows between processes ordered locally according to send/arrival time

Global event ordering

can be deduced without global time partial order
11 February, 2002 8

Example of logical time

send X 1 m1 2 receive send Z receive receive m A t1 t2 m1 receive m2 receive send 3 m2 receive Physical time receive 4 receive

receive t3

X, Y: send before receive, local order within process yields order 1,2,3,4 11 February, 2002 9

Interaction models
Synchronous:
known upper/lower bounds on execution speeds, message transmission delays and clock drift rates
each takes at least MIN but no more than MAX time units

conceptually simpler model

Asynchronous:
arbitrary process execution speeds, message transmission delays and clock drift rates more general: if solution valid for asynchronous then also valid for synchronous

11 February, 2002

The synchronous model

Simpler:
can make assumptions about delays, drift rates, etc

But more difficult/expensive to build Some algorithms easier: coordinated attack

What if asynchronous?

need guarantees of delivery times, clock drift, ... two armies: initiator leads, both must attack together suppose know bounds on message delays (MIN, MAX time units) and no failures
(One) sends Charge!, waits for MIN time units and charges (Two) receives Charge!, waits for 1 time unit and charges

then One leads, Two is guaranteed to charge within MAX-MIN+1

11 February, 2002 11

The asynchronous model

More realistic:
no assumptions about delays, drift rates, etc cf Internet, WANs:
routers introduce delays (messages may take a long time) unpredictable load on server (affects response time) processor sharing (affects execution time)

But algorithms more difficult:

previous solution to co-ordinated attack does not work: suppose no bounds on message delays and no failures
choose sufficiently large T (One) sends Charge!, waits for T time units and charges (Two) receives Charge!, waits for 1 time unit and charges cannot guarantee One leads (message may take longer than T)
12

11 February, 2002

Failures...
Make the situation much worse:
message may fail to arrive (omission failure) process may stop and others may detect this (stopping failure) process may crash and others cannot detect this (crash failure)

Types of failures
benign
omission, stopping, timing/performance

arbitrary (called Byzantine)

corrupt message, wrong method called, wrong result
11 February, 2002 13

Distributed consensus
Often needed to
commit/abort transactions in distributed databases agree on altitude on board of an aeroplane

Here: coordinated attack (synchronous model, omission failures)

graph (processes are nodes, links are arcs) initial opinion Charge! or Surrender! all must attack together, otherwise destroyed communicate via messengers (can be captured or lost) must agree whether to attack or not, & attack if possible

Solution possible if messengers reliable (see earlier)

11 February, 2002 14

Consensus requirements
Agreement
no two processes decide on different values

Validity
if all start with Charge! then this is the only possible decision value if all start with Surrender! then this is the only possible decision value (other variants possible)

Termination
all processes eventually decide
11 February, 2002 15

Impossibility result
There is no deterministic solution that solves the coordinated attack problem even on on this graph:

Solutions
make probabilistic assumptions about the loss of messages while keeping processes deterministic
errors may happen with some probability

use randomisation while allowing some violation of validity/agreement

11 February, 2002 16

Outline of argument
Assume there exists an algorithm (for contradiction)
processes propose Charge!, Surrender! exchange a set of messages eventually agree

Consider the last message in exchange

messenger could be captured! result the same if message deleted, can dispense with it

Repeat for the remaining messages

left with no message

Conclusion: no algorithm exists (for this graph)

11 February, 2002 17

Process crash failures

Crash failures
process stops executing, does not respond

Crash detection
use timeouts in synchronous model: can detect crash
how?

in asynchronous model: cannot distinguish if

it has crashed, is slow, or message failed to arrive!

11 February, 2002

Stopping failures
Stopping failure (or fail-stop crash)
process stops executing others can for certain detect this

Detection
in synchronous model: use timeouts plus guaranteed message delivery
if message failed to arrive can deduce stopping failure has occurred but if it arrives can we deduce no stopping failure has occurred?

in asynchronous model: more difficult! cannot distinguish if

message takes too long to arrive, or stopping failure has occurred
11 February, 2002 19

Byzantine failures
Also called arbitrary
worst possible error system or component malfunction wrong values, wrong method

Examples
memory fault where no checksums: corrupt messages where no message sequence numbers: duplicate messages

11 February, 2002

Byzantine failures
Many difficulties!
in asynchronous model impossibility result: three processes cannot solve Byzantine agreement even in the presence of one failure need n > 3f where n number of processes, f failures

Solutions
can tolerate up to a certain number of failures increased complexity use of randomisation

11 February, 2002

Timing/performance failures
Can occur in synchronous systems
server overloaded, slow response often not critical (poor response time)

Class of Failure Clock Performance Performance

Affects Process Process Channel

Description Processs local clock exceeds the bounds on its rate of drift from real time. Process exceeds the bounds on the interval between two steps. A messages transmission takes longer than the stated bound.

11 February, 2002

Summary
Distributed algorithms are sensitive to:
types of interaction models types of failures timing

Impossibility results
very common!

Design issues
control timing if possible, allows timeouts partial synchrony guaranteed delivery of messages
11 February, 2002 23

03 SystemModels Fundamental
No ratings yet
03 SystemModels Fundamental
8 pages
Se342: Distributed Computing: Lecture # 03-b Fundamental Models
No ratings yet
Se342: Distributed Computing: Lecture # 03-b Fundamental Models
26 pages
25 DistributedCoordination
No ratings yet
25 DistributedCoordination
30 pages
Unit 3 Coordinaton and Agreement Algorithm
No ratings yet
Unit 3 Coordinaton and Agreement Algorithm
119 pages
Chapter 6-Synchronozation
No ratings yet
Chapter 6-Synchronozation
24 pages
DS Chapter 5 Synchronizations
No ratings yet
DS Chapter 5 Synchronizations
34 pages
Chapter 6 Synchronization
No ratings yet
Chapter 6 Synchronization
50 pages
Coordination and Agreement: Distributed Systems
No ratings yet
Coordination and Agreement: Distributed Systems
37 pages
3 - Fundamental Models-1
No ratings yet
3 - Fundamental Models-1
6 pages
01 Da24 Introduction
No ratings yet
01 Da24 Introduction
55 pages
Synchronization: CS403/534 Distributed Systems Erkay Savas Sabanci University
No ratings yet
Synchronization: CS403/534 Distributed Systems Erkay Savas Sabanci University
46 pages
Distributed Systems Coordination
No ratings yet
Distributed Systems Coordination
18 pages
Unit IV
No ratings yet
Unit IV
46 pages
Chapter 6
No ratings yet
Chapter 6
31 pages
Distributed System Models
No ratings yet
Distributed System Models
30 pages
Distributed Systems Synchronization
No ratings yet
Distributed Systems Synchronization
30 pages
3 Synchronization
No ratings yet
3 Synchronization
45 pages
Sec 2425 L02
No ratings yet
Sec 2425 L02
56 pages
Fundamental Models: Instructor DR / Ayman Soliman
No ratings yet
Fundamental Models: Instructor DR / Ayman Soliman
26 pages
Chapter 7-Fault Tolerance
No ratings yet
Chapter 7-Fault Tolerance
71 pages
Mutual Exclusion in Distributed Computing
100% (1)
Mutual Exclusion in Distributed Computing
15 pages
Da Slides
No ratings yet
Da Slides
355 pages
M.Tech Course Distributed Computing
No ratings yet
M.Tech Course Distributed Computing
117 pages
Logical Time in Asynchronous Systems Email Example: A B A B
No ratings yet
Logical Time in Asynchronous Systems Email Example: A B A B
8 pages
Chapter 6 Synchronization
No ratings yet
Chapter 6 Synchronization
37 pages
Distributed Systems Synchronization
No ratings yet
Distributed Systems Synchronization
111 pages
Distributed Systems Synchronization
No ratings yet
Distributed Systems Synchronization
119 pages
Chapter 8 Fault Tolerance
No ratings yet
Chapter 8 Fault Tolerance
20 pages
C1 C2 C3 Review DCmodel GlobalStates TimeCausality
No ratings yet
C1 C2 C3 Review DCmodel GlobalStates TimeCausality
81 pages
Lecture 18: Distributed Agreement: CSC 469H1F / CSC 2208H1F Fall 2007 Angela Demke Brown
No ratings yet
Lecture 18: Distributed Agreement: CSC 469H1F / CSC 2208H1F Fall 2007 Angela Demke Brown
35 pages
IntroDistribuetComputing
No ratings yet
IntroDistribuetComputing
41 pages
08 - Coordination and Agreement
No ratings yet
08 - Coordination and Agreement
31 pages
Consensus Failure
No ratings yet
Consensus Failure
79 pages
AOS PPT Unit 1,2 - 20241112 - 222203 - 0000
No ratings yet
AOS PPT Unit 1,2 - 20241112 - 222203 - 0000
20 pages
Synchronization in Distributed Systems
No ratings yet
Synchronization in Distributed Systems
51 pages
DC 2marks
No ratings yet
DC 2marks
5 pages
Distributed System
No ratings yet
Distributed System
5 pages
Chapter 8-Fault Tolerance
No ratings yet
Chapter 8-Fault Tolerance
30 pages
Chapter 8 - Fault Tolerance
No ratings yet
Chapter 8 - Fault Tolerance
19 pages
Synchronization
No ratings yet
Synchronization
114 pages
University of Makeni (Unimak) Sylvanus Koroma
100% (1)
University of Makeni (Unimak) Sylvanus Koroma
14 pages
DS Chapter 8-Fault Tolerance
No ratings yet
DS Chapter 8-Fault Tolerance
68 pages
Distributed Systems Coordination
No ratings yet
Distributed Systems Coordination
63 pages
Coordination: CE32204 - Distributed System Presented By: Eka Stephani Sinambela Institut Teknologi Del
No ratings yet
Coordination: CE32204 - Distributed System Presented By: Eka Stephani Sinambela Institut Teknologi Del
16 pages
4 4 PDF
No ratings yet
4 4 PDF
29 pages
Lecture 16,17, 18 Mutual Exclusion Algorithms
No ratings yet
Lecture 16,17, 18 Mutual Exclusion Algorithms
90 pages
Distributed Mutual Exclusion Upload
No ratings yet
Distributed Mutual Exclusion Upload
48 pages
Chapter 8-Fault Tolerance
No ratings yet
Chapter 8-Fault Tolerance
37 pages
Distributed Process Management
No ratings yet
Distributed Process Management
56 pages
Time and Coordination
No ratings yet
Time and Coordination
11 pages
Unit5 Compressed Fault Tolerance - PACE
No ratings yet
Unit5 Compressed Fault Tolerance - PACE
11 pages
7mutual Exclusion
No ratings yet
7mutual Exclusion
27 pages
Distributed Systems Coordination
No ratings yet
Distributed Systems Coordination
130 pages
Chapter 5-Synchronozation
No ratings yet
Chapter 5-Synchronozation
43 pages
Distributed Algorithms Course
No ratings yet
Distributed Algorithms Course
466 pages
Module 3-DC
No ratings yet
Module 3-DC
61 pages
Atomic Broadcast Is A Reliable Broadcast That Satisfies The Following Condition Total Order
No ratings yet
Atomic Broadcast Is A Reliable Broadcast That Satisfies The Following Condition Total Order
5 pages
YugaByte Fundamentals DBA Certification Guide
No ratings yet
YugaByte Fundamentals DBA Certification Guide
8 pages
BERLIN VERSION Beacfbd - 2022-10-24
No ratings yet
BERLIN VERSION Beacfbd - 2022-10-24
41 pages
Zakaria Hamdino m2
No ratings yet
Zakaria Hamdino m2
5 pages
MCQ With Answers
No ratings yet
MCQ With Answers
20 pages
Blockchain Barriers in Sustainable Supply Chains
No ratings yet
Blockchain Barriers in Sustainable Supply Chains
22 pages
Conceptualizing Blockchains: Characteristics & Applications: Karim Sultan, Umar Ruhi and Rubina Lakhani
No ratings yet
Conceptualizing Blockchains: Characteristics & Applications: Karim Sultan, Umar Ruhi and Rubina Lakhani
9 pages
Haha Coin White Paper v1
No ratings yet
Haha Coin White Paper v1
61 pages
Decentralized Consensus Mechanisms in Blockchain: A Comparative Analysis
No ratings yet
Decentralized Consensus Mechanisms in Blockchain: A Comparative Analysis
12 pages
Fake Media Detection via NLP & Blockchain
No ratings yet
Fake Media Detection via NLP & Blockchain
12 pages
P84-Secure Cloud-Based EHR System Using Attribute-Based Cryptosystem
No ratings yet
P84-Secure Cloud-Based EHR System Using Attribute-Based Cryptosystem
9 pages
MiCA Compromises v7 (28 Feb 2022) Track Changes
No ratings yet
MiCA Compromises v7 (28 Feb 2022) Track Changes
151 pages
Brief Notes On Blockchain - For Semester
No ratings yet
Brief Notes On Blockchain - For Semester
40 pages
Adaptive Data Based Neural Network Leader-Follower Control of Multi-Agent Networks
No ratings yet
Adaptive Data Based Neural Network Leader-Follower Control of Multi-Agent Networks
6 pages
Reading 84 Introduction To Digital Assets Answers
No ratings yet
Reading 84 Introduction To Digital Assets Answers
3 pages
In Ps Blockchain Noexp PDF
No ratings yet
In Ps Blockchain Noexp PDF
32 pages
How Will Blockchain Technology Impact Autditing and Accouting
No ratings yet
How Will Blockchain Technology Impact Autditing and Accouting
19 pages
Fundamentals of Distributed Systems
100% (1)
Fundamentals of Distributed Systems
20 pages
Sahara AI Litepaper
No ratings yet
Sahara AI Litepaper
24 pages
Block Chain Quantum
No ratings yet
Block Chain Quantum
133 pages
Social Security in The Time of Covid
No ratings yet
Social Security in The Time of Covid
212 pages
Aboualy Mahmoud Bachelor Thesis
No ratings yet
Aboualy Mahmoud Bachelor Thesis
46 pages
BCE 413 Block Chain Technology Course Outline v1
No ratings yet
BCE 413 Block Chain Technology Course Outline v1
4 pages
Swift News PDF Blockchain Settlement Regulation Paper
No ratings yet
Swift News PDF Blockchain Settlement Regulation Paper
36 pages
Mastering Blockchain: Review Questions
No ratings yet
Mastering Blockchain: Review Questions
24 pages
Banking and Insurance II Unit New
No ratings yet
Banking and Insurance II Unit New
31 pages
Tokenization Part-I Online-1
No ratings yet
Tokenization Part-I Online-1
77 pages
Syllabus CS8603 DISTRIBUTED SYSTEMS
No ratings yet
Syllabus CS8603 DISTRIBUTED SYSTEMS
2 pages
Blockchain Course for Tech Experts
No ratings yet
Blockchain Course for Tech Experts
2 pages
Messari Report Eth2 The Next Evolution of Cryptoeconomy
No ratings yet
Messari Report Eth2 The Next Evolution of Cryptoeconomy
70 pages

Distributed Algorithms Explained

Uploaded by

Distributed Algorithms Explained

Uploaded by

06-06798 Distributed Systems

Lecture 9: Distributed Algorithms

Examine sources of difficulties

and effect on distributed algorithms

Examine combined effect of:

Clocks and timing

different time values if read at the same time

What can happen

Global event ordering

Example of logical time

conceptually simpler model

The synchronous model

But more difficult/expensive to build Some algorithms easier: coordinated attack

then One leads, Two is guaranteed to charge within MAX-MIN+1

The asynchronous model

But algorithms more difficult:

arbitrary (called Byzantine)

Here: coordinated attack (synchronous model, omission failures)

Solution possible if messengers reliable (see earlier)

use randomisation while allowing some violation of validity/agreement

Consider the last message in exchange

Repeat for the remaining messages

Conclusion: no algorithm exists (for this graph)

Process crash failures

in asynchronous model: cannot distinguish if

in asynchronous model: more difficult! cannot distinguish if

Class of Failure Clock Performance Performance

Affects Process Process Channel

You might also like