Traffic concept, measurements, statistics

Lecturer: Dmitri A. Moltchanov

E-mail: moltchan@cs.tut.fi

http://www.cs.tut.fi/~moltchan/modsim/

http://www.cs.tut.fi/kurssit/TLT-2706/
Traffic modeling D.Moltchanov, TUT, 2005
OUTLINE:
• Traffic concept;
• Traffic measurements;
• Step-by-step traffic modeling procedure;
• Points of interest in traffic modeling;
• Observations from Internet traffic measurements;
• What statistics to capture;
• Estimation of the statistics;
• Choosing a candidate model;
• Fitting parameters of the model;
• Testing for accuracy of approximation;
• Example of the traffic modeling procedure.

Lecture: Traffic concept, measurements, models

1. Importance of traffic

The cost of any telecommunication system depends on the amount of traffic.
The main aims when planning a telecommunication system are to:
• adjust the amount of equipment so that the required quality is satisfied;
• use the equipment as efficiently as possible;
• keep costs as small as possible.
Teletraffic theory deals with developing methods suitable for:
• optimization of the structure of the network to satisfy a given traffic;
• adjustment of the amount of equipment to satisfy a given traffic.
Since both of these tasks depend on the amount of traffic, we have to define:
• what is traffic?
• what is the unit of traffic?
We distinguish between circuit-switched and packet-switched networks.

2. Traffic in circuit-switched networks

Definition: traffic intensity in a pool of resources at time t is the number of busy resources at t.

Mean traffic intensity over a period T is given by:

$Y(T) = \frac{1}{T} \int_0^T n(t)\,dt,$ (1)

• where n(t) denotes the number of occupied resources (servers) at time t.

Note the following:

• a pool of resources may be any group of servers (e.g. a group of trunks);
• statistical moments of the traffic intensity may be calculated for a given period of time T;
• the term traffic intensity usually refers to the average traffic intensity.

Definition: carried traffic $A_C$ is the traffic carried by the group of servers during the interval T.
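As an illustration of the mean traffic intensity, the sketch below computes the time average of a piecewise-constant n(t). The function name, the trace format and the numbers are assumptions made for this example only; they do not come from the lecture.

```python
def mean_traffic_intensity(change_points, horizon):
    """Time average of a piecewise-constant n(t) over [0, horizon].

    change_points: list of (time, busy_count) sorted by time; n(t) keeps
    the given value from that time until the next change point.
    """
    area = 0.0
    for k, (t, n) in enumerate(change_points):
        # The current value holds until the next change point (or the horizon).
        t_next = change_points[k + 1][0] if k + 1 < len(change_points) else horizon
        area += n * (t_next - t)
    return area / horizon


# Hypothetical trace: number of busy trunks over a 10-minute window.
trace = [(0, 2), (3, 5), (7, 1)]
intensity = mean_traffic_intensity(trace, horizon=10)
print(intensity)  # (2*3 + 5*4 + 1*3) / 10 Erlangs
```

The integral reduces to a sum of rectangles because n(t) only changes at call arrivals and departures.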

Figure 1: Illustration of the average traffic intensity (n(t), number of busy trunks, versus time t; the instantaneous and the average traffic intensity are shown).

The following is important:

• the carried traffic cannot exceed the number of trunks;
• a single trunk can carry at most one Erlang of traffic!

The total traffic carried in a period of time T is called the traffic volume (measured in Erlang-hours).

Note the following:
• carried traffic $A_C$ is often proportional to the income of a network operator;
• losses are usually due to the inability to carry all traffic!

Definition: offered traffic A:

• traffic which would be carried if no arrivals were rejected due to a lack of capacity;
• this concept is usually used in theoretical studies.

How to compute offered traffic:

$A = \lambda \times s.$ (2)

• λ: mean number of arrivals per time unit;

• s: mean service time of an arrival.

So far we have defined two different types of traffic:
• offered traffic A;
• carried traffic $A_C$.

The values of A and $A_C$ may differ.

Lost (rejected, blocked) traffic is the difference between offered traffic and carried traffic:

$A_L = A - A_C.$ (3)

• the value of lost traffic is reduced by increasing the capacity of the system;
• when the capacity of the system tends to infinity, $A_C \to A$.

Example: the arrival intensity is 10 arrivals per minute and the mean service time is 2 minutes:

$A = 10 \cdot 2 = 20$ (Erlangs). (4)
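The relations between offered, carried and lost traffic can be checked with a few lines of code. This is only a sketch: the function names are made up for the example, and the carried traffic value is a hypothetical measurement, not a figure from the lecture.

```python
def offered_traffic(arrival_rate, mean_service_time):
    """Offered traffic in Erlangs: arrivals per time unit times mean service time."""
    return arrival_rate * mean_service_time


def lost_traffic(offered, carried):
    """Lost traffic is the part of the offered traffic that is not carried."""
    return offered - carried


A = offered_traffic(arrival_rate=10, mean_service_time=2)  # example from the text
A_C = 18.5                                                 # hypothetical carried traffic
A_L = lost_traffic(A, A_C)
print(A, A_C, A_L)
```

Both quantities passed to `offered_traffic` must use the same time unit (here minutes), otherwise the result is not in Erlangs.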

2.1. Traffic variations

Traffic in circuit-switched networks varies according to the activity of subscribers:
• traffic is generated by single sources (subscribers);
• subscribers are assumed to be independent.
Measurements have shown that traffic is characterized by two major components:
• stochastic component:
– random generation of calls by subscribers.
• deterministic component:
– nearly deterministic variability of the number of calls over days, weeks, months and even years;
– cause: subscribers need to make more calls in certain periods of time.
Variations in traffic can be divided into:
• variations in service times;
• variations in number of calls.

Figure 2: Average number of voice calls: 10 workdays averages, taken from V.B. Iversen.

Figure 3: Average service times for voice calls: taken from V.B. Iversen.

Peaks in the average number of calls and in service times usually depend on:
• whether the exchange is located in a residential, rural, or business area;
• what kind of traffic we look at.

Deterministic nature: traffic patterns look very similar on different days:
• traffic patterns are similar during weekdays;
• traffic patterns are similar during weekend days;
• traffic patterns differ between weekdays and weekend days.

Natural questions:
• when does the peak number of calls occur?
• is this peak the same for each day?

Generally, deterministic variations in the traffic can be divided into:
• 24-hour variations:
– such as those we considered previously.
• weekly variations:
– highest traffic on Monday, followed by Friday, Tuesday, Wednesday, Thursday, Saturday and Sunday.
• yearly variations:
– for example: traffic is very low during vacation periods (July in Finland).
• large-scale variations:
– traffic increases depending on technology development and the economic state of society.
The following is important:
• we considered traditional PSTN traffic;
• other traffic types or other circuit-switched networks have their own patterns and variations:
– dial-up Internet via modems;
– voice calls in GSM/IS-95/UMTS mobile networks.

Figure 4: Average number of modem calls: single day, taken from V.B. Iversen, year 1999.

Figure 5: Average service times for modem calls: taken from V.B. Iversen.

2.2. The concept of busy hour

Time consistent busy hour (TCBH):
• the period of 60 minutes during which, measured over a long time, the highest traffic occurs.
Note the following:
• the highest traffic may not occur at the same time every day!

[Figure: two plots of calls/minute versus time of day (0 to 24 hours).]

2.3. Blocking concept

Circuit-switched telecommunication systems:
• are dimensioned so that subscribers share the expensive equipment:
– never dimensioned so that all subscribers can connect at the same time;
– equipment which is separate for each subscriber should be made as cheap as possible.
• there is a concentration from the subscribers towards the exchange.

Figure 6: Sketch of the telephone exchange (customers connect to the exchange, which has links to the domestic concentrator and to the international concentrator).

Which dimensioning rules are usually applied:
• about 5-8% of subscribers should be able to make domestic calls at the same time:
– note that each phone is usually used 10-16% of the time.
• about 1% of subscribers should be able to make international calls at the same time.

How it is done and what it gives:

• statistical multiplexing at the call level;
• each subscriber feels that he or she has unrestricted access to all resources.

This leads to some problems:

• resources are shared with many others;
• it is possible that a subscriber cannot establish a call.

When it is not possible to establish a call, the call:
• has to wait, or
• has to be blocked.

Depending on how the system operates we distinguish between:

• loss systems: an arrival is lost when there are insufficient resources in the system;
• waiting systems: an arrival waits when there are insufficient resources in the system;
• mixed loss-waiting systems: depending on the arrival, it can either wait or get lost.

Network performance measures in loss systems can be expressed using:

• call congestion B: the fraction of call attempts that observe all servers busy;
• time congestion E: the fraction of time when all servers are busy;
• traffic congestion C: the fraction of the offered traffic that is not carried.
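The three congestion measures can be estimated from a recorded list of call attempts. The sketch below replays such a list through a pure loss system; the function name, the input format and the tiny trace are assumptions made for illustration, and every call is assumed to start and finish inside the observation window.

```python
def congestion_measures(calls, n_servers, horizon):
    """Return (B, E, C) for a loss system observed over [0, horizon].

    calls: list of (arrival_time, service_time) tuples.
    """
    in_service = []  # departure times of calls currently holding a server
    accepted = []    # (start, end) of every accepted call
    for arrival, service in sorted(calls):
        # Release servers whose calls ended at or before this arrival.
        in_service = [d for d in in_service if d > arrival]
        if len(in_service) < n_servers:
            in_service.append(arrival + service)
            accepted.append((arrival, arrival + service))

    # Sweep over start/end events to measure the time with all servers busy.
    # At equal times, ends (-1) sort before starts (+1).
    events = sorted([(s, +1) for s, _ in accepted] + [(e, -1) for _, e in accepted])
    busy, last_t, full_time = 0, 0.0, 0.0
    for t, delta in events:
        if busy == n_servers:
            full_time += t - last_t
        busy += delta
        last_t = t

    offered = sum(service for _, service in calls) / horizon
    carried = sum(end - start for start, end in accepted) / horizon
    call_congestion = (len(calls) - len(accepted)) / len(calls)
    time_congestion = full_time / horizon
    traffic_congestion = (offered - carried) / offered
    return call_congestion, time_congestion, traffic_congestion


# Hypothetical trace: one trunk, three call attempts, observed for 4 time units.
calls = [(0, 2), (1, 2), (3, 1)]
B, E, C = congestion_measures(calls, n_servers=1, horizon=4)
print(B, E, C)
```

In this small trace the three measures differ from each other, which shows that they describe different aspects of blocking and should not be used interchangeably.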

3. Traffic in packet-switched networks

In data networks we talk about transmission needs:
• a packet has a length of s units (bits or bytes);
• a link is characterized by a capacity φ (units per second).

Then the service time for a customer (the so-called transmission time) is:

$\frac{s}{\phi}.$ (5)

Utilization ρ of the link is:

$\rho = \frac{\lambda s}{\phi}, \quad 0 < \rho < 1,$ (6)

• where λ is the arrival rate of packets per time unit.
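A short numerical sketch of the transmission time and the link utilization follows. The packet size, link capacity and arrival rate are hypothetical values chosen for the example; s and φ must be expressed in the same unit (here bits).

```python
def transmission_time(packet_size, capacity):
    """Service (transmission) time of one packet: s / phi."""
    return packet_size / capacity


def utilization(arrival_rate, packet_size, capacity):
    """Link utilization: rho = lambda * s / phi."""
    return arrival_rate * packet_size / capacity


s = 1500 * 8         # hypothetical packet size: 1500 bytes, in bits
phi = 10_000_000     # hypothetical link capacity: 10 Mbit/s
lam = 500            # hypothetical arrival rate: 500 packets per second

t_tx = transmission_time(s, phi)
rho = utilization(lam, s, phi)
print(t_tx, rho)
```

A value of rho at or above 1 would mean that the link cannot keep up with the arrivals, which is why the condition 0 < ρ < 1 accompanies the formula.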

4. Traffic measurements
To provide a quantitative analysis of a telecommunication system we have to:
• provide an adequate traffic model:
– determine important statistical parameters of the input traffic:
∗ measure traffic patterns;
∗ compute statistical parameters of the patterns.
– match these parameters using an appropriate traffic model.
• provide a model of the service process;
• carry out an analysis of the system under different conditions.
Any traffic measurement is characterized by the following three parameters:
• period of measurement;
• type of measurement;
• parameters under consideration.

4.1. Why do we need traffic measurements?

There are four main reasons why network traffic measurements are very useful:
• network troubleshooting:
– malfunctioning equipment may disrupt the operation of the network;
– examples: broadcast storms, illegal packet sizes, incorrect addresses, security attacks;
– measurements: may help to locate this equipment.
• protocol/application debugging:
– developers want to test a new, improved version of a protocol/application;
– measurements: may reveal 'hidden problems' of the protocol/application.
• traffic characterization:
– what is the current workload, what are the future trends?
– measurements: are required to answer these questions.
• performance evaluation:
– what is the performance of a router or an application?
– measurements: are required to characterize the workload.

4.2. Methods of measurements

Traffic measurements can be implemented using the following operations:
• observing the number of events:
– counting the events that occur in time intervals of constant length;
– this corresponds to the number (counting) representation of the arrival process.
• observing time intervals:
– collecting data about the lengths of time intervals between events;
– this corresponds to the interval representation of the arrival process.
Using these operations we may obtain any characteristic of a traffic process and then:
• apply these characteristics to develop a traffic model;
• use them directly to dimension the system under consideration.
Traffic measuring methods can be divided into two major categories:
• continuous measuring methods: measuring equipment is activated at the instant of the event;
• discrete measuring methods.

4.3. Discrete measurements

Discrete measurement (so-called scanning method):
• time points are chosen;
• measuring equipment tests whether there have been changes at the measuring time points;
• time points are usually equally separated;
• events between two time points are treated as if they happened together.

Figure 7: Discrete measurements (time axis with points of measurement; values shown: 1, 1, 2, 1, 2, 1, 1).
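The number representation and the interval representation of an arrival process can both be derived from a list of event timestamps. The sketch below is a minimal illustration; the function names and the event times are hypothetical and were chosen only so that the counts are easy to verify by hand.

```python
def counts_per_interval(event_times, interval, horizon):
    """Number representation: events counted in equal-length scanning intervals."""
    n_bins = int(horizon // interval)
    counts = [0] * n_bins
    for t in event_times:
        k = int(t // interval)
        if 0 <= k < n_bins:
            counts[k] += 1  # events inside one interval are lumped together
    return counts


def interarrival_times(event_times):
    """Interval representation: time between consecutive events."""
    return [later - earlier for earlier, later in zip(event_times, event_times[1:])]


# Hypothetical event timestamps (seconds), scanned once per second for 7 seconds.
events = [0.5, 1.2, 2.1, 2.8, 3.5, 4.3, 4.9, 5.6, 6.4]
counts = counts_per_interval(events, interval=1, horizon=7)
gaps = interarrival_times(events)
print(counts)
print(gaps)
```

The counts lose the exact position of each event inside its interval, which is the price of the scanning method compared with continuous measurement.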

4.4. Active and passive measurements

Depending on whether we monitor real network traffic or some kind of 'artificial' traffic, we distinguish between:
• active traffic measurements;
• passive traffic measurements.
Active measurements:
• packets are generated by a tool to probe the network and measure its characteristics:
– ping: tool to estimate network latency to a particular destination in the Internet;
– tracert: tool to determine routing paths;
– pathchar: tool to estimate link capacities and latencies along an Internet path.
Passive measurements:
• a network monitor is used to observe and record traffic on an operational network;
• most free measurement tools fall into this category:
– tcpdump;
– Ethereal.

4.5. Software- and hardware-based measurement tools

Depending on the implementation we distinguish between:
• hardware-based traffic measurement tools;
• software-based traffic measurement tools.
Hardware-based measurement tools:
• equipment (device) with specific functionality;
• often referred to as traffic analyzers;
• expensive: the price depends on the number of network interfaces, storage capacity and analysis capabilities;
• usually provide on-line statistical traffic analysis.
Software-based measurement tools:
• specific programs developed for collection and analysis of data;
• less expensive, while sometimes providing the same functionality;
• some examples: tcpdump, Ethereal, ping;
• non-specific examples: web servers, proxies and firewalls that provide log files.

Figure 8: An example of the main window of Ethereal with a captured trace.

5. Step-by-step traffic modeling procedure

Step-by-step procedure:
• determine the level of interest;
• measure traffic at the point of interest;
• decide what statistics should be captured;
• estimate statistics of traffic observations;
• choose a candidate model;
• fit parameters of the model;
• test accuracy of the model.

We will be dealing with:

• traffic in packet-switched networks;
• different levels of aggregation.

6. Level of interest for traffic modeling

Traffic can be represented:
• at the session level:
– requests to download files from an FTP server;
– requests to download pages from a WWW server.
• at the packet level.
Which level to choose:
• depends on the particular task.
General notes:
• session level:
– usually claimed to follow a Poisson process;
– reality might be quite different!
• packet level: any behavior should be expected.

7. Points of interest in traffic modeling

You have to take into account:
• where are you asked to model the traffic (evaluate performance)?

Figure 9: Points at which traffic is usually measured and modeled (customer side, network side).

7.1. Point 1: particular application

We distinguish between:
• voice application;
• video application;
• data transfers:
– ftp information;
– http information;
– ssh information.
What is important: properties of the transport layer protocol and of the application:
• UDP: no specific pattern:
– does not much affect the properties of the application;
– you may model the traffic of the application only.
• TCP: very specific pattern:
– affects data transmission;
– should be taken into account.

Figure 10: TCP traffic pattern: TCP Reno and TCP Tahoe (congestion window in MSSs versus time).

• how much traffic does the application have?

• ftp: large files; ssh, e-mail, http: large and small transfers.

7.2. Point 2: aggregated traffic from a number of applications

We distinguish between:
• heterogeneous applications;
• homogeneous applications.

Figure 11: Homogeneous and heterogeneous traffic aggregates (one customer side with VoIP sources only; another with VoIP, video and FTP sources).

7.3. Point 3: aggregated network traffic

What it is:
• aggregation of a large number of flows.

Figure 12: Aggregated backbone traffic (access routers connected to a backbone router).

• may have quite sophisticated properties;

• in practice, cannot be obtained as a superposition of individual flows.

8. Observations from Internet traffic measurements

The most important fact: Internet traffic changes in time.

Recent observations, trends and facts on Internet traffic:

• TCP accounts for most of the packet traffic in the Internet;
• traffic flows are bidirectional, but often asymmetric;
• most TCP sessions are short-lived;
• the packet arrival process in the Internet is not Poisson;
• the session arrival process may be approximated by a Poisson process;
• packet sizes are bimodally distributed;
• packet traffic is non-uniformly distributed;
• aggregate network traffic may have a multi-fractal nature;
• Internet traffic continues to change.

8.1. Domination of TCP

TCP accounts for most of the packet traffic in the Internet:
• beginning of the 1990s:
– it was first observed that TCP dominates.
• middle of the 1990s:
– introduction of multimedia services;
– development of RTP, RTCP, RTSP, ...;
– UDP share was expected to grow.
• beginning of the 2000s:
– TCP is still the dominant protocol;
– multimedia content also extensively uses TCP.
Reasons for TCP dominance:
• multimedia is usually part of web pages;
• availability of TCP.

8.2. Bidirectional asymmetric traffic flows

Traffic flows are bidirectional, but often asymmetric:
• bidirectional exchange of data:
– ftp, http, ssh, e-mail, etc.
• asymmetric traffic pattern:
– ftp, http, ssh, e-mail are all request-response based protocols.
• what are the current trends:
– p2p applications may generate bidirectional asymmetric traffic;
– p2p applications: napster, kazaa, etc.

8.3. Short-lived TCP sessions

Most TCP sessions are short-lived:
• almost 90% of TCP connections exchange fewer than 10 Kbytes of data in a few seconds:
– WWW service: request - response;
– http v1.0: separate connection for each object on a page;
– http v1.1: single connection for a page;
– most pages and objects are less than 10 Kbytes in length.
• what is the effect:
– heavy-tailed distribution of session sizes;
– heavy tail: noticeable frequencies in the histogram bins corresponding to large sizes;
– reasons for the heavy tail: most of the sessions are small, some are big (ftp).

8.4. Packet arrivals are not homogeneous Poisson

The packet arrival process in the Internet is not homogeneous Poisson:
• common belief was:
– aggregated traffic is Poisson (or at least Markovian) in nature;
– a lot of studies have been made with the Poisson assumption.
• reality:
– the arrival process is not homogeneous Poisson;
– interarrival times are at least correlated;
– the packet arrival process may not even be covariance stationary;
– there can be so-called packet 'clumps' or 'batches'.
• result: the packet arrival process is far from the common assumptions.

8.5. Session arrivals are Poisson

The session arrival process may be approximated by a Poisson process:
• what are the reasons:
– there are a lot of users getting access to a certain site;
– users can be assumed to be independent;
– the situation is similar to the telephone network, where the Poisson assumption holds.

8.6. Special distribution of packet sizes

Packet sizes are bimodally distributed:
• around 50% of packets are as large as possible:
– these are TCP data packets;
– recall that the maximum size is determined by the MTU of Ethernet: 1500 bytes;
– around 50% of packets are 1500 bytes in length.
• around 40% of packets are as small as possible:
– these are TCP ACKs;
– recall that the minimum size is determined by the headers of TCP (20 bytes) and IP (20 bytes): 40 bytes;
– around 40% of packets are 40 bytes in length.
• around 10% of packet lengths are uniformly distributed between 40 and 1500 bytes;
• additional peaks: fragmentation of IP packets.
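The approximate shares quoted above define a simple mixture distribution for the packet size. The sketch below computes its mean and draws samples from it; it ignores the additional fragmentation peaks, and the constant and function names are assumptions made for this example.

```python
import random

# Approximate shares and sizes (bytes) from the list above.
P_LARGE, SIZE_LARGE = 0.5, 1500   # TCP data packets at the Ethernet MTU
P_SMALL, SIZE_SMALL = 0.4, 40     # TCP ACKs: 20-byte TCP + 20-byte IP header
P_OTHER = 0.1                     # sizes spread uniformly between 40 and 1500


def mean_packet_size():
    """Mean of the three-component mixture."""
    uniform_mean = (SIZE_SMALL + SIZE_LARGE) / 2
    return P_LARGE * SIZE_LARGE + P_SMALL * SIZE_SMALL + P_OTHER * uniform_mean


def sample_packet_size(rng):
    """Draw one packet size from the mixture."""
    u = rng.random()
    if u < P_LARGE:
        return SIZE_LARGE
    if u < P_LARGE + P_SMALL:
        return SIZE_SMALL
    return rng.randint(SIZE_SMALL, SIZE_LARGE)


rng = random.Random(1)  # fixed seed so the run is repeatable
sizes = [sample_packet_size(rng) for _ in range(10_000)]
print(mean_packet_size(), sum(sizes) / len(sizes))
```

The mean lies far from both modes, so it describes a packet size that is rarely observed; this is one reason why the whole distribution, and not only the mean, matters for such traffic.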

8.7. Non-uniform distribution of the traffic

Note: it is related to flows in the network!

Packet traffic is non-uniformly distributed:

• around 90% of traffic is between 10% of nodes:
– explanation: client-server configuration of most services.
• this property may change:
– p2p applications.

8.8. Unknown patterns of the packet traffic

What was suggested over decades:
• 1980s: Poisson nature of the aggregated packet traffic:
– common agreement: this assumption is no longer valid!
• 1990s: self-similar nature of the aggregated traffic:
– the most respected hypothesis today;
– seems a little bit strange.
• 2000s: is it simply non-stationary?
– probably the correct answer:
– small timescales: stationary;
– long timescales: non-stationary.

8.9. Changing nature of Internet traffic

Internet traffic continues to change:
• what applications dominated over decades:
– 1980-1995: e-mail, remote access;
– since 1995: WWW, large file transfers;
– prediction for 2010: p2p, WWW.
• how to deal with it:
– you cannot rely upon 'old measurements';
– new measurements are required.

8.10. Find more about Internet traffic

Where you may find more about current Internet traffic:
• Internet traffic archive: http://ita.ee.lbl.gov;
• Internet traffic report: http://www.internettrafficreport.com;
• National laboratory for applied network research (NLANR): http://www.nlanr.net;
• NLANR measurement and operations analysis team (MOAT): http://moat.nlanr.net;
• National Internet measurement infrastructure (NIMI): http://www.ncne.nlanr.net/nimi;
• tcpdump measurements software: http://www.tcpdump.org;
• Ethereal software: http://www.ethereal.com;
• research papers:
– free search engine: http://researchindex.org/;
– free search engine: http://scholar.google.com/;
– ieee: http://ieeexplore.ieee.org/Xplore/guesthome.jsp.

9. What statistics to capture

The general answer is not straightforward:
• what are the aims of traffic modeling:
– just to propose a new, better traffic model?
– to carry out performance evaluation?
– what kind of performance evaluation: simulation or analytical?
• how closely you are going to describe the traffic:
– trade-off between accuracy and complexity!
– is it sufficient just to get basic ideas?
– is there interest in precise parameters?
• what statistics are important:
– mean, variance, distribution, ACF?
– you can never say before you get results;
– you can never say before you capture a certain parameter.

9.1. Description of stochastic processes

Note the following for a process {S(n), n = 0, 1, ...}:
• full information is given by the N-dimensional distribution:

$S(t_1, s_1, t_2, s_2, \dots) = \Pr\{S(t_1) \le s_1, S(t_2) \le s_2, \dots\}.$ (7)

– impossible to deal with in practice;

– never considered in teletraffic theory.
• in the most general case we operate with:
– empirical distribution (in terms of the histogram);
– autocorrelation function (ACF).
• note the following:
– distribution and ACF: do not fully describe an arbitrary process;
– distribution and ACF: give a full description for processes with a Normal distribution.

Figure 13: Empirical distribution and NACF with approximations. Panels: (a) empirical distribution, $f_{i,E}(\Delta)$ versus $i\Delta$; (b) NACF, $K_Y(i)$ versus lag i.

It is advisable to construct both:

• you get a picture of how the distribution behaves;
• you get a picture of how the ACF behaves.
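Both objects can be estimated from a trace with a few lines of code. The sketch below builds a histogram of relative frequencies and a normalized autocorrelation function (NACF) using one common estimator (covariance sums divided by the full sample size); the function names and the short trace are assumptions for illustration only.

```python
def relative_frequencies(samples, bin_width):
    """Histogram of relative frequencies with equal-width bins starting at min(samples)."""
    lowest = min(samples)
    n_bins = int((max(samples) - lowest) // bin_width) + 1
    counts = [0] * n_bins
    for x in samples:
        counts[int((x - lowest) // bin_width)] += 1
    return [c / len(samples) for c in counts]


def nacf(samples, max_lag):
    """Normalized autocorrelation function for lags 0..max_lag."""
    n = len(samples)
    mean = sum(samples) / n
    variance = sum((x - mean) * (x - mean) for x in samples) / n
    result = []
    for lag in range(max_lag + 1):
        covariance = sum(
            (samples[i] - mean) * (samples[i + lag] - mean) for i in range(n - lag)
        ) / n
        result.append(covariance / variance)
    return result


# Hypothetical trace: number of packets observed in consecutive time slots.
trace = [3, 5, 4, 6, 5, 7, 6, 8]
histogram = relative_frequencies(trace, bin_width=2)
correlation = nacf(trace, max_lag=3)
print(histogram)
print(correlation)
```

A real trace would need far more than eight observations before either estimate becomes reliable, especially at larger lags where fewer pairs contribute to the sum.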

Figure 14: Forms of distribution (sketches of $f_X(x)$ for: exponential, hyperexponential; gamma, beta, Erlang, Weibull; normal; Pareto).

Note: the form may significantly affect the results of performance analysis!

If the distribution is hard to capture, capture moments:
• 1st moment: mean;
• 2nd moment: variance;
• 3rd moment: skewness;
• 4th moment: kurtosis;
• higher moments.

If the ACF is hard to capture, capture:

• lag-1 ACF only;
• short-range behavior of the ACF;
• long-range behavior of the ACF.

9.2. Importance of moments

Figure 15: Mean and variance of distribution ($f_X(x)$ versus x, with the mean and the variance marked).

• mean: measure of central tendency;

• variance: measure of dispersion.

Figure 16: Skewness of the distribution ($f_X(x)$ for sk < 0, sk = 0 and sk > 0).

• skewness: normalized third central moment (for symmetric distributions sk = 0);

• skewness: measure of the lopsidedness of the distribution.

Figure 17: Kurtosis of the distribution (two densities $f_X(x)$, labeled kurt 1 and kurt 2).

• kurtosis: normalized fourth central moment minus 3;

• kurtosis: whether the distribution is tall and skinny or short and squat compared to the normal distribution.
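The four statistics discussed on the preceding slides can be estimated directly from their definitions. The sketch below uses plain population-style estimators (division by n, no bias correction); the function name and the sample values are assumptions for illustration.

```python
def sample_moments(samples):
    """Return (mean, variance, skewness, kurtosis) of a sample.

    Skewness is the normalized third central moment; kurtosis is the
    normalized fourth central moment minus 3, so a normal distribution
    gives a kurtosis of 0.
    """
    n = len(samples)
    mean = sum(samples) / n
    m2 = sum((x - mean) ** 2 for x in samples) / n
    m3 = sum((x - mean) ** 3 for x in samples) / n
    m4 = sum((x - mean) ** 4 for x in samples) / n
    skewness = m3 / m2 ** 1.5
    kurtosis = m4 / m2 ** 2 - 3
    return mean, m2, skewness, kurtosis


# Hypothetical symmetric sample.
mean, variance, skewness, kurtosis = sample_moments([1, 2, 3, 4, 5])
print(mean, variance, skewness, kurtosis)
```

For this symmetric sample the skewness is 0, and the negative kurtosis indicates a shape that is flatter than the normal distribution.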

Notes on fitting moments:
• if the first 4 moments are fitted, you may expect a fair approximation of the histogram;
• sometimes it is easier to fit the histogram than to fit more than 2 moments.

Figure 18: Distribution to be approximated.

Figure 19: Fitting mean and variance results in a number of forms of a distribution.

• different skewness;
• different length on the $f_X(x)$ axis.

Figure 20: Fitting mean, variance and skewness limits the number of forms of a distribution.

We still have differences:

• different length on the $f_X(x)$ axis.

Figure 21: Fitting mean, variance, skewness and kurtosis may result in the desired distribution.

• if we fit kurtosis we are sure that the length on the $f_X(x)$ axis is the same;
• 4 moments are usually enough to get a good fit.

9.3. Importance of the ACF

Note the following:
• autocorrelation affects the results of performance analysis.

The effect may not be straightforward:

• autocorrelation in packet arrivals usually leads to more losses;
• autocorrelation in bit errors of the wireless channel may lead to fewer losses.

The ACF manifests itself in two effects:

• short-range dependence:
– more losses and larger delays compared to no autocorrelation.
• long-range dependence:
– more losses and larger delays compared to short-range dependent models.

Figure 22: Long and short-range dependence (K(i) versus lag i, with curves for short-range dependence, marked i < 10-20, and long-range dependence, marked i > 50).

• short-range dependence: K(i) = 0 already for some i < 20-30;

• long-range dependence: K(i) ≠ 0 even for i > 50.

Figure 23: Common forms of the normalized ACF in traffic models (K(i) curves: single exponential/geometric component; two exponential/geometric components; power decay, i.e. long-range dependence).

• exponential/geometric decay: short-range dependence;

• power decay: long-range dependence.

Mixture of exponential terms:

$K(i) = \sum_{j=1}^{k} \phi_j \lambda_j^{i}$

Figure 24: Power decay can be approximated by a sum of exponentials (to some extent).

• some models have this kind of ACF;

• example: Markov modulated processes.
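To see why a mixture of geometric terms can imitate a slowly decaying ACF over a limited range of lags, the sketch below evaluates K(i) as a weighted sum of terms of the form weight times rate to the power i. The weights and rates are hypothetical values picked for illustration, not fitted to any trace.

```python
def mixture_acf(weights, rates, lag):
    """K(i) as a weighted sum of geometric terms: sum over j of w_j * r_j ** i."""
    return sum(w * r ** lag for w, r in zip(weights, rates))


# Hypothetical parameters: weights sum to 1 so that K(0) = 1.
weights = [0.6, 0.3, 0.1]
rates = [0.5, 0.9, 0.99]

lags = range(0, 101, 25)
mixture = [mixture_acf(weights, rates, i) for i in lags]
single = [0.5 ** i for i in lags]  # one geometric component for comparison

for i, k_mix, k_single in zip(lags, mixture, single):
    print(i, k_mix, k_single)
```

The component with the rate closest to 1 dominates at large lags, so the mixture stays well above the single geometric term. Beyond some lag, however, every finite mixture still decays geometrically, which is why the approximation of power decay holds only to some extent.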

Note on long-range dependence:
• may exist as a consequence of non-stationarity!
• you have to be pretty sure about stationarity:
– anomalous behavior of the ACF may be a sign of non-stationarity;
– example: the ACF is not strictly decreasing;
– example: the ACF is slowly decreasing.

The following is important:

• observations of stationary traffic usually have a strictly decreasing ACF;
• however, note the following:
– some anomalies can be due to outbursts that may not be important;
– some anomalies are important.
• you have to have intuition!

Figure 25: Traces and NACFs of non-stationary observations (top: traces Y(i) versus time i; bottom: NACFs $K_Y(i)$ versus lag i).

Figure 26: Traces and NACFs of stationary observations (top: traces Y(i) versus time i; bottom: NACFs $K_Y(i)$ versus lag i).


9.4. What statistics are important?

Note the following:
• mean value:
– must be captured;
• variance:
– must be captured;
– one may use standard deviation or coefficient of variation instead.
• lag-1 ACF:
– was found to be important;
– may have an unexpected effect.
• structure of the ACF:
– sometimes affects performance significantly (e.g. long-range dependence).
• histogram of relative frequencies:
– captures all moments of one-dimensional distribution;
– required when you have to be pretty sure.


9.5. Example of importance of parameters

We consider: SBP+SPP/D/1/K queuing system:
• SBP: switched Bernoulli process;
• SPP: switched Poisson process;
• arrivals of SBP have priority over SPP;
• constant service time of one slot;
• K waiting positions;
• parameters of interest: pdf of waiting time fL (l), pdf of losses fQ (q).

Application: frame transmission process over wireless channels:

• SBP: frame error process;
• SPP: frame arrival process;
• limited buﬀer of the mobile terminal;
• single wireless channel.

Figure 27: Effect of the lag-1 autocorrelation coefficient of SPP.

Figure 28: Effect of the variance of SPP.

Figure 29: Effect of the form of the distribution of SPP.

Figure 30: Effect of the lag-1 autocorrelation coefficient of SBP.

Figure 31: Effect of the variance of SBP.

9.6. Common matching schemes

Common matching:
• mean and variance:
– usually easy to do;
– used to get mean performance parameters.
• mean, variance and lag-1 ACF:
– there are a number of models and algorithms;
– relatively easy to do.
• mean and ACF:
– there are a number of models and algorithms;
– sometimes not easy to do.
• histogram:
– one may look for analytical distribution;
– usually easy to do using discrete distribution.
• histogram and ACF.


9.7. Classes of models and characteristics

Classes of models:
• renewal class of models:
– distribution can be arbitrary;
– ACF is zero for all lags: no autocorrelation.
• autoregressive class:
– distribution is normal;
– ACF is a sum of exponential/geometric terms.
• Markov-modulated models:
– distribution can be arbitrary;
– ACF is a sum of exponential/geometric terms.
• models with self-similar properties:
– distribution can be either normal (FBM) or arbitrary (F-ARIMA);
– ACF non-zero for large lags: long-range dependence.


9.8. Recipes
You may use the following when you are to capture:
• first two moments:
– Erlang, hyperexponential, exponential distributions;
– approximation by a discrete distribution: $p_1, p_2, \dots, p_k$ such that $\sum_i p_i = 1$.
• first m moments:
– special case of phase-type distribution;
– approximation by discrete distribution.
• first two moments and lag-1 of ACF:
– Markov modulated processes;
– autoregressive processes.
• first two moments and ACF:
– Markov modulated processes;
– autoregressive processes.


10. Estimating statistics of traﬃc observations

What is special in teletraﬃc:
• usually we have enough statistics to estimate;
• that is, we can capture for days...
• see, Internet Traﬃc Archive: http://ita.ee.lbl.gov/

What does it mean:

• recall the unbiased and consistent estimate of the variance:

$$\sigma^2[X] = \frac{1}{N-1}\sum_{i=1}^{N}(X_i - m)^2. \qquad (8)$$

– we may not care about $1/(N-1)$ and just use $1/N$ when $N$ is sufficiently large.
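As an illustration, the basic statistics of a trace (mean, variance with both normalizations, and NACF) can be estimated as follows. This is a minimal sketch: the function name `sample_statistics` and the tiny example trace are hypothetical, and the NACF estimator shown is one common choice among several.

```python
import numpy as np

def sample_statistics(x, max_lag=20):
    """Return mean, unbiased variance, 1/N variance and NACF up to max_lag."""
    x = np.asarray(x, dtype=float)
    n = x.size
    mean = x.mean()
    centered = x - mean
    sum_sq = np.sum(centered ** 2)
    var_unbiased = sum_sq / (n - 1)   # normalization 1/(N - 1)
    var_biased = sum_sq / n           # normalization 1/N, fine for large N
    # Lag-k autocovariance divided by the lag-0 autocovariance.
    nacf = np.array([np.sum(centered[:n - k] * centered[k:]) / sum_sq
                     for k in range(max_lag + 1)])
    return mean, var_unbiased, var_biased, nacf

# Tiny hypothetical trace, only to show the call.
mean, var_u, var_b, nacf = sample_statistics([1, 2, 3, 4, 5], max_lag=2)
print(mean, var_u, var_b, nacf)
```

For long traces the two variance estimates differ by the factor (N - 1)/N, which is negligible.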

General questions you have to answer at this step:
• is the traffic process ergodic:
– practically, there are no means to test for ergodicity;
– look for reasons for ergodicity of the traffic process;
– ergodic: a single sufficiently long observation can be further used;
– not ergodic: a set of observations must be obtained.
• are there reasons for stationarity of the traffic process?
– practically, there are no means to test for stationarity;
• if stationary (first- or second-order):
– estimate important statistics;
• if not stationary:
– try to change representation of observations;
– another representation may be stationary.


11. Choosing a candidate model

Input information
• parameters of the traﬃc that have to be captured;
• a set of traﬃc models.

What you have to know:

• traﬃc models and their properties;
• analytical tractability of models:
– simulation: any model is suitable;
– analytical: only tractable models are suitable.

Examples:
• analytically tractable: renewal models, Markovian models;
• analytically intractable: most non-Markovian models.


11.1. Example of the problem

Assume we have:
• observations of RV X that is deﬁned on [0, ∞);
• we have to capture the first two moments of the observations:

$$E[X], \quad C^2 < 1. \qquad (9)$$

What one may guess:

• Erlang distribution ($E_2$):
– defined on $[0, \infty)$;
– has $C^2 < 1$.
• shifted exponential distribution:
– defined on $[d, \infty)$;
– has $C^2 < 1$.
• what to choose?

Figure 32: pdfs of shifted exponential and E2 distributions.
(Plot of $f_X(x)$ against $x$. E2: $f_X(x) = b^2 x e^{-bx}$, $x \ge 0$. Shifted exponential: $f_X(x) = b e^{-b(x-d)}$, $x \ge d$.)

Conclusion:
• shifted exponential does not satisfy the implicit requirement X ∈ [0, ∞);
• we choose the Erlang distribution.
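For reference, the first two moments of both candidates can be worked out explicitly. The sketch below assumes that $C^2$ denotes the squared coefficient of variation, $C^2 = \sigma^2[X]/E[X]^2$, and that both distributions are parameterized by a rate $b$ (and a shift $d$), as in Figure 32.

```latex
% E2 (two exponential phases, each with rate b): f_X(x) = b^2 x e^{-bx}, x >= 0
E[X] = \frac{2}{b}, \qquad \sigma^2[X] = \frac{2}{b^2}, \qquad
C^2 = \frac{\sigma^2[X]}{E[X]^2} = \frac{1}{2}.

% Shifted exponential: f_X(x) = b e^{-b(x-d)}, x >= d
E[X] = d + \frac{1}{b}, \qquad \sigma^2[X] = \frac{1}{b^2}, \qquad
C^2 = \frac{1}{(1 + bd)^2} < 1 \quad \text{for } d > 0.
```

Note that $E_2$ reproduces exactly $C^2 = 1/2$ only; an Erlang distribution with $k$ phases gives $C^2 = 1/k$, so other values of $C^2 < 1$ need a different number of phases or a mixture.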


12. Self-similar traﬃc

Note the following:
• it is a common belief nowadays:
– may not be true.
• a way to deal with aggregated network traﬃc:
– may not be the only approach.

Simple example of deterministic self-similar behavior: Cantor set:

• take a certain subspace of the space (assume the square $[0, 1] \times [0, 1]$ in $\mathbb{R}^2$);
• scale its size by 1/3 and place the copies in the corners of the initial subspace;
• do the same for each obtained rectangle;
• continue...
Note: self-similar structures are sometimes called fractals.
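The one-dimensional version of this construction (keep the two outer thirds of every interval) is easy to sketch in code. The function name `cantor_intervals` is hypothetical; the remaining intervals can be read as ON periods of an ON/OFF source.

```python
def cantor_intervals(level):
    """Intervals left after `level` refinement steps of the 1D Cantor construction."""
    intervals = [(0.0, 1.0)]
    for _ in range(level):
        refined = []
        for start, end in intervals:
            third = (end - start) / 3.0
            # Keep the two outer thirds, drop the middle third.
            refined.append((start, start + third))
            refined.append((end - third, end))
        intervals = refined
    return intervals

for level in range(4):
    pieces = cantor_intervals(level)
    total = sum(end - start for start, end in pieces)
    print(level, len(pieces), total)
```

After n steps there are 2^n intervals of total length (2/3)^n, and each piece is a scaled copy of the whole set.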

Figure 33: Illustration of the 2D Cantor set and 1D Cantor set as ON/OFF traffic.
(1D construction: take 2 copies of length 1/3 and place them in the corners; the remaining intervals are read as arrivals.)

Figure 34: Weighted Cantor set with weights 2/3 and 1/3 (we preserve the whole weight).

Figure 35: The coast of England is an example of a fractal: its measured length depends on the scale of measurement.

Figure 36: Stochastic self-similarity in the network traffic.

12.1. Deﬁnition of self-similarity

Assume we are working with:
• {Y (t), t = 0, 1, . . . }: cumulative arrival process:
– may not be stationary!
• {X(t), t = 0, 1, . . . }, X(t) = Y (t + 1) − Y (t): process of increments:
– ﬁrst order diﬀerence process: X(t) = ∇Y (t) (recall ARIMA);
– this process should be covariance stationary with zero mean.
Figure 37: Example of processes {Y(t), t = 0, 1, . . . } and {X(t), t = 0, 1, . . . }.
(Two panels: Y(t) against time t and X(t) against time t, for t = 0, ..., 10.)

Define the aggregated averaged process of {X(t), t = 0, 1, . . . } as:

$$X^{(m)}(n) = \frac{1}{m}\left(X_{nm-m+1} + \cdots + X_{nm}\right) = \frac{1}{m}\sum_{t=m(n-1)+1}^{mn} X(t). \qquad (10)$$

Figure 38: Averaging the process {X(t), t = 0, 1, . . . }.
(Three traces against t: the original {X(n), n = 0, 1, ...}, the aggregated {X^(5)(n), n = 5, 10, ...} and the aggregated {X^(10)(n), n = 10, 20, ...}.)

Note: $X^{(m)}(n)$ is the sample mean of the sequence $X_{nm-m+1}, \dots, X_{nm}$.
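The aggregation step can be written in a few lines. This is a minimal sketch with a hypothetical function name `aggregate`; it uses non-overlapping blocks and silently drops an incomplete last block, which is an assumption not stated in the lecture.

```python
import numpy as np

def aggregate(x, m):
    """Averages of consecutive non-overlapping blocks of length m."""
    x = np.asarray(x, dtype=float)
    n_blocks = x.size // m
    # Drop the incomplete tail, then average each block of m samples.
    return x[:n_blocks * m].reshape(n_blocks, m).mean(axis=1)

print(aggregate([1, 2, 3, 4, 5, 6, 7], 3))
```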

A process {X(t), t = 0, 1, . . . } is exactly second-order self-similar if:

$$\gamma(k) = \frac{\sigma^2}{2}\left[(k+1)^{2H} - 2k^{2H} + (k-1)^{2H}\right], \quad k = 1, 2, \dots, \qquad (11)$$

• $\gamma(k)$ is the ACF of {X(t), t = 0, 1, . . . };
• such structure implies that $\gamma(k) = \gamma^{(m)}(k)$ for all $m \ge 1$;
• $H$ is called the Hurst parameter;
• for self-similar processes $0.5 < H < 1$.
A process {X(t), t = 0, 1, . . . } is asymptotically second-order self-similar if:

$$\lim_{m \to \infty} \gamma^{(m)}(k) = \frac{\sigma^2}{2}\left[(k+1)^{2H} - 2k^{2H} + (k-1)^{2H}\right], \quad k = 1, 2, \dots. \qquad (12)$$

• $\gamma^{(m)}(k)$ is the ACF of $\{X^{(m)}(n), n = 0, 1, \dots\}$.

Note the following:
• exactly self-similar: correlation structure is strictly preserved over all timescales;
• asymptotically self-similar: correlation structure is preserved only in the limit of large time-aggregation.
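The ACF in (11) is simple to evaluate, which helps to see how H controls the decay. A minimal sketch (the function name `self_similar_acf` is hypothetical): for H = 0.5 all lags k ≥ 1 vanish, while for 0.5 < H < 1 the ACF approaches the power law $H(2H-1)k^{2H-2}$ for large k.

```python
import numpy as np

def self_similar_acf(k, hurst, variance=1.0):
    """ACF of an exactly second-order self-similar process, as in (11)."""
    k = np.abs(np.asarray(k, dtype=float))
    return 0.5 * variance * ((k + 1) ** (2 * hurst)
                             - 2 * k ** (2 * hurst)
                             + np.abs(k - 1) ** (2 * hurst))

lags = np.array([1, 2, 5, 10, 100])
print("H = 0.5:", self_similar_acf(lags, 0.5))
print("H = 0.8:", self_similar_acf(lags, 0.8))
# Asymptotic power law for comparison (unit variance).
print("power law:", 0.8 * (2 * 0.8 - 1) * lags ** (2 * 0.8 - 2.0))
```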


12.2. Self-similarity in terms of distribution

Consider the following:
• continuous time process $\{Y(t), t \in \mathbb{R}\}$;

Process $\{Y(t), t \in \mathbb{R}\}$ is self-similar with certain $0 < H < 1$ if:

$$Y(t) \stackrel{d}{=} a^{-H} Y(at), \quad a > 0, \ t \ge 0. \qquad (13)$$

• it means: $\{Y(t), t \in \mathbb{R}\}$ and $\{Y(at), t \in \mathbb{R}\}$ normalized by $a^{-H}$ have the same distribution;
• we usually interpret $\{Y(t), t \in \mathbb{R}\}$ as the cumulative arrival function.

What is important:
• $\{Y(t), t \in \mathbb{R}\}$ cannot be stationary due to the normalization factor $a^{-H}$!
• its increment process can be (but does not have to be) covariance stationary.


12.3. Long range dependence

Determine the variance of $\{X^{(m)}(n), n = 0, 1, \dots\}$ via the variance of {X(t), t = 0, 1, . . . }:

$$\sigma^2[X^{(m)}] = E\left[\left(\frac{1}{m}(X_{nm-m+1} + \cdots + X_{nm}) - \frac{1}{m}E[X_{nm-m+1} + \cdots + X_{nm}]\right)^2\right] =$$
$$= \frac{\sigma^2[X]}{m} + \frac{2\sigma^2[X]}{m^2}\sum_{k=1}^{m}(m-k)r(k) =$$
$$= \sigma^2[X]\left[1 + 2\sum_{k=1}^{m}\left(1 - \frac{k}{m}\right)r(k)\right]m^{-1}. \qquad (14)$$

• $r(k)$, $k = 1, 2, \dots$ is the NACF of {X(t), t = 0, 1, . . . }.

Note: if process {X(n), n = 0, 1, . . . } is uncorrelated, then $\{X^{(m)}(n), n = 0, 1, \dots\}$ is uncorrelated too:

$$r(k) = 0, \ k = 1, 2, \dots, \qquad \sigma^2[X^{(m)}] = \sigma^2[X]\,m^{-1}. \qquad (15)$$

Note: if process {X(n), n = 0, 1, . . . } is correlated, then for large m we have:

$$\sigma^2[X^{(m)}] \approx \sigma^2[X]\left[2\sum_{k=1}^{m} r(k)\right]m^{-1}. \qquad (16)$$

Consider the case when $r(k) \neq 0$ and $\sum_{k=-\infty}^{\infty} r(k) < \infty$:

$$\sigma^2[X^{(m)}] = \sigma^2[X]\,C\,m^{-1}, \qquad (17)$$

• $C$ is some constant;
• sample variances decay to zero as fast as $m^{-1}$.

Models that MAY have such behavior:

• Markovian models;
• ARMA(p, q) and its special cases (MA(q), AR(p)).

Where the sample variance may decay at a slower rate than $m^{-1}$:

• {X(t), t = 0, 1, . . . } is a self-similar process;
• {X(t), t = 0, 1, . . . } is a non-stationary process.

Where in practice: aggregated traffic.

How to model slower than $m^{-1}$ decay?
• one should have decay of $\sigma^2[X^{(m)}]$ proportional to $m^{-\alpha}$, $\alpha \in (0, 1)$; why?
– if $\alpha = 1$ we have limited serial correlation or no correlation at all;
– if $\alpha = 0$ the process degenerates to the constant case.
• this requires that the sum in the expression for $\sigma^2[X^{(m)}]$ be proportional to $m^{1-\alpha}$:

$$\sum_{k=1}^{m} r(k) = C m^{1-\alpha}, \quad \alpha \in (0, 1). \qquad (18)$$

When $\alpha < 1$, the previous expression implies that the ACF decays so slowly that it is not summable:

$$\sum_{k=-\infty}^{\infty} r(k) \to \infty. \qquad (19)$$

An example of such an ACF is a power decaying ACF:

$$r(k) \sim C k^{-\alpha}, \quad \alpha \in (0, 1). \qquad (20)$$

• in this case we say that the ACF decays as a power function.
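The link between the decay of the NACF and the decay of the variance of the aggregated process can be checked numerically from the expression for $\sigma^2[X^{(m)}]$ in (14). The sketch below is illustrative only: the function name `aggregated_variance_ratio` and the test NACF $r(k) = k^{-0.5}$ are assumptions made for the example.

```python
import numpy as np

def aggregated_variance_ratio(nacf, m):
    """sigma^2[X^(m)] / sigma^2[X] computed from the NACF r(k), k = 1..m."""
    k = np.arange(1, m + 1, dtype=float)
    return (1.0 + 2.0 * np.sum((1.0 - k / m) * nacf(k))) / m

def uncorrelated(k):
    return np.zeros_like(k)

def power_decay(k):
    return k ** -0.5   # hypothetical NACF with alpha = 0.5

for name, nacf in [("uncorrelated", uncorrelated), ("power decay", power_decay)]:
    v1 = aggregated_variance_ratio(nacf, 1000)
    v2 = aggregated_variance_ratio(nacf, 10000)
    slope = np.log10(v2 / v1)   # log10(m) grows by exactly 1 between the points
    print(name, "log-log slope:", slope)
```

The uncorrelated case gives a slope of -1, while the power-decaying NACF gives a slope close to -α, that is, a much slower decay of the variance.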

Figure 39: NACFs of short-range dependent and long-range dependent processes.
(Two panels: empirical ACF KX(i) against i = 0, ..., 100, with error bounds ±2/sqrt(n).)


12.4. Values of H
We have the following possibilities:

• $H = 0.5$: process is completely uncorrelated, $\sum_{k=-\infty}^{\infty} r(k)$ is finite;
• $0 < H < 0.5$: $\sum_{k=-\infty}^{\infty} r(k) = 0$, which is rarely observed in applications;
• $H = 1$ leads to $r(k) = 1$, $k = 1, 2, \dots$: there is linear dependence in the series;
• $H > 1$: prohibited due to the stationarity imposed on {X(t), t = 0, 1, . . . }.

Self-similarity and long-range dependence:

• there are long-range dependent processes that are not self-similar;
• there are self-similar processes that are not long-range dependent;
• asymptotic self-similar:
– self-similarity leads to long-range dependence;
– long-range dependence leads to self-similarity.


12.5. Heavy-tailed distributions

Observe the following:
• heavy-tailed distribution is when the complementary distribution function (CPDF) is:

$$\Pr\{Z > z\} \approx z^{-a}. \qquad (21)$$

– $0 < a < 2$ is some constant;
– the tail of the distribution decreases slowly.
• short-tailed distribution is when the CPDF is:

$$\Pr\{Z > z\} \approx e^{-z}. \qquad (22)$$

– the tail of the distribution decreases quickly.

Note the following:
• $0 < a < 1$: infinite mean, infinite variance;
• $1 < a < 2$: finite mean, infinite variance.
We are interested in the case $1 < a < 2$, when only the variance is infinite.

Common heavy-tailed distribution is Pareto:

$$\Pr\{Z \le z\} = 1 - \left(\frac{b}{z}\right)^{a}, \quad b \le z, \qquad (23)$$

• $0 < a < 2$ is the shape parameter;
• $b$ is the location parameter;
• mean (finite for $a > 1$) is given by:

$$E[Z] = \frac{ab}{a-1}. \qquad (24)$$

• variance is infinite.
Note the following:
• gamma distribution has subexponential tail but has a finite variance;
• Weibull distribution has subexponential tail but has a finite variance.
Note: main characteristic is high variability:
• may take very large values with non-negligible probabilities;
• sample: a lot of small values and some values are extremely big.
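Pareto samples are easy to generate by inverting the distribution function: if U is uniform on (0, 1], then $Z = b\,U^{-1/a}$ has the Pareto distribution with shape a and location b. A minimal sketch follows; the function name `sample_pareto`, the parameter values and the seed are illustrative assumptions.

```python
import numpy as np

def sample_pareto(shape_a, location_b, size, rng):
    """Inverse-transform sampling of Pr{Z <= z} = 1 - (b / z)**a, z >= b."""
    u = 1.0 - rng.random(size)   # uniform on (0, 1], avoids division by zero
    return location_b * u ** (-1.0 / shape_a)

rng = np.random.default_rng(1)
z = sample_pareto(1.5, 1.0, 200_000, rng)

# High variability: most values are small, a few are extremely large.
print("median:", np.median(z), "max:", z.max())
# Empirical tail against the theoretical one at z = 2.
print("Pr{Z > 2}:", np.mean(z > 2.0), "theory:", 2.0 ** -1.5)
```

With 1 < a < 2 the sample mean converges slowly and the sample variance does not stabilize at all, so it is safer to compare tail probabilities than moments.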

Figure 40: Examples of short-tailed and heavy-tailed distributions.
(Two panels of pdfs $f_Y(y)$ against $y$; curve labels: normal distribution, exponential distribution, Weibull distribution.)


12.6. Heavy-tails and predictability

Assume the following:
• the lifetime of a certain object is described by RV Z;
• time is discrete t = 0, 1, . . . ;
• {A(t), t = 0, 1, . . . } is an indicator process:
– A(t) = 1: the object is still alive;
– A(t) = 0: the object has already expired.
• we are interested in the probability that the object persists in the future given that it persists now:

$$U(\tau) = \Pr\{A(\tau+1) = 1 \mid A(t) = 1, \ 1 \le t \le \tau\}. \qquad (25)$$

We can express $U(\tau)$ as:

$$U(\tau) = 1 - \frac{\Pr\{Z = \tau\}}{\Pr\{Z \ge \tau\}}. \qquad (26)$$

Assume a short-tailed distribution in the form $\Pr\{Z > x\} \approx c_1 e^{-c_2 x}$:

$$U(\tau) = 1 - \frac{\Pr\{Z = \tau\}}{\Pr\{Z \ge \tau\}} \approx 1 - \frac{c_1 e^{-c_2\tau} - c_1 e^{-c_2(\tau+1)}}{c_1 e^{-c_2\tau}} = 1 - (1 - e^{-c_2}) = e^{-c_2}. \qquad (27)$$

• prediction is the same for all $\tau$!
• recall the exponential distribution.

Assume a long-tailed (heavy-tailed) distribution in the form $\Pr\{Z > x\} \approx c x^{-a}$:

$$U(\tau) = 1 - \frac{\Pr\{Z = \tau\}}{\Pr\{Z \ge \tau\}} \approx 1 - \frac{c\tau^{-a} - c(\tau+1)^{-a}}{c\tau^{-a}} = 1 - \left(1 - \left(\frac{\tau}{\tau+1}\right)^{a}\right) = \left(\frac{\tau}{\tau+1}\right)^{a}. \qquad (28)$$

• prediction is different for all $\tau$!
• when $\tau \to \infty$, $U(\tau) \to 1$!
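Both cases reduce to the ratio of two tail values, $U(\tau) \approx \mathrm{tail}(\tau+1)/\mathrm{tail}(\tau)$, which makes the difference easy to tabulate. A minimal sketch; the function name `persistence` and the parameter values $c_2 = 0.3$ and $a = 1.5$ are illustrative assumptions.

```python
import math

def persistence(tail, tau):
    """U(tau) under the approximations used above: tail(tau + 1) / tail(tau)."""
    return tail(tau + 1) / tail(tau)

def exponential_tail(x, c2=0.3):
    return math.exp(-c2 * x)

def power_tail(x, a=1.5):
    return x ** -a

for tau in (1, 10, 100, 1000):
    print(tau,
          round(persistence(exponential_tail, tau), 4),
          round(persistence(power_tail, tau), 4))
```

The exponential column stays constant (no memory), while the power column grows towards 1: the longer the object has lived, the more likely it is to survive one more slot.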


12.7. Heavy-tails as a cause of LRD

Let us introduce the following:
• FBM: fractional Brownian motion:
– non-stationary Gaussian process with 0 < H < 1.
• FGN: fractional Gaussian noise:
– stationary increment process of FBM with 0 < H < 1;
– distribution is also Gaussian.
Note that when H = 0.5:
• FBM reduces to Brownian motion:
– also non-stationary process.
• FGN reduces to Gaussian noise:
– stationary process;
– completely uncorrelated.
Note: for Gaussian processes distributional and second-order self-similarity are equivalent!

Consider the following:
• N independent on/off sources $X_i(t)$, $i = 1, 2, \dots, N$;
• aggregated process $S_N(t) = \sum_{i=1}^{N} X_i(t)$.

Figure 41: Examples of on/oﬀ sources and their aggregation.

Consider the cumulative process $Y_N(Tt)$:

$$Y_N(Tt) = \int_0^{Tt}\left(\sum_{i=1}^{N} X_i(s)\right)ds. \qquad (29)$$

• $T > 0$ is a scale factor.

If the following holds:

$$\Pr\{\tau_{on} > x\} \approx c x^{-a}, \quad x \to \infty, \ 1 < a < 2, \ c > 0, \qquad (30)$$

then for large $T$ and $N$ the process $Y_N(Tt)$ behaves as:

$$\frac{E[\tau_{on}]}{E[\tau_{off}] + E[\tau_{on}]}\,N T t + C N^{1/2} T^{H} B_H(t) \qquad (31)$$

• $H = (3 - a)/2$ is the Hurst parameter;
• $B_H(t)$ is FBM with Hurst parameter $H$;
• $C > 0$ is a constant depending on the distributions of $\tau_{on}$ and $\tau_{off}$;
• distribution of the off times can be arbitrary (short-tailed or heavy-tailed).

What we can say about $Y_N(Tt)$:
• long-range dependent ($0.5 < H < 1$) if on intervals are heavy-tailed with $1 < a < 2$:
– distribution of off intervals does not matter.
• long-range dependent ($0.5 < H < 1$) if off intervals are heavy-tailed with $1 < a < 2$:
– distribution of on intervals does not matter.
• if both off and on intervals are short-tailed, it is short-range dependent:
– heavy tails are required to have self-similarity.

Practice:
• the distribution of file sizes is heavy-tailed;
• generator: web site with downloading capabilities.


12.8. Estimating H: variance-time plot

What property we are going to use:
• a self-similar process has slowly decaying variances as m increases:

$$\sigma^2[X^{(m)}] = \sigma^2[X]\,m^{-\beta}. \qquad (32)$$

• where $H = 1 - \beta/2$.
You have to do the following:
• determine several m (it is better to use m = 1, 10, 100, 1000, . . . );
• compute $\log_{10}\sigma^2[X^{(m)}]$ and $\log_{10} m$ for each m;
• ignore small values of m;
• fit a least-squares line through the rest of the resulting points in the plane;
• estimate the Hurst parameter as:

$$H = 1 - \frac{\beta}{2} \qquad (33)$$

– where $-\beta$ is the estimated asymptotic slope.
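The procedure above can be sketched as follows. This is an illustrative implementation, not the lecturer's code: the function name `variance_time_hurst`, the block sizes and the white-noise test trace are assumptions, and small block sizes are skipped simply by not listing them.

```python
import numpy as np

def variance_time_hurst(x, block_sizes):
    """Estimate H from the slope of log10 var(X^(m)) against log10 m."""
    x = np.asarray(x, dtype=float)
    log_m, log_var = [], []
    for m in block_sizes:
        n_blocks = x.size // m
        if n_blocks < 2:
            continue   # not enough blocks to estimate a variance
        block_means = x[:n_blocks * m].reshape(n_blocks, m).mean(axis=1)
        log_m.append(np.log10(m))
        log_var.append(np.log10(block_means.var()))
    slope, _intercept = np.polyfit(log_m, log_var, 1)
    beta = -slope
    return 1.0 - beta / 2.0

# Uncorrelated Gaussian noise should give H close to 0.5.
rng = np.random.default_rng(0)
white_noise = rng.normal(size=2 ** 17)
print(variance_time_hurst(white_noise, [10, 20, 50, 100, 200, 500, 1000]))
```

For large m only a few blocks remain, so the rightmost points of the plot are noisy; this limits how far the block sizes can usefully be extended for a given trace length.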

Figure 42: Example of variance-time plot and ACF of self-similar process (H ≈ 0.82).
(Two panels: variance-time plot (log10 of the variance of the aggregated process against log10(m)) and empirical ACF KX(i) against i = 0, ..., 100 with error bounds ±2/sqrt(n).)

Figure 43: Example of variance-time plot and ACF of a process without self-similarity (H ≈ 0.48).
(Two panels: variance-time plot (log10 of the variance of the aggregated process against log10(m)) and empirical ACF KX(i) against i = 0, ..., 100 with error bounds ±2/sqrt(n).)


12.9. Estimating H: R/S statistics

What we use here:
• R/S (rescaled/adjusted range) statistics:
– practically, measure of decline of variance.
• R/S statistics for self-similar process:

$$E[R(n)/S(n)] \approx c\,n^{H}, \quad n \to \infty. \qquad (34)$$

• estimate of H: slope of log-log plot of R/S statistics.

How to compute R/S statistics: for each value d = 10, 20, 30, . . . do:
• compute K points of R/S statistics $R(t_i, d)/S(t_i, d)$;
• starting points must satisfy $(t_i - 1) + d \le N$;
• estimate H as the slope of log-log graph of R/S statistics.
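A simple version of this procedure is sketched below. It is one possible reading of the steps, under stated assumptions: starting points are taken as non-overlapping windows, the R/S values for each d are averaged before the line is fitted, and the names `rescaled_range` and `rs_hurst` are hypothetical.

```python
import numpy as np

def rescaled_range(window):
    """R/S of one window: range of cumulative deviations over the standard deviation."""
    deviations = window - window.mean()
    cumulative = np.cumsum(deviations)
    r = cumulative.max() - cumulative.min()
    s = window.std()
    return r / s if s > 0 else np.nan

def rs_hurst(x, window_sizes):
    """Slope of log10(mean R/S) against log10(d)."""
    x = np.asarray(x, dtype=float)
    log_d, log_rs = [], []
    for d in window_sizes:
        starts = range(0, x.size - d + 1, d)   # every start t satisfies t + d <= N
        values = [rescaled_range(x[t:t + d]) for t in starts]
        log_d.append(np.log10(d))
        log_rs.append(np.log10(np.nanmean(values)))
    slope, _intercept = np.polyfit(log_d, log_rs, 1)
    return slope

rng = np.random.default_rng(0)
white_noise = rng.normal(size=2 ** 15)
print(rs_hurst(white_noise, [32, 64, 128, 256, 512, 1024]))
```

For uncorrelated data the estimate is typically slightly above 0.5 because R/S is biased for short windows, so small deviations from 0.5 should not be over-interpreted.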

Figure 44: Algorithm to compute R/S statistics.

Figure 45: Estimating Hurst parameter using R/S statistics (H ≈ 0.9).
(Log-log plot of log(R/S) against log d.)

12.10. Caution!
Note the following:
• self-similar processes:
– cumulative process {Y (t), t = 0, 1, . . . } may not be stationary;
– increment process {X(t), t = 0, 1, . . . }, X(t) = Y(t + 1) − Y(t), must be stationary;
– note: increment process is just the first difference process.
• some traffic seems not to be stationary at all:
– deterministic variations: hourly, daily (recall PSTN traffic)!
Stationarity of the process:
• in real traffic depends on the timescale at which traffic is measured;
• recent hypothesis:
– short timescales: stationary behavior;
– long timescales: non-stationary behavior.
Another problem: distinguishing between self-similarity and non-stationarity.

Illustrative example:
• we generate a process whose segments have different means and variances;
• is this process self-similar?
– NO: both {Y (t), t = 0, 1, . . . } and X(t) = Y (t + 1) − Y (t) are non-stationary!
– can you tell that by observing only the left figure?

Figure 46: First 5E5 and 1E4 observations of non-stationary trace.

What people usually do:
• estimate Hurst parameter or NACF;
• this leads to an incorrect conclusion about self-similarity and non-stationarity!

Figure 47: NACF and variance-time plot for non-stationary trace (H ≈ 0.76!!!).
(Two panels: variance-time plot (log10 of the variance of the aggregated process against log10(m)) and empirical ACF KX(i) against i = 0, ..., 100 with error bounds ±2/sqrt(n).)


13. Fitting parameters of models

Note the following:
• no general algorithms;
• algorithms are speciﬁc for a class of models;
• there could be more than a single algorithm for a chosen model;
• there could be no algorithm for a chosen model.

General procedure:
• determine parameters of the model:
– these parameters must completely characterize a model;
– not only parameters you are going to capture.
• derive equations relating measured statistics and parameters:
– note that some parameters can be free.

14. Testing accuracy of the model

Note the following:
• testing is sometimes not needed:
– when you exactly match parameters.
• testing is sometimes needed:
– when an approximation is used at a certain step.

Tests:
• compare distribution and empirical data:
– χ2 test;
– Smirnov’s test.
• compare autocorrelations:
– just visually comparing;
– Q-Q graph.


15. Example
What we have to do:
• propose a model of the aggregated traﬃc;
• capture histogram and ACF as close as possible;
• model should be further used in simulation study.

Figure 48: The point at which traffic is to be modeled.
(Diagram labels: "network side"; "1, ...".)


15.1. Measuring the traﬃc at the point of interest

We carried out two sufficiently long measurements:
• reality: 2000 observations may not be sufficiently long!
• disclaimer: these observations do not represent real traffic of any kind!

Figure 49: Traffic observations obtained in two experiments.
(Two panels, (a) Experiment 1 and (b) Experiment 2: trace Y(i) against i = 0, ..., 2000, with values between 0 and 30.)


15.2. Estimating statistics

What you may guess?
• are they stationary ergodic?
• what kind of distribution do these traces come from?
• is the approximating distribution the same for both traces?
• which model to use to capture statistics?

What to do to get basic knowledge:

• compute statistics;
• analyze statistics to identify properties.

What statistics we usually start with in MODELING:

• histogram of relative frequencies;
• normalized autocorrelation function.

Histograms look as follows:
• are they really normal?
• testing using $\chi^2$: yes with level of significance α = 0.1!

Figure 50: Histograms of presented traces with normal approximations.
(Two panels, (a) Experiment 1 and (b) Experiment 2: relative frequencies $f_{i,E}(\Delta)$ against $i\Delta$.)

NACFs look as follows:
• we have no anomalies;
• such NACFs are typical of stationary processes.

Figure 51: Normalized ACFs of presented traces with geometric approximations.
(Two panels, (a) Experiment 1 and (b) Experiment 2: KY(i) against lag i = 0, ..., 10.)


15.3. Choosing a candidate model

What are our observations:
• observations are stationary ergodic: assumption;
• empirical distribution is normal;
• ACF decays according to a single exponential/geometric term.
Which model to guess:
• autoregressive model of order 1, AR(1):

$$Y(n) = \phi_0 + \phi_1 Y(n-1) + \epsilon(n), \quad n = 1, 2, \dots, \qquad (35)$$

– $\phi_0$ and $\phi_1$ are some parameters, $\epsilon(n) \sim N(0, \sigma^2)$;
– marginal distribution is normal, NACF $K(i) = \phi_1^{i}$, $i = 0, 1, \dots$.
• Markov modulated model:
– may approximate Normal distribution;
– NACF is a sum of exponential/geometric terms.


15.4. Fitting AR(1) model

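What we have to do: estimate the following:

$$\phi_0, \ \phi_1, \ \sigma^2[\epsilon]. \qquad (36)$$

Properties of AR(1) model:

• if AR(1) process is covariance stationary we have:

$$E[Y] = \mu_Y, \quad \sigma^2[Y] = \gamma_Y(0), \quad \mathrm{Cov}(Y_0, Y_i) = \gamma_Y(i). \qquad (37)$$

• $\mu_Y$, $\sigma^2[Y]$ and $\gamma_Y(i)$ of AR(1) are related to $\phi_0$, $\phi_1$ and $\sigma^2[\epsilon]$ as:

$$\mu_Y = \frac{\phi_0}{1-\phi_1}, \quad \sigma^2[Y] = \frac{\sigma^2[\epsilon]}{1-\phi_1^2}, \quad \gamma_Y(i) = \phi_1^{i}\gamma_Y(0). \qquad (38)$$

• $\phi_0$, $\phi_1$ and $\sigma^2[\epsilon]$ are related to statistics of observations as:

$$\phi_1 = K_X(1), \quad \phi_0 = \mu_X(1-\phi_1), \quad \sigma^2[\epsilon] = \sigma^2[X](1-\phi_1^2), \qquad (39)$$

– $K_X(1)$, $\mu_X$ and $\sigma^2[X]$ are the lag-1 value of the ACF, mean and variance of observations.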
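The fitting step amounts to three sample statistics. The sketch below fits AR(1) by matching the mean, variance and lag-1 NACF, and generates a trace from the fitted model; the names `fit_ar1` and `generate_ar1`, the parameter values and the seed are illustrative assumptions.

```python
import numpy as np

def fit_ar1(x):
    """Match mean, variance and lag-1 NACF of the observations."""
    x = np.asarray(x, dtype=float)
    mean, var = x.mean(), x.var()
    centered = x - mean
    lag1 = np.sum(centered[:-1] * centered[1:]) / np.sum(centered ** 2)
    phi1 = lag1
    phi0 = mean * (1.0 - phi1)
    noise_var = var * (1.0 - phi1 ** 2)
    return phi0, phi1, noise_var

def generate_ar1(phi0, phi1, noise_var, size, rng):
    """Generate Y(n) = phi0 + phi1 * Y(n-1) + eps(n), started in steady state."""
    y = np.empty(size)
    stationary_std = np.sqrt(noise_var / (1.0 - phi1 ** 2))
    y[0] = phi0 / (1.0 - phi1) + rng.normal(0.0, stationary_std)
    noise = rng.normal(0.0, np.sqrt(noise_var), size)
    for n in range(1, size):
        y[n] = phi0 + phi1 * y[n - 1] + noise[n]
    return y

# Round trip on synthetic data: generate with known parameters, then refit.
rng = np.random.default_rng(7)
trace = generate_ar1(3.0, 0.7, 4.0, 50_000, rng)
print(fit_ar1(trace))
```

The round trip is only a sanity check of the formulas; on measured traces the fitted model still has to be tested against the histogram and the full NACF.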


15.5. Testing for accuracy of ﬁtting

Why we need it:
• we were asked to capture histogram and NACF;
• we ﬁt only ﬁrst two moments and lag-1 value of ACF!

Is there a case when we do not need testing:

• assume we were to capture only $\mu_X$, $\sigma^2[X]$ and $K_X(1)$;
• since we explicitly fit them, AR(1) model exactly represents them.

What allows us to assume we get a fair approximation:

• AR(1) model is characterized by only three parameters that were all matched;
• distribution of AR(1) model is normal;
• NACF of the AR(1) model decays geometrically.

The first step:
• generate a trace from the model:
– for simplicity you may generate exactly the same number of observations.

Test histograms using χ2 or Smirnov’s test for two samples:

• ﬁrst sample: empirical observations;
• second sample: generated from model;
• hypotheses to be tested:
– H0 : distributions of two samples are the same;
– H1 : distributions of both samples are diﬀerent.

Test NACFs:
• you may carry out visual test by plotting NACFs of both samples;
• you may test for significant correlation using the Ljung-Box statistic.
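The Ljung-Box statistic is $Q = N(N+2)\sum_{k=1}^{h} r^2(k)/(N-k)$, where r(k) is the sample NACF; for an uncorrelated series Q is approximately chi-square distributed with h degrees of freedom. The sketch below computes Q directly; the function name `ljung_box_q`, the choice h = 10 and the synthetic traces are illustrative assumptions, and 18.307 is the 0.95 quantile of the chi-square distribution with 10 degrees of freedom.

```python
import numpy as np

def ljung_box_q(x, max_lag):
    """Ljung-Box Q statistic over lags 1..max_lag."""
    x = np.asarray(x, dtype=float)
    n = x.size
    centered = x - x.mean()
    denom = np.sum(centered ** 2)
    q = 0.0
    for k in range(1, max_lag + 1):
        r_k = np.sum(centered[:-k] * centered[k:]) / denom
        q += r_k ** 2 / (n - k)
    return n * (n + 2) * q

CHI2_95_DF10 = 18.307   # 0.95 quantile of chi-square with 10 degrees of freedom

rng = np.random.default_rng(3)
white_noise = rng.normal(size=2000)

# Correlated trace: AR(1) with phi1 = 0.8 driven by the same kind of noise.
correlated = np.empty(2000)
correlated[0] = 0.0
shocks = rng.normal(size=2000)
for n in range(1, 2000):
    correlated[n] = 0.8 * correlated[n - 1] + shocks[n]

for name, trace in [("white noise", white_noise), ("AR(1)", correlated)]:
    q = ljung_box_q(trace, 10)
    print(name, "Q =", round(q, 1), "significant correlation:", q > CHI2_95_DF10)
```

One way to use this for model validation is to apply it to the residuals $Y(n) - \phi_0 - \phi_1 Y(n-1)$ of the empirical trace: if AR(1) is adequate, the residuals should show no significant correlation.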

No ratings yet
Football Player Detection Using YOLOv3
10 pages
Calculating The Height of A Building Worksheet
No ratings yet
Calculating The Height of A Building Worksheet
3 pages
Management Theory Jungle
No ratings yet
Management Theory Jungle
4 pages
Astrology Insights: Sun & Rising Signs
50% (2)
Astrology Insights: Sun & Rising Signs
7 pages
Gamma Scalping 101 Gamma Theta Trading
No ratings yet
Gamma Scalping 101 Gamma Theta Trading
13 pages
S-Frame Theory Manual
No ratings yet
S-Frame Theory Manual
81 pages
Syllabus of Assessment X UT-II Semester-II 2024-25
No ratings yet
Syllabus of Assessment X UT-II Semester-II 2024-25
2 pages
Fermat's Theorem and Beyond
No ratings yet
Fermat's Theorem and Beyond
8 pages
SAT Suite Question Bank - Results S
No ratings yet
SAT Suite Question Bank - Results S
18 pages
A Study On The Binary Option Model and Its Pricing
No ratings yet
A Study On The Binary Option Model and Its Pricing
7 pages
Algebra 2 Exam Guide
No ratings yet
Algebra 2 Exam Guide
3 pages
Ch. 18 - Mirrors and Lenses
100% (1)
Ch. 18 - Mirrors and Lenses
28 pages
Intro To Civil-Engg-Tie Simp
No ratings yet
Intro To Civil-Engg-Tie Simp
6 pages
CH 23
No ratings yet
CH 23
28 pages
Multi Wii Pro Manual
100% (5)
Multi Wii Pro Manual
11 pages
Package IC2': R Topics Documented
No ratings yet
Package IC2': R Topics Documented
14 pages
Assignment On Sampling Distribution
No ratings yet
Assignment On Sampling Distribution
2 pages
Parametric Cost Estimating of Sterile Building Using Artificial Neural Network & Genetic Algorithm Model
No ratings yet
Parametric Cost Estimating of Sterile Building Using Artificial Neural Network & Genetic Algorithm Model
8 pages
Me6401 Kom Notes Rejinpaul
No ratings yet
Me6401 Kom Notes Rejinpaul
64 pages
Mastering Mathematics
No ratings yet
Mastering Mathematics
4 pages
Physics I Problems PDF
No ratings yet
Physics I Problems PDF
1 page
Grade 12 Generalphysics LP
No ratings yet
Grade 12 Generalphysics LP
4 pages
Nterharmonics in Ower Ystems: IEEE Interharmonic Task Force, Cigré 36.05/CIRED 2 CC02 Voltage Quality Working Group
No ratings yet
Nterharmonics in Ower Ystems: IEEE Interharmonic Task Force, Cigré 36.05/CIRED 2 CC02 Voltage Quality Working Group
9 pages
Class IX Maths: Angle Constructions
No ratings yet
Class IX Maths: Angle Constructions
16 pages
Industrial - Engineering Syllabus Coal India
No ratings yet
Industrial - Engineering Syllabus Coal India
3 pages
Laws of Motion - Class 11 Physics NCERT Solutions Free PDF Download
No ratings yet
Laws of Motion - Class 11 Physics NCERT Solutions Free PDF Download
51 pages
Algebra 2 Review
No ratings yet
Algebra 2 Review
5 pages
Konechny 2002
No ratings yet
Konechny 2002
113 pages
2022 Hmmb032 Supp Exam
No ratings yet
2022 Hmmb032 Supp Exam
11 pages
Physics Experiment: Work & Energy
No ratings yet
Physics Experiment: Work & Energy
7 pages