U Ur - Thesis
U Ur - Thesis
A THESIS SUBMITTED TO
THE GRADUATE SCHOOL OF NATURAL AND APPLIED SCIENCES
OF
MIDDLE EAST TECHNICAL UNIVERSITY
BY
JUNE 2023
Approval of the thesis:
Date: 09.06.2023
I hereby declare that all information in this document has been obtained and
presented in accordance with academic rules and ethical conduct. I also declare
that, as required by these rules and conduct, I have fully cited and referenced all
material and results that are not original to this work.
Signature :
iv
ABSTRACT
The amount of the existing data is increasing rapidly and any way to understand
the shape of the given data is welcomed. In this thesis, we examine a method of
Topological Data Analysis which is called Persistent Homology. After a detailed
overview of 1-parameter case, we extend the structures to the multiparameter case
arising as a natural necessity. We present the similarities and differences between 1-
parameter and multiparameter persistent homology in an expository way. At the last
stage, we work on the interleaving, bottleneck, matching distances and their stability
properties. We also analyze the relations between these pseudometrics, and consider
some examples that help us to calculate them.
v
ÖZ
Var olan veri miktarı hızla artıyor ve verilen verinin şeklini anlamanın her yolu mem-
nuniyetle karşılanıyor. Bu tezde, Kalıcı Homoloji olarak adlandırılan Topolojik Veri
Analizi’nin bir metodunu inceliyoruz. 1-değişkenli durumun ayrıntılı bir incelemesin-
den sonra, yapıları, doğal bir gereksinim olarak ortaya çıkan çok değişkenli duruma
genişletiyoruz. 1-değişkenli ve çok değişkenli kalıcı homoloji arasındaki benzerlik-
leri ve farklılıkları açıklayıcı bir şekilde sunuyoruz. Son aşamada, serpiştirme, dar-
boğaz, eşleştirme mesafeleri ve kararlılık özellikleri üzerinde çalışıyoruz. Ayrıca, bu
psödometrikler arasındaki ilişkileri de inceliyoruz, ve onları hesaplamamıza yardımcı
olacak bazı örnekleri ele alıyoruz.
vi
To my family
vii
ACKNOWLEDGMENTS
As a first thing, I would like to thank my supervisor Assoc. Prof. Dr. Mehmetcik
Pamuk for guiding me along my journey in master study. He introduced me to this
field, and I have learned so many things from him. With his help and encouraging me
about how to write my thesis, the work that started in his office turned into a master’s
thesis.
Lastly, thanks to my friends for their endless support, and I dedicate my thesis to my
parents: Sakine and Muharrem for everything they have done for me.
viii
TABLE OF CONTENTS
ABSTRACT . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
ÖZ . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi
ACKNOWLEDGMENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii
TABLE OF CONTENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi
CHAPTERS
1 INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2.1 Filtration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
ix
4.1 The Interleaving Distance . . . . . . . . . . . . . . . . . . . . . . . 44
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
x
LIST OF FIGURES
FIGURES
Figure 2.2 The barcode and the 0th and 1st persistence diagrams of the topo-
logical space M obtained by the height function, [13]. . . . . . . . . . . 20
Figure 3.2 Subsets of R2 given in the plane with specified points and the
line L. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Figure 4.1 Intervals where I1 and I2 are outer light purple squares, I3 is the
inner purple square and J is the red one, [4]. . . . . . . . . . . . . . . . 53
xi
xii
CHAPTER 1
INTRODUCTION
The amount of data (discrete finite set of points in a metric space) we produce is
increasing rapidly and any way to understand the data at hand is welcomed. The
data can come from many different areas such as an engineering testbed, a biological
experiment or results of a questionnaire etc. A relatively new branch of mathematics,
called Topological Data Analysis (TDA), proposes to use topology to study properties
of data. It is an interdisciplinary field which has intersections with algebraic topology,
computational geometry, data analysis, computer science and statistics. The main
idea in TDA is to exploit topology to find out the features of the shape of data set and
present these features through an easily implementable summaries. In TDA, one also
defines (pseudo)metrics to distinguish data sets from each other again by examining
their shapes.
Persistent Homology (PH) is one of the most widely used methods in TDA. PH uses
homology which is a tool in algebraic topology to detect holes in topological spaces.
If we only consider a finite set of points, applying the homology functor directly to
this discrete set does not yield any interesting result. In this case, the only non-trivial
homology is the 0-th homology group which only counts the number of points. For
such a construction, one of the methods that PH suggests is instead of looking at the
data set directly, we consider the union of closed balls where centers are the data
points, and the radius is ϵ ≥ 0 (This is a very naive way to study data sets and we do
not mention this method in our thesis, but for motivational purposes, we want to point
this out here in the introduction section). However, we should not look for a single ϵ
since working with a fixed ϵ is unstable with respect to perturbations of data. Also,
such an approach cannot distinguish big holes from small ones. In fact, there exists
1
no canonical ϵ [23]. For these reasons, we examine all topological spaces that change
with respect to the proximity parameter ϵ. With this way, we obtain a nested sequence
of topological spaces called a filtration. Then, we apply the homology functor to these
topological spaces, or to simplicial complexes obtained from these data sets.
Now, we are ready to track changes in topological features via homology such as
where a homology class emerges and what happens to it along the filtration. After
using homology, we obtain persistence modules and by passing through their decom-
positions, we obtain a complete invariant called the barcode as a list of indecompos-
ables giving necessary information about lifespans of connected components, loops
and higher dimensional holes in 1-parameter case. Hence, by starting from low di-
mensional data, we can deduce higher dimensional results. In addition to a set of
discrete points, PH also works on data obtained by digital images, real valued func-
tions defined on a chosen topological space and so on.
We also compare persistence modules and measure "how far from being isomorphic"
by using stable metrics. We introduce the bottleneck and the interleaving distances,
and see that they are equal to each other in 1-parameter setting.
Shifting focus to the multiparameter setting brings new problems to us. The algebraic
structure of multiparameter persistence modules is much more complex, and there
exists no complete invariant for them, recall that for the 1-parameter case, barcode
is a complete invariant. This increases the importance of distances. We extend the
bottleneck and the interleaving distances to the multiparameter case. Moreover, by the
example of Bjerkevik [4], we see that they are no more equal to each other, and unlike
the interleaving distance, the bottleneck distance is not stable. Since computation of
2
the interleaving distance is so hard, this leads us to the matching distance as a stable
surrogate of the previously mentioned distances in the multiparameter setting [20].
3
4
CHAPTER 2
2.1 Filtration
Example 2.1.1. Let K be a compact topological space lying in some Euclidean space,
and h : K → R be a real valued function. For ai ∈ R, define the sublevel set Kai as
where i ∈ N. Clearly, Kai is a subspace of K for all ai ∈ R, and Kai ⊆ Kaj for
all ai < aj . Moreover, K = Kan for some an ∈ R since K is compact. Hence,
we construct a filtration of K indexed by {0, 1, . . . , n} by setting Ki := Kai for
i = 0, 1, . . . , n. In addition, it is called a sublevel set filtration.
Note here that a0 < a1 < . . . < an are the function values, and the filtering function h
is not necessarily continuous. One example of a filtering function, a height function,
is given in Figure 2.2.
5
finite linear combination
k
X k
X
λi xi such that λi = 1,
i=0 i=0
Now, we put simplices together to form more complex shapes, called simplicial com-
plexes which are suitable for combinatorial purposes.
Example 2.1.2. Let us see an example of a filtration obtained from a simplicial com-
plex K. Let h : K → R be a non-decreasing function, that is, for τ ≤ σ, we have
h(τ ) ≤ h(σ).
6
Consider the sublevel set Kai := h−1 (−∞, ai ]. Since h is non-decreasing, Kai is a
subcomplex of K for any ai ∈ R. Then, we have a filtration of K that is a nested
sequence of subcomplexes:
K0 ⊆ K1 ⊆ · · · ⊆ Kn = K
Until now, we have seen examples in which a filtration ends up with a topological
space whose properties we want to understand. On the other hand, we may start with
a set of discrete points sitting in Rn . In this case, instead of attaining it as an ultimate
space, it becomes the initial subspace in the filtration. We can consider points in the
set as vertices, and by a proximity parameter which we will define, we can obtain a
filtration of simplicial complexes. There are many ways to obtain such a filtration,
and we introduce the Vietoris-Rips complex .
From now on throughout this thesis, X denotes a finite set of points in some Euclidean
space unless otherwise stated.
Here, ϵ is called the proximity parameter. When ϵ = 0, we just have discrete set
of points, X = {xi }ni=1 . When ϵ > 0, we need to specify pairwise distances be-
tween points. If two points have a distance less than or equal to ϵ, then we add an
edge between them. After adding all possible edges, we obtain its 1-skeleton. Since
Vietoris-Rips complex is a flag complex, which means that it is maximal among all
abstract simplicial complexes with its 1-skeleton, it is enough to have 1-skeleton to
construct Rips(X)ϵ .
7
Figure 2.1: A filtration consisting of Vietoris-Rips complexes for some values of ϵ
between ϵ = 0 and ϵ = 2.1 where we start with discrete set of points, but for ϵ ≥ 2.1,
the complex turns into a solid disk, [25].
K : K0 ⊆ K1 ⊆ · · · ⊆ Kn ,
we can consider the corresponding sequence of homology groups to gain more infor-
mation. For i ≤ j, there exists an inclusion map Ki ,→ Kj which induces a homo-
morphism between homology groups: fpi,j : Hp (Ki ) → Hp (Kj ) for each dimension
p. Therefore, any filtration induces a sequence of homology groups connected by
homomorphisms induced by inclusions
Remark 2.2.1. When we pass to induced maps, we actually apply homology functor
to inclusion maps with coefficients in Z. Moreover, all of Hp (Ki ) can also be seen
as Z-modules with scalar multiplication. In fact, if we take homology with field co-
efficients, then all the homology groups become vector spaces, and homomorphisms
between them are linear maps. We revisit these points when we define what a persis-
tence module is.
8
Definition 6. [16] The p-th persistent homology groups Hpi,j (K) of K are the im-
ages of the homomorphisms induced by the inclusions Ki ,→ Kj ,
for 0 ≤ i ≤ j ≤ n. Note that Hpi,j (K) is not defined for i > j. Furthermore, the
p-th persistent Betti numbers are defined as the rank of the corresponding persistent
homology groups:
βpi,j := rank(Hpi,j (K)).
In fact, Hpi,j (K) measures the homology classes which persist or disappear through
the filtration levels. More specifically, we count the homology classes of Ki which
are still alive in Kj . Thus, Hpi,j (K) can also be defined as
/ Hpi−1,i (K).
Definition 7. [16] Let α ∈ Hp (Ki ). We say that α is born at Ki , if α ∈
Moreover, α dies at Kj , if fpi,j−1 (α) is a nontrivial element in Hp (Kj−1 ) while fpi,j (α)
is a trivial element in Hp (Kj ). Alternatively, we say that α dies at Kj if it merges with
an older class (a class that is born earlier) when we pass through from Kj−1 to Kj . In
other words, fpi,j−1 (α) ∈
/ Hpi−1,j−1 (K) but fpi,j (α) ∈ Hpi−1,j (K).
Whenever we say that two homology classes merge together at a particular level, we
mean that they become equal as homology group elements, and turns into a single
class. In such cases, we keep the class which is born earlier to keep better track of the
persistence of homology classes. If these two classes are born at the same time, then
we make a choice by keeping one of them. This rule is known as Elder Rule.
9
Definition 9. [16] Let µi,j
p be the number of p-dimensional classes that are born at
numbers:
µi,j i,j−1
p = (βp − βpi,j ) − (βpi−1,j−1 − βpi−1,j )
for all i < j and for all p. We can consider the meaning of βpi,j as the number of p-
dimensional classes born before or at Ki and die after Kj . So, in the equality above,
the left parenthesis gives the number of classes born before or at Ki and die exactly
at Kj while the right parenthesis gives the number of classes born before or at Ki−1
and die exactly at Kj .
We can compute persistent homology groups at each filtration level, and by studying
the lifespans of homology classes, we may draw some conclusions. Our aim is to
have a meaningful summary showing us the information coming from each filtration
level in a gathered form. However, as a first thing, we need to recall basic terminology
of the category theory. Let us define necessary parts:
Definition 10. [23] A partially ordered set (poset for short) is a set P with a binary
relation ≤ (the partial order) satisfying
i) x ≤ x (Reflexivity),
10
Definition 11. [18, 23] A category C consists of:
ii) A set Mor(X, Y ) of morphisms for each pair X, Y ∈ ObC and a distinguished
identity morphism idX ∈ Mor(X, X) for each X ∈ ObC,
iii) Compositions of morphisms, that is, for each f ∈ Mor(X, Y ) and g ∈ Mor(Y, Z),
we have g ◦ f ∈ Mor(X, Z), satisfying
Remark 2.3.2. For notational purposes, later in the thesis, we might opt to denote
F (X) by FX .
F (f )
FX FY
NX NY
GX G(f )
GY
11
We also mention the equivalence of two categories C and D:
G◦F ∼
= idC and F ◦ G ∼
= idD .
Next, we recall some algebraic definitions, but first let us fix some notation. Through-
out the rest of this chapter, R always denotes a commutative ring with unity, and
(T, ≤) denotes a totally ordered set with T ⊆ R.
Definition 16. [21] A T -graded module M over a graded ring R is a module which
is written as a direct sum,
M∼
M
= Mi ,
i∈T
The set {m1 , . . . , mn } is called a generating set of M , and it is minimal if its cardi-
X n
nality is minimal among its generating sets. The R-linear combinations ri mi is
i=1
n
X
called a relation if ri mi = 0.
i=1
12
Definition 18. [21] A free module M is an R-module which admits a basis, or the
zero module. Since a basis is linearly independent, M does not have any relation, and
if β is its basis, then
M∼
M
= R.
b∈β
fi−1 fi
... Mi−1 Mi Mi+1 ...
f g
F1 F0 M 0
Remark 2.3.4. Note that in the above definitions we can replace T which is a totally
ordered set with a poset.
φM
i,j : Mi → Mj
(i) φM
i,i = idMi
(ii) φM M M
i,k = φj,k ◦ φi,j for all i ≤ j ≤ k.
13
Observe that a persistence module M can be seen as a functor from T (the category
of T ) to the category of R-modules. Moreover, persistence modules form a cate-
gory whose morphisms are natural transformations between persistence modules. We
denote the category of T -indexed 1-parameter persistence modules by 1-mod.
Let us choose T as N for the time being and define a special type of persistence
modules:
j ≥ i ≥ m for some m ∈ N.
Although we see the word module in the definition of persistence module, we have
not given a module structure yet. Let us put that structure on M = {Mi , φM
i,j }i,j∈N .
xj−i : Mi → Mj
Theorem 2.4.1. [28] There exists an equivalence between the category of persistence
modules indexed by N of finite type over R and the category of finitely generated non-
negatively graded modules over R[x].
14
Thanks to Theorem 2.4.1, we can consider every persistence module indexed by N
of finite type over R with a module structure coming from a finitely generated non-
negatively graded module over the polynomial ring R[x]. In addition, a persistence
module is finitely generated if and only if it is of finite type.
Theorem 2.4.2. [28] Let D be a principal ideal domain. Then, every finitely gener-
ated D-module decomposes uniquely into cyclic D-modules in the following form:
M m
β
D ⊕ D/di D ,
i=1
where di ∈ D such that di /di+1 and β ∈ Z. In a similar fashion, every finitely gen-
erated graded module M over a graded ring D which is a PID decomposes uniquely
into the following form:
n
M m
M
αi
x D ⊕ xγj D/dj D
i=1 j=1
Consider the set of polynomials given by k[x] where k is a field. It is an N-graded ring
with the usual polynomial addition and multiplication where Ri = {kxi | k ∈ k}. It
is also a principal ideal domain (PID) since k is a field. Therefore, if we take a finitely
generated graded module M over k[x], then by Theorem 2.4.2, M decomposes into
the form:
n
M m
M
M∼
= αi
x k[x] ⊕ xγj k[x]/(xβj ) . (2.1)
i=1 j=1
where βj ≥ 1. In other words, the reason behind working over polynomial ring
with field coefficients is to obtain a PID, and further a simple decomposition for such
persistence modules.
15
In Remark 2.2.1, we emphasized that if we apply homology functor to a filtration
with field coefficients, then homology groups Hp (Ki ) := Mi , i ∈ N become vector
spaces, and the maps fpi,j : Mi → Mj are linear for all i, j ∈ N. That is to say each
homology group becomes a k-module, and each linear map between them is a module
homomorphism. Therefore, we obtain a persistence module.
From now on, we always consider homology with field coefficients. Thus, a persis-
tence module means a collection of vector spaces and linear maps satisfying condi-
tions given in Definition 21. This in turn implies that a persistence module can be
seen as a functor from T to the category of vector spaces, Vect, over a fixed field k.
Furthermore, in the rest of the thesis, for ease of computation, the field k is chosen as
Z2 .
Let M1 and M2 be two persistence modules indexed by the same totally ordered set
T . We can define a new persistence module by considering their direct sum given as:
1 ⊕M2
(M1 ⊕ M2 )i = ((M1 )i , (M2 )i ) and φM
i,j = (φM M2
i,j , φi,j ).
1
We can further generalize this for a family of persistence modules {Mp | p ∈ J} with
the same index set T by considering their direct sums:
M M
M= Mp where Mi = (Mp )i .
p∈J p∈J
We define the zero persistence module O as the persistence module such that Oi is
the zero vector space for all i ∈ T .
16
Definition 24. [8] An interval module kI is a persistence module {Mi }i∈T satisfying
the conditions:
ii) For k ≤ l, φk,l = idk where k, l ∈ I and φk,l is the zero map otherwise.
Proof. The set of morphisms, {ϕs }s∈T , from kI to itself is the endomorphism ring of
kI . Let us consider ϕs for some s ∈ I. It is a linear map from the field k to itself. So, it
sends x ∈ k to λx for some λ ∈ k. Since all diagrams are commutative, when we fix
ϕs as a multiplication by λ ∈ k, it implies each ϕt is also a multiplication by the same
λ for all t ∈ I. Thus, End(kI ) is isomorphic to k. Assume that kI is decomposable
such that kI ∼= N1 ⊕ N2 . Let π1 be the projection of kI to the first summand of its
decomposition. Clearly, π1 ∈ End(kI ) and π1 ◦ π1 = π1 . Thus, π1 is idempotent in k.
Since idempotents are just 0 and 1 in a field, π1 is the zero or identity map. If π1 = 0,
then it implies N1 = 0, and this gives a contradiction. If π1 = idkI , then it implies
N2 = 0 and this gives a contradiction. Therefore, kI is indecomposable.
Our first aim, while working with 1-parameter persistence modules, is to decompose
them as the sum of interval modules provided by Equation 2.1.
The index set in Theorem 2.4.4 can actually be extended to R with suitable finiteness
conditions (for more details, see [10, Proposition 3]).
i) M is pointwise finite dimensional (p.f.d. for short) if dim(Mi ) < ∞ for all
i ∈ N.
17
ii) M is said to be tame if for all i ∈ N, dim(Mi ) < ∞.
is called q-tame.
Clearly, for a persistence module, being p.f.d. is equivalent to being tame. Further-
more, every p.f.d. persistence module is q-tame, but the converse is not true.
Remark 2.4.1. The relation between a p.f.d. persistence module and a persistence
module of finite type is the following: Every persistence module of finite type is
p.f.d. but the converse is not true.
We stated that finite type persistence modules over k indexed by N decompose into
interval modules. The following result of Botnan and Lesnick [8] generalizes the
result by extending the index set from N to T and persistence modules from finite type
to p.f.d. Thus, [8, Theorem 3.4] shows that 1-parameter p.f.d. persistence modules
are also interval decomposable:
M∼
M
= kI .
I∈B(M )
The multiset B(M ) is one of our meaningful summaries which is called the barcode
of M .
18
The result of Theorem 2.4.5 is also obtained in the work of Dey and Wang [15]. They
stated that the result comes from a more general concept called quiver theory and
gives the result in the following form:
M∼
M
= k⟨bj ,dj ⟩
j∈J
where ⟨bj , dj ⟩ stands for one of the intervals: [bj , dj ], [bj , dj ), (bj , dj ], (bj , dj ).
Let us put together what we did so far. We start with a filtration indexed by N obtained
from the initial data. Then, we apply homology functor with field coefficients to this
filtration. So, we get a persistence module. If it is pointwise finite dimensional, then
we see that it is decomposed into interval modules. Since our indices comes from
N, all interval modules in the decomposition are of the form [i, j) which means a
homology class is born at i-th level, and dies at j-th level. We usually work on this
case, but the theory can be extended to the index set T ⊆ R.
From the beginning, understanding the shape of data is our main aim. This also
covers how we can make a distinction among them, and the answer is that we can
examine the corresponding persistence modules. Another question arises from here:
Is there any complete invariant for 1-parameter setting? First of all, an invariant
means a function from a set of structures, such as graded modules, to some set sending
isomorphic structures to the same value. Moreover, we say an invariant is complete
if it always assigns different values to non-isomorphic structures.
In our case, the structure set is the set of p.f.d. persistence modules and we send
them to their barcodes. With this construction, the barcode is a complete invariant.
Thus, if barcodes of two persistence modules are not equal to each other, then we can
19
Figure 2.2: The barcode and the 0th and 1st persistence diagrams of the topological
space M obtained by the height function, [13].
conclude that they are not isomorphic. Moreover, the computation of the barcode is
independent from the underlying field of a persistence module.
In Theorem 2.4.5, we defined the barcode of a persistence module. The other well-
known meaningful summary is the persistence diagram. Actually, a barcode and a
persistence diagram are very useful to visualize the persistent homology. If we work
on a p.f.d. persistence module indexed by N, then we can define the persistence
diagram of it as follows:
Definition 26. [16] The p-th persistence diagram is a multiset of points in the
extended real plane, R̄2 , and it is denoted as Dgmp (M ) for a persistence module M .
We draw a point at (i, j) if µi,j
p > 0. When the multiplicity increases, we make those
points slightly bigger. If a class is born at Ki and never dies, then we draw a point at
(i, ∞) where the second coordinate can be putted thanks to the extension of R2 . The
index persistence of a class is the vertical distance of the point to the diagonal.
Example 2.4.1. [13] Consider the surface M in R3 given by Figure 2.2. The filtration
function f : M → R is defined by a projection onto the value of z-coordinate. This
is an example of a height function, and we can construct the filtration by choosing
Ma = f −1 (−∞, a] for all a ∈ R.
The vertical red bars correspond to the lifespans of 0-dimensional homology classes.
The first homology class appears when Ma is non-empty for the first time for some
20
a ∈ R, and it never vanishes. On the other hand, after the second homology class
emerges, it merges with the former when the subspace becomes connected, and stops
at that level because of the Elder Rule.
We can find βpi,j from persistence diagram by considering the upper, left quadrant
of the point (i, j). The quadrant is closed along its vertical right side since we can
consider classes die at Kk such that j < k, while the horizontal lower side is not
included since if some class dies at Kj , then it cannot be counted for βpi,j . The number
of points with their multiplicities is equal to βpi,j . Let us see this situation in the
following lemma:
21
diagrams by attaching different tick directions depending on the form of the interval.
For further details, see [12].
If we work on our classical way (i.e. p.f.d. persistence modules indexed by N), then
barcodes and persistence diagrams are equivalent to each other, hence the persistence
diagram is also a complete invariant on them.
We have seen how to obtain a persistence module by starting from a filtration. Then,
we have seen two meaningful summaries to visualize persistent homology: a barcode
and a persistence diagram. Now, what are the other ways to compare two persistence
modules? One of the answers is that we can define a distance between two persistence
modules, and try to draw some conclusions about whether they are isomorphic or not.
Furthermore, we can also classify non-isomorphic ones with respect to the distance.
In this part of the thesis, we discuss two well-known distances: the bottleneck and the
interleaving distances.
We give the definition of the bottleneck distance by following [6]: Let us start with
two p.f.d. persistence modules M and N . Then, according to Theorem 2.4.5, we can
write both modules as sums of interval modules constructing the barcodes C and D
of M and N , respectively. Thanks to the relation between a barcode and a persis-
tence diagram, we can also see elements of the multisets C and D in their persistence
diagrams. Then, we define a matching X between C and D which is a bijection
′ ′ ′ ′
σ : C → D between C ⊆ C and D ⊆ D. Moreover, if we do not see an element
of C or D in the bijection σ, then the element is called unmatched, and we match this
element with its nearest point on the diagonal, {(x, x) | x ∈ R}, of the persistence
diagrams.
′ ′
Define the cost c(I, J) of a matching between I = ⟨a, b⟩ ∈ C and J = ⟨c, d⟩ ∈ D as
22
the cost of a matching X , c(X ), in the following way:
c(X ) := max sup c(I, J), sup c(I) .
(I,J)∈X unmatched I∈C∪D
Remark 2.5.1. To find the bottleneck distance between two persistence modules, their
barcodes must exist, and we satisfy this condition by working on p.f.d. persistence
modules. However, this does not mean that we can find the bottleneck distance only
for p.f.d. ones. For instance, the definition can be extended to q-tame persistence
modules because of Proposition 2.4.8.
The importance of the bottleneck distance comes from the fact that it is stable under
small perturbations of the filtration function or the starting set of points. That is, the
perturbations cause smaller difference in terms of the bottleneck distance. Let us see
the stability of dB in detail:
where j ∈ R and i ∈ N is fixed. Assume that both of the persistence modules are
p.f.d. so that they decompose into interval modules. Therefore, we can pass through
the barcodes of them, namely B(M f ) and B(M g ) and see the stability feature:
23
Theorem 2.5.1. [6] The bottleneck distance satisfies the following:
Let P and Q be two finite set of points in Rn having the same cardinality. Then,
we construct Vietoris-Rips complexes: Rips(P )ϵ , Rips(Q)ϵ with the same ϵ ∈ R
values. After applying homology functor, we obtain two persistence modules. With
the assumption making them pointwise finite dimensional, we have the barcodes:
B(Hi (Rips(P ))), B(Hi (Rips(Q))), where i ∈ N is fixed, and give the stability of
the bottleneck distance between them:
Theorem 2.5.2. [6] For any bijection σ : P → Q such that ||p − σ(p)|| ≤ ϵ for all
p ∈ P , we have
We find the bottleneck distance between two barcodes (or equivalently persistence
diagrams) of persistence modules. Instead, we can directly work on persistence mod-
ules, and look for a different distance structure.
to M (ϵ).
Definition 28. [8, 23] We say that M and N are ϵ-interleaved if there exists mor-
phisms
ϕ : M → N (ϵ) ψ : N → M (ϵ)
24
such that the following diagrams commute for all i ≤ j:
φM
i,i+2ϵ
Mi Mi+2ϵ Mi+ϵ
ϕi ψi+ϵ ψi ϕi+ϵ
φN
i,i+2ϵ
Ni+ϵ Ni Ni+2ϵ
making the triangle diagrams given above and the following parallelogram diagrams
commutative:
φM
i,j φM
i+ϵ,j+ϵ
Mi Mj Mi+ϵ Mj+ϵ
ϕi ϕj ψi ψj
φN
i+ϵ,j+ϵ φN
i,j
Ni+ϵ Nj+ϵ Ni Nj
Indeed, ϵ-interleaved relation measures how far two persistence modules from being
isomorphic. The next proposition is well-known in PH and easy to see.
Proposition 2.6.1. M and N are isomorphic persistence modules if and only if they
are 0-interleaved.
The following proposition is also well-known for which we provide our proof.
Proposition 2.6.2. If two persistence modules M and N are ϵ-interleaved, then they
are also δ-interleaved for all δ > ϵ.
′ ′
Proof. Define two linear maps ϕi : Mi → Ni+δ , ψi : Ni → Mi+δ as the following
compositions:
φM
i+ϵ,i+δ
Mi Mi+ϵ Mi+δ
′
ϕi ψi
ϕi ′
ψi
Ni+ϵ Ni+δ Ni
φN
i+ϵ,i+δ
25
With varying i ∈ R, we have two morphisms ϕ′ : M → N (δ), ψ ′ : N → M (δ).
We need to show triangle diagrams in the Definition 28 commutes. Observe that the
following diagram commutes:
Ni+ϵ Ni+δ
Mi Mi+2δ
′ ′
ϕi ψi+δ
Ni+δ
Mi+ϵ Mi+δ
ψi ϕi+ϵ ϕi+δ
commutes. Thus,
Mi+δ
′ ′
ψi ϕi+δ
Ni Ni+2δ
Definition 29. [15] Let M and N be two R-indexed persistence modules. We define
their interleaving distance dI (M, N ) as
If no such ϵ exists, then define dI (M, N ) = ∞. In this case, we also say they are
∞-interleaved.
26
and N , is 0, then we cannot guarantee that they are 0-interleaved. So, dI (M, N ) = 0
does not imply M and N are isomorphic in all cases.
Example 2.6.1. Let M and N be two persistence modules such that M0 = Z2 , Mi = 0
for all i ∈ R \ {0} and Ni = 0 for all i ∈ R. Then, they are not isomorphic, but
dI (M, N ) = 0. As a second example, consider interval modules k(3,5) and k[3,5) .
Although the interleaving distance between them is 0, they are not isomorphic.
iii) d(x, z) ≤ d(x, y) + d(y, z) for all x, y, z ∈ X where d(x, y), d(y, z) < ∞.
Our last example implies that the interleaving distance is an extended pseudometric on
persistence modules indexed by R. On the other hand, if we work on such modules
indexed by N, and exclude classes born and died at the same time, then dI is an
extended metric.
Throughout this thesis, the word "distance" always means an extended pseudometric.
The following proposition shows how the interleaving distance changes when we take
direct sums of persistence modules.
27
then we have
dI (M, N ) ≤ sup{dI (Mp , Np ) | p ∈ J}.
The interleaving distance and the bottleneck distance have different constructions,
and we want to find a relation between them: Are they equal to each other? If not, can
we say that one of them is always bigger than the other? These are natural questions,
and to answer these questions, we need more.
Proposition 2.6.4. [12] Let I = ⟨a, b⟩, J = ⟨c, d⟩ be two intervals and kI , kJ be the
corresponding interval modules. Then, we have
where the right hand side of the inequality comes from a matching between I and J
in the process to find the bottleneck distance. We can also consider I is unmatched as
another case, i.e., it is matched with its nearest point on the diagonal. Thus, we have
the following analogous equality:
1
dI (kI , O) = c(I) = (b − a)
2
where O is the zero persistence module.
Now, we are ready to give the main result of 1-parameter persistent homology.
Theorem 2.6.5. (The Isometry Theorem) [22, Theorem 3.4] Let M and N be two
p.f.d. persistence modules. Then,
In the literature, the inequality dB (B(M ), B(N )) ≤ dI (M, N ) is called the algebraic
stability theorem, and the converse inequality dI (M, N ) ≤ dB (B(M ), B(N )) is said
to be the converse algebraic stability theorem.
Proof. [12] Let us show that dI (M, N ) ≤ dB (B(M ), B(N )). Since M and N are
pointwise finite dimensional,
M M
M= kIi and N = kJj ,
i∈L1 j∈L2
where Ii ∈ B(M ) and Jj ∈ B(N ). Let X be a matching between the barcodes B(M )
and B(N ) where its cost equals to ϵ. We have three cases:
28
i) If Ii and Jj are matched, then from Theorem 2.6.4, dI (kIi , kJj ) ≤ c(Ii , Jj ) ≤ ϵ.
iii) If Jj is unmatched, then from Theorem 2.6.4, dI (O, kJj ) = c(Jj ) ≤ ϵ. More-
over, for each unmatched interval module, add O to the direct sum decomposi-
tion of M by adding a new index to L1 .
Therefore, index sets in the direct sums become same in terms of cardinality, call it
L. So, after reindexing interval modules with respect to matched ones, we have
M M
M= kIi and N = kJi
i∈L i∈L
where dI (kIi , kJi ) ≤ ϵ for all i ∈ L. Thus, by Theorem 2.6.3, dI (M, N ) ≤ ϵ. Since
the bottleneck distance is the infimum of ϵ values, dI (M, N ) ≤ dB (B(M ), B(N )).
Another proof for the converse algebraic stability theorem is done by Lesnick [22].
The proof of the algebraic stability theorem is given by Chazal, Cohen-Steiner, Glisse,
Guibas and Oudot [12] (also see the work of Bauer and Lesnick [3]).
We have seen that the bottleneck distance is stable, and the interleaving distance is an
extended pseudometric. Since these distances are equal to each other for p.f.d. per-
sistence modules, we conclude that they are both stable and extended pseudometrics.
29
30
CHAPTER 3
Let us see this situation in an example, see Figure 3.1, given by Botnan and Lesnick [8].
As demonstrated, by adding finitely many outliers, the barcode changes a lot. Before
adding outliers, we can easily detect the circular shape of the data from the long bar
by applying the first homology functor. However, after adding them, we cannot see
such a long bar in the barcode, so we cannot determine the circular shape of the data
although it still has that shape. To handle outliers in our data, we need to pass through
multiparameter persistent homology by introducing new parameters such as the den-
sity parameter, that is, a function taking greater value in dense regions of the data than
sparse regions of it. With the help of new parameters, we understand the shape of the
data with higher density, and we can get rid of the complexity coming from outliers.
31
Figure 3.1: (a) Set of points in Rn colorized by a local density estimate. (b) Adding
outliers with low density. (c) The barcode of (a) in 1-dimensional holes. (d) The
barcode of (b) in 1-dimensional holes, [8].
the followings:
i) If i, j ∈ I and i ≤ k ≤ j, then k ∈ I.
L d
b
c
a e
I1 I2 I3
Figure 3.2: Subsets of R2 given in the plane with specified points and the line L.
32
i) I1 is not an interval in R2 since a, b ∈ I1 and a ≤ c ≤ b, but c ∈
/ I1 . We also see
this situation by the restriction of L to I1 . It has two disconnected components
implying our result.
ii) I2 is an interval.
iii) I3 is not an interval because although d, e ∈ I3 , there exists no elements {ki }ni=0
of I3 making second condition satisfied.
As in 1-parameter, intervals are one of our main interests. However, intervals are more
complicated in multiparameter case and to obtain particular results, we sometimes
consider their subsets such as rectangles:
Definition 33. [8] Let Top be the category of topological spaces and P be a poset.
Then, a P -indexed filtration is a functor F : P → Top such that Fx ⊆ Fy for all
x ≤ y ∈ P where Fx := F (x). If we take P = T1 × T2 × . . . × Tn where each Ti
is a totally ordered set for all i ∈ {0, 1, . . . , n}, then we call F a multiparameter or
n-parameter filtration.
.. .. ..
. . .
33
Figure 3.3: Degree-Rips bifiltration where degree parameter is 4, 3, 1 from left to
right, [8].
It’s time to see a few constructions for bifiltrations where number of parameters is 2,
and (X, dX ) is a finite metric space.
Example 3.1.2. [8] Let us give the degree-Rips bilfiltration, DRips(X). We have
two parameters: ϵ is a scale paramater, and d is a degree parameter. DRips(X)d,ϵ is a
maximal subcomplex of Rips(X)ϵ only including vertices have degree at least d − 1
in the 1-skeleton of Rips(X)ϵ . For an illustration, see Figure 3.3.
where a ∈ R is the parameter changing the data set of points with the inverse image
of f , and ϵ ∈ [0, ∞) is the scale parameter of the Vietoris-Rips complex. In addition,
Simp shows the category of simplicial complexes.
34
The crucial part of the above filtrations is the choice of f . For instance, we can choose
f as fr : X → R defined by
where r > 0 is fixed. Moreover, fr is a density function, and in general, the superlevel-
Rips bifiltration obtained from a density function is called a density-Rips bifiltration.
Remark 3.1.1. Choosing a different r > 0 in the construction of fr can change func-
tion values. This dependency on r can lead to deficiencies in the usage of the density-
Rips bifiltration. On the other hand, the degree-Rips bifiltration does not have such
a dependency. In addition, we can define the degree-Rips bifiltration by using fr ,
but this time r is not fixed, and it equals to the scale parameter, ϵ, of Vietoris-Rips
complex.
where |X| is the number of the elements of X. Thus, f gives the average distance
of x to the points of X. It is said to be an eccentricity function, and we call the
superlevel-Rips bifiltration obtained from f as the eccentricity-Rips bifiltration.
Instead of using sublevel or superlevel filtrations, we can take the inverse image of
the function on finite intervals. By this way, we can define a trifiltration. To do that,
start with the set I = {(a, b) | a ≤ b} where a, b ∈ R. So, I can be seen as a poset
with the poset relation induced by Rop × R.
Example 3.1.4. [8] Let f : X → R be a function. The interval-Rips trifiltration
Rips(f )(a,b),ϵ : I × [0, ∞) → Simp is defined as
Example 3.1.5. [23] We have another bifiltration method called the multi-cover bi-
filtration defined as follows: Let X ⊆ Rn be finite, and Cov(X) : Nop × R → Top
be
35
Figure 3.4: Multi-cover bifiltration for the fixed values of k = 2, 3, [23].
Before our last example for a multiparameter filtration, let us define the barycentric
subdivision of a simplicial complex K. Then its barycentric subdivision, denoted
by bary(K) is an abstract simplicial complex whose (k − 1)-simplices are of the
form {σ1 , σ2 , . . . , σk } where each σi is a simplex of K such that σi ⊂ σi+1 for all
i ∈ {1, 2, . . . , k − 1}. Observe that we are taking simplices of K as vertices of
bary(K).
Definition 34. [8] Let Vect be the category of k-vector spaces where morphisms
correspond to linear maps. For (P, ≤) a poset, a P -indexed persistence module is a
functor M : P → Vect. Define Mi := M (i). For i ≤ j, φM
i,j denotes the linear map
36
set for all i ∈ {0, 1, . . . , n}, then we call M a multiparameter or an n-parameter
persistence module. Furthermore, they form a category shown by n-mod where
morphisms are taken as natural transformations.
Example 3.2.1. If we apply homology functor to the filtration obtained in the Example
3.1.1, then we obtain the persistence module M : N × N → Vect with the following
commutative diagram:
.. .. ..
. . .
37
These maps give the structure of a graded module:
M∼
M
= Mi over k[x1 , . . . , xn ].
i∈T
Theorem 3.2.1. [9] The category of persistence modules over k of finite type indexed
by T and the category of finitely generated T -graded modules over k[x1 , . . . , xn ] are
equivalent categories.
The next step is to decide how to decompose our persistence module. We cannot
use the Structure Theorem due to the reason we have mentioned above. On the other
hand, we can still decompose p.f.d. persistence modules:
Theorem 3.2.2. [8, Theorem 4.2] For a poset P and a p.f.d. n-parameter persistence
module M : P → Vect,
M∼
M
= M λ.
λ∈Λ
38
(ii) Suppose that M is separated into indecomposables in two different ways:
M∼ Mλ ∼
M M
= = Mγ.
λ∈Λ γ∈Γ
Remark 3.2.1. The proof of Theorem 3.2.2 has huge background, but Botnan and
Crawley-Boevey proved the first part of the theorem in their article [7] in the recent
past. The second part is coming from the Azumaya–Krull–Remak–Schmidt Theorem,
[1].
In n-parameter case where n ≥ 2, the main problem is that we do not know the
exact list of isomorphism classes of indecomposables, and this makes everything very
complex. In 1-parameter case, those indecomposables are just interval modules and
the set of intervals is called the barcode.
Moreover, the related question is: If we define the barcode on multiparameter p.f.d.
persistence modules as a set of indecomposable components of them obtained by
Theorem 3.2.2 (i), does it fulfill all the properties satisfying by the barcode of a 1-
parameter persistence module?
for all i, j ∈ P .
Observe that every barcode coming from p.f.d. 1-parameter persistence modules is
automatically a good barcode. However, if we start with a p.f.d. n-parameter persis-
tence module M , then there is not always a good barcode of M . Let us see this by
the following example.
Example 3.2.2. [8] Start with the bifiltration shown in Figure 3.5. If we apply
H1 (−; Z2 ), then the corresponding persistence module M can be seen by the follow-
ing diagram:
39
Figure 3.5: Bifiltration indexed by P = {0, 1, 2} × {0, 1, 2}, [8].
id
k k 0
id (1 0)
(1 0)T (1 1)
k k2 k
(0 1)T id
id
0 k k
Clearly, this persistence module is p.f.d. and let us show that it cannot have a good
barcode.
ii) rank (M(0,1) → M(1,2) ) = 1, so there is J ∈ B such that (0, 1), (1, 2) ∈ J.
iii) rank (M(1,0) → M(2,1) ) = 1, so there is K ∈ B such that (1, 0), (2, 1) ∈ K.
rank(M(1,0) → M(1,2) ) = 0.
So, this gives a contradiction. Therefore, M does not have a good barcode.
40
on indecomposables without knowing what they are exactly does not allow us to go
further. Thus, unlike 1-parameter case, the barcode is not a complete invariant among
multiparameter p.f.d. persistence modules, for details see [9].
M∼
M
= kI .
I∈B(M )
Now, let us see an important kind of persistence modules. For P be a poset and
a ∈ P , Qa is a persistence module given by
k if a ≤ x,
a
Qx =
0 otherwise.
idk if a ≤ x,
Qax,y =
0
otherwise.
Example 3.2.3. Consider Q1,0 where all maps between k’s are identity and P = N2 .
41
.. .. ..
. . .
0 k k ...
0 k k ...
0 k k ...
Definition 38. [22] A persistence module F is called free if there exists a multiset A
of elements of P such that
F ∼
M
= Qa .
a∈A
From [23, Proposition 6.40], we can see that F0 and F1 are generated by the set of
generators, and the set of elements generating all relations of M , respectively. Thus,
it is enough to show these sets have finite cardinality to call a persistence module
finitely presented.
42
CHAPTER 4
We have seen that the barcode does not have an extension to p.f.d. multiparameter
persistence modules. Instead, Carlsson and Zomorodian proposed the rank invariant
for multiparameter setting [9].
for every i ≤ j ∈ Rn .
In the literature, the rank invariant is also called persistent Betti numbers, which are
already defined for the 1-paramater case in the Definition 6.
The rank invariant draws the attention because it is easy to compute and equivalent
to the barcode in 1-parameter case implying that the rank invariant is complete in
1-parameter setting. However, its extension to multiparameter case is no more a com-
plete invariant, and this can be seen by the following example:
.. .. .. ..
. . . .
id id2 id id2
(1 0)T id 2 (1 0)T id2
k k2 ... k k2 ...
(0 1)T (1 0)T
id id
0 k ... 0 k ...
M N
43
While M is the free module with two generators at (1, 0) and (0, 1), that is,
M∼
= Q1,0 ⊕ Q0,1 ,
N has another generator at (1, 1), and the homology classes of N at (1, 0) and (0, 1)
generates a single class at (1, 1) as a result of the relation ⟨(1, 0) − (0, 1)⟩.
are 2 and 1, respectively. Therefore, M and N are not isomorphic to each other. Alter-
natively, if we write M and N as a direct sum of indecomposables, then we can see
that there is no bijection giving isomorphisms between corresponding components.
Hence, they are not isomorphic by Theorem 3.2.2. On the other hand, ranks of all
linear maps, φM N
i,j and φi,j for i ≤ j, are equal to each other. This implies that although
M and N are not isomorphic, they have the same rank invariant. Therefore, the rank
invariant is not complete in multiparameter persistent homology.
All in all, prominent candidates, namely the barcode and the rank invariant, are not
complete. Furthermore, there is no known complete invariant for multiparameter
persistent homology. Therefore, the duty of understanding whether two persistence
modules are isomorphic or not belongs to distances. We extend the familiar ones to
n-parameter case, and define another one: the matching distance.
In this subsection, we extend the definition of the interleaving distance to the multipa-
rameter persistent homology. However, we first give the definition of an ϵ-interleaving
n
between two functors F, G : Rn → C where C is an arbitrary category. Let C R de-
note the category of functors from Rn to C.
n n
(−)(⃗ϵ) : C R → C R
44
by
F (⃗ϵ)
F (⃗ϵ)i = Fi+⃗ϵ and φi,j = φFi+⃗ϵ,j+⃗ϵ
Definition 41. [8] For ϵ ∈ [0, ∞), two functors F, G : Rn → C are said to be
ϵ-interleaved if there exist morphisms
ϕ : F → G(⃗ϵ) ψ : F → G(⃗ϵ)
satisfying
φF
i,i+2⃗
ϵ
Fi Fi+2⃗ϵ Fi+⃗ϵ
ϕi ψi+⃗ϵ ψi ϕi+⃗ϵ
φG
i,i+2⃗
ϵ
Gi+⃗ϵ Gi Gi+2⃗ϵ
Definition 42. [8] Let F and G be two functors from Rn to C. We define their
interleaving distance dI (F, G) as
Like in 1-parameter case, two multiparameter persistence modules are said to be iso-
morphic persistence modules if and only if they are 0-interleaved. Moreover, if two
n-parameter persistence modules are ϵ-interleaved, then they are also δ-interleaved
for all δ > ϵ. The proof of it is just the extension of the proof of Proposition 2.6.2.
Recall that, for 1-parameter case, we state that both the bottleneck distance and the
interleaving distance are stable.
45
For multiparameter case, let us first give a general definition of what we mean by a
stable distance: Let X be any topological space with two functions f, g : X → Rn
defined on it. We can obtain persistence modules M f and M g via maps f and g as
homology of sublevel set filtrations S(f ) and S(g) respectively, that is,
where j ∈ Rn and i ∈ N is fixed. In addition, consider Rn with the max norm given
as
||r||∞ = max |ri |
i
Proof. Let sup∥f (x) − g(x)∥∞ = ϵ. Then, we have S(f )j ⊆ S(g)j+⃗ϵ and S(g)j ⊆
x∈X
S(f )j+⃗ϵ. These inclusion maps give an ϵ-interleaving between S(f ) and S(g). Ap-
plying homology functor to these filtrations brings out an ϵ-interleaving between
Hi (S(f )) and Hi (S(g)) which follows from the naturality of the homology functor.
Therefore, M f and M g are ϵ-interleaved. Hence,
46
Definition 44. [22] For i ≥ 0, a pseudometric d on n-mod is called i-universal if
it is i-stable and for any i-stable pseudometric d′ on n-mod, we have d′ (M, N ) ≤
d(M, N ) for all n-parameter persistence modules M and N obtained from i-th ho-
mology of sublevel set filtrations.
Now, suppose we have two n-parameter persistence modules M and N . Recall that to
define the interleaving distance between M and N , these modules are not necessarily
coming from two filtering functions defined on a single topological space. So, if we
want to define stability for a more general setup, that is, for arbitrary persistence mod-
ules, we need a result guaranteeing the existence of such functions and a topological
space. The following theorem solves this problem when the modules are over a prime
field, that is, Q or Zp where p is prime:
Proposition 4.1.2. [22, Proposition 5.8] Let k be a prime field, and M and N be
ϵ-interleaved n-parameter persistence modules whose underlying field is k. Then, for
any i ≥ 1, there exist a CW-complex X and continuous functions f, g : X → Rn
such that
M∼
= Hi (S(f )), N∼
= Hi (S(g)), sup∥f (x) − g(x)∥∞ = ϵ.
x∈X
Therefore, a persistence module over a prime field can be obtained from the i-th
homology of a sublevel set filtration up to isomorphism for all i ≥ 1. This implies
that we can write the statement of Definition 44 for arbitrary persistence modules over
a prime field except for the definiton of 0-universal.
Theorem 4.1.3. [22, Theorem 5.5] For a prime field k and i ≥ 1, dI is i-universal.
47
any i-stable pseudometric d on n-mod where i ≥ 1, then by Definition 43,
Thus, we have
d(M, N ) ≤ ϵ = dI (M, N ).
However, universality of dI over a non-prime field have not showed yet. Proving or
disproving it is an open problem.
Being ϵ-interleaved and having the interleaving distance as ϵ are different. We can-
not guarantee that two persistence modules are ϵ-interleaved when the interleaving
distance between them is ϵ. In fact, for finitely presented ones, they are equivalent:
Theorem 4.1.5. (The Closure Theorem) [22, Theorem 6.1] If M and N are two
finitely presented Rn -indexed persistence modules and dI (M, N ) = ϵ, then M and
N are ϵ-interleaved.
48
4.2 The Bottleneck Distance
M∼ kI and N ∼
M M
= = kJ .
I∈C J∈D
′ ′ ′
We define a matching between C and D as a bijection σ : C → D where C ⊆ C and
′
D ⊆ D. Moreover, if we do not see an element of C or D in the bijection σ, then the
element is unmatched.
Definition 45. [4] With the above construction, we say a matching is an ϵ-matching
if it satisfies the following conditions:
′
i) for all I ∈ C , kI and kσ(I) are ϵ-interleaved.
Observe that if we have an ϵ-matching, then all pairs in the above construction are
ϵ-interleaved with each other. So, they are all δ-interleaved for all δ > ϵ. This implies
that we have a δ-matching as well.
49
Note that this definition of the bottleneck distance for multiparameter persistence
modules is consistent with the 1-parameter case. To see this, let us first suppose that
elements of the barcodes that we consider are of the form [a, b) for some a, b ∈ R.
For an arbitrary persistence module M , we define M ′ so that
dB (M, N ) = dB (M ′ , N ′ ).
This implies that the consistency argument can be proved by working on the barcodes
we have supposed, that is, it does not matter whether we include the endpoints or not.
It is enough to show that the cost of a matching between C and D is smaller than or
equal to ϵ if and only if there exists an ϵ-matching between C and D as in Definition
45. First, start with a matching X between C and D such that c(X ) ≤ ϵ. So, there
′ ′ ′ ′ ′
exists a bijection σ : C → D where C ⊆ C and D ⊆ D. Let I ∈ C and σ(I) = J.
Thus, c(I, J) ≤ ϵ. From the Proposition 2.6.4, we have
Conversely, let us start with an ϵ-matching. Say I = [a, b) and J = [c, d) are matched
with each other. Consider them as two interval modules kI and kJ . Then, they are
ϵ-interleaved. To create the desired matching, we examine some cases. As a first one,
if both intervals does not include any subinterval of the form [t, t + 2ϵ] for an arbitrary
t ∈ R, then we can classify them as unmatched, and obtain a matching with cost at
most ϵ.
50
In any other case, one of the intervals include such a subinterval. We claim that when
they match with each other, we obtain c(I, J) ≤ ϵ. Assume that
Our aim is to get a contradiction by using the ϵ-interleaving between kI and kJ . Let
us suppose that c(I, J) = |c − a| > ϵ and without loss of generality, c > a. So,
c > a + ϵ. Let [t, t + 2ϵ] ⊆ [a, b) for some t ∈ R. Then, we have the following
non-commutative diagram
(kJ )a+ϵ = 0
and this contradicts with the ϵ-interleaving between modules. Thus, c(I, J) ≤ ϵ.
Let [t, t + 2ϵ] ̸⊆ [a, b) for any t ∈ R and [s, s + 2ϵ] ⊆ [c, d) for some s ∈ R. Then, the
following diagram does not commute
(kI )c+ϵ = 0
In addition, say L = [a, b) is matched with the zero module. Then, kL is ϵ-interleaved
b−a
with the zero persistence module. So, 2
≤ ϵ, and this implies that c(L) ≤ ϵ. If
we combine all of this, and write in a single matching X , then c(X ) ≤ ϵ. So, both
constructions for the bottleneck distance in 1-parameter are equivalent to each other.
Moreover, if we have an ϵ-matching between the barcodes C and D, then we can con-
struct an ϵ-interleaving between the corresponding interval decomposable persistence
modules. Therefore, we always have the following statement:
51
To see this, start with an ϵ-matching. Let
M∼ kI and N ∼
M M
= = kJ .
I∈C J∈D
If kI and kJ are matched with each other, then they are ϵ-interleaved with morphisms
ϕI : kI → kJ (ϵ) and ψI : kJ → kI (ϵ). Furthermore, if kL is matched with the zero
module for L ∈ C \C ′ (resp. D\D′ ), then there exist two morphisms ϕL : kL → O and
ψL : O → kL (ϵ) (resp. ϕL : O → kL (ϵ) and ψL : kL → O) giving an ϵ-interleaving.
In this case, we just add the zero module to the direct sum of N (resp. M ). So, this
makes no change in persistence modules up to isomorphism. Also, corresponding C
and D become the same in cardinality after adding new elements with empty support
coming from the zero modules. Then, we can write a bijection β : C → D so that kI
and kJ := kβ(I) are ϵ-interleaved.
where i ∈ Rn . Since each ϕI and ψI are morphisms making all diagrams in Definition
41 commute, ϕ and ψ give an ϵ-interleaving between M and N .
We have seen that in the 1-parameter case, dI = dB via Theorem 2.6.5. So, we
can directly ask that are they also equal to each other in the multiparameter setting?
Actually, we have already showed dI (M, N ) ≤ dB (C, D) by the explanation given in
the previous paragraph. So, it is all about whether the converse is true or not. The
answer is no, and it can be seen by the following counterexample given by Bjerkevik,
[4, Example 5.1]:
52
3
I1
2
J
1
I3
1 2 3
I2
Figure 4.1: Intervals where I1 and I2 are outer light purple squares, I3 is the inner
purple square and J is the red one, [4].
So, C = B(M ) = {I1 , I2 , I3 } and D = B(N ) = {J}. Consider all possible match-
ings between barcodes.
In any case, at least one of I1 or I2 is unmatched. So, they are matched with the zero
module. Without loss of generality, consider the interleavings between kI1 and O.
The morphisms
ϕ : kI1 → O(⃗ϵ) ψ : O → kI1 (⃗ϵ)
are the zero maps where ⃗ϵ = (ϵ, ϵ). Then, we just need to examine triangle diagrams.
Take ϵ = 2, then the internal morphisms of kI1 :
kI1
φp,p+(4,4) : k(I1 )p → k(I1 )p+(4,4)
53
is just the zero map since at least one of k(I1 )p and k(I1 )p+(4,4) is the zero vector space
for I1 = (−3, 1) × (−1, 3). So, we have one of the following cases:
In any case, our diagram commutes. Since we are taking ϕp = 0 = ψp for all p ∈ R2 ,
the other triangle diagram also commutes. Thus, kI1 and O are 2-interleaved.
Claim 1: kI1 and O are not (2 − δ)-interleaved for all δ ∈ R such that 0 < δ ≤ 2.
Thus, p + 2⃗ϵ ∈ I1 ⇒ (kI1 )p+2⃗ϵ = k. So, we have the following triangle diagram:
φp,p+2⃗ϵ=id
k k
ϕp =0 ψp+⃗ϵ=0
0
This diagram does not commute since the horizontal map is the identity map, but
composition of other two maps is just the zero map. This shows that our claim is true.
Thus, kI1 and O are ϵ-interleaved only for all ϵ ≥ 2. The same is true for kI2 . Since
all of four matchings include I1 or I2 as unmatched, they cannot be an ϵ-matching for
any ϵ < 2.
Furthermore, with a similar argument, we can see that kI3 is 1-interleaved with the
zero persistence module.
Now, let us try to find interleavings between kIi and kJ where i ∈ {1, 2, 3}.
In any case, if we take ϕ : kIi → kJ (2) and ψ : kJ → kIi (2) as zero morphisms, then
they are trivial natural transformations. For triangle diagrams, since composition of
54
above morphisms is zero, we just need to show horizontal maps are always zero. This
is true since
φp,p+2(2,2) = φp,p+(4,4) : kKp → kKp+(4,4) = 0
for all K ∈ {I1 , I2 , I3 , J}. Therefore, all triangle diagrams also commutes.
Hence, kIi and kJ are 2-interleaved for all i ∈ {1, 2, 3}. So, all of 4 matchings be-
tween barcodes are 2-matching. Since we showed that they cannot be an ϵ-matching
for ϵ < 2, dB (C, D) = 2.
Claim 3: None of kIi for i ∈ {1, 2, 3} is not (1 − δ)-interleaved with kJ for any
0 < δ ≤ 1.
(kJ )p+(−−−
→
1−δ)
id
k k
0 0
0
and it does not commute. Hence, kI1 is not (1 − δ)-interleaved with kJ for any
0 < δ ≤ 1. The same situation for kI2 and kI3 can be seen in a similar way.
55
If i ∈ I1 and i + (1, 1) ∈ J, then let ϕIi 1 = id where i ∈ R2 . If at least one of i or
i + (1, 1) is not an element of the corresponding interval, then let ϕIi 1 be the zero map.
Similarly, if j ∈ J and j + (1, 1) ∈ I1 , then let ψjI1 = id where j ∈ R2 . If at least one
of j or j + (1, 1) is not an element of the corresponding interval, then let ψjI1 be the
zero map.
I I
ϕi 1 1
ψi+(1,1)
(kJ )i+(1,1)
As a first case, let i, i + (2, 2) ∈ I1 , i + (1, 1) ∈ J. So, our diagonal maps are in the
form: k → k defined as the identity map. So, the corresponding diagram:
id
k k
id id
k
commutes.
id
k k
0 0
0
In addition, we can define morphisms ϕI2 , ψ I2 for kI2 and ϕI3 , ψ I3 for kI3 in a similar
56
way. It can be shown
dI (kI2 , kJ ) = 1 = dI (kI3 , kJ )
showing M and N are 1-interleaved. As a first idea, we can try to define ϕ by using
the following diagram:
kI1 kJ (1)
ϕI1
Mp Mp+(2,2) 0⊕id ⊕0
0⊕k⊕0 0⊕k⊕0
=⇒
ϕp ψp+(1,1)
Np+(1,1) k
However, π I1 = 0 since p ∈
/ I1 ⇒ (kI1 )p = 0. So, ϕ = ϕI1 ◦ π I1 = ϕI1 ◦ 0 = 0. Thus,
the diagram cannot commute. Hence, this way of defining ϕ does not make M and N
1-interleaved.
Now, try another way. In any situation, we have one of the followings:
i) p ∈ I1 ∩ I2 :
ii) p ∈ I1 \ I2 :
So, the function ϕ on the cases i) and ii) is defined in the same way.
57
iii) p ∈ I2 \ I1 :
iv) p ∈
/ I1 ∪ I2 :
Define ϕp = 0.
With the above construction of ϕ : M → N (1) and ψ : N → M (1), all diagrams are
commutative. Thus, M and N are 1-interleaved.
Let us assume there exists such an interleaving. So, we have following morphisms
ϕ : M → N (ϵ) and ψ : N → M (ϵ). Moreover, we can define ϕIi and ψ Ii as functions
commuting following diagrams:
ψ
kIi kJ (kI1 ⊕ kI2 ⊕ kI3 )(ϵ)
ϕIi
f Ii π Ii
ψ Ii
kI1 ⊕ kI2 ⊕ kI3 ϕ
kJ (ϵ) kIi (ϵ)
However, ϕIi and ψ Ii show that kIi and kJ are ϵ-interleaved for some ϵ < 1 by the
following arguments:
ϕIi and ψ Ii are well-defined morphisms. This implies that parallelogram diagrams
are commutative. Moreover,
I Ii
fp i φp,p+2⃗ϵ πp+2⃗
ϵ
(kIi )p Mp Mp+2⃗ϵ (kIi )p+2⃗ϵ
ϕp ψp+⃗ϵ
I I
ϕpi i
ψp+⃗ϵ
(kJ )p+⃗ϵ
58
and
φp−⃗ϵ,p+⃗ϵ φp+⃗ϵ,p+3⃗ϵ
(kIi )p−⃗ϵ (kIi )p+⃗ϵ (kIi )p+3⃗ϵ
commutes, and this contradicts with Claim 3 whose correctness is already shown.
So, we get a contradiction. Thus, M and N are not ϵ-interleaved for ϵ < 1. Thus,
dI (M, N ) = 1. Hence, we obtained
dB (C, D) = 2 ̸= 1 = dI (M, N ).
So, the bottleneck distance and the interleaving distance are not equal to each other
in the multiparameter persistent homology.
Since dI ≤ dB , the bottleneck distance cannot be smaller than the interleaving dis-
tance. So, we can ask that can we bound the bottleneck distance from above by using
the interleaving distance? The answer comes from the following theorem for the case
of rectangle decomposable persistence modules:
Theorem 4.2.1. [4, Theorem 4.3] Let M and N be rectangle decomposable per-
sistence modules indexed by Rn . If M and N are ϵ-interleaved, then there exists a
(2n − 1)ϵ-matching between their barcodes, B(M ) and B(N ). Thus,
3
M
In our example, M = kIi and N = kJ are rectangle decomposable persistence
i=1
modules indexed by R2 , and they satisfy Theorem 4.2.1, but with respect to this ex-
ample, upper bound is not optimal. However, Bjerkevik show this bound is optimal
for n = 2, and give another example satisfying the bound: for details, see [4, Exam-
ple 5.2].
59
Lastly, both of these distances have negative sides in practice. The bottleneck distance
is not stable, so it is unsuitable to use, and as our previous example shows, even if
we work on interval decomposable bipersistence modules, finding the interleaving
distance between them is so hard. Therefore, we are searching for a stable distance
which is easier to compute compared to the interleaving distance.
Computing distances in multiparameter are more difficult than the case of 1-parameter.
As an idea, we can restrict n-parameter persistence modules to 1-parameter ones.
Therefore, we can use our knowledge and methods in 1-parameter such as taking ad-
vantage of easy computability of the bottleneck distance. Informally, we make this
restriction by intersecting n-parameter persistence modules with lines which we will
define in the next definition.
where r = (r1 , . . . , rn ) ∈ Rn .
(ML )i := Mu
and v = jm + b.
Remark 4.3.1. We do not count lines with direction vector having a negative coor-
dinate as admissible lines. The reason is that if we restrict n-parameter persistence
module to one of such a line, points of the intersection of the persistence module and
60
ML
the line are not comparable with each other. Thus, we cannot define linear maps, φi,j
where i ≤ j, of the persistence module.
Definition 48. [20] The matching distance dmatch on rank invariants of q-tame per-
sistence modules M and N is defined as follows:
Recall that if a direction vector of a line L has at least one coordinate as 0, then we
don’t call this line as admissible because mL = 0 for this line. So, when we multi-
ply the bottleneck distance with mL in the process of finding the matching distance,
numbers coming from there are just zero. Thus, they do not have any effect on the
computation.
Moreover, the size of the set of admissible lines causes problems in the computation
of the matching distance. So, we look for a smaller set where the result of dmatch
remains same. We observe that each line in the set of admissible lines is represented
by infinitely many different parametrizations. Hence, it is meaningful to try to pa-
rameterize the set of admissible lines uniquely in terms of the parameters m
⃗ and b,
provided that the computation gives the same number for different parametrizations
of the same line. To achieve this, we first restrict the set of admissible lines by taking
direction vectors m
⃗ as unit vectors with respect to the max norm. That is, we require
⃗ ∈ (0, 1]n because for different direction vectors, we can obtain the same line. As
m
an example, consider the following three lines with direction vectors
m
⃗ 1 = (1/2, . . . , 1/2), m
⃗ 2 = (1, . . . , 1), m
⃗ 3 = (2, . . . , 2)
and take b = 0 in each one. Then, they all give the same line.
61
This situation also happens for b ∈ Rn . For instance, let n = 2 and take three lines
with the same direction vector m
⃗ = (1, 1) and b = (0, 0), (1, 1), (2, 2), respectively.
Then, we have the same line for all of them. In addition, if we take
⃗ ∈ (0, 1]n is
After these restrictions, let Λ be the set of lines L : u = tm + b where m
a unit vector in Rn with respect to the max norm, and ni=1 bi = 0.
P
Pn
/ (0, 1]n and
Proof. Let L : u = tm + b be an admissible line where m ∈ i=1 bi ̸= 0.
Define
m
m′ :=
||m||∞
Observe that m′ ∈ (0, 1]n , ||m′ ||∞ = 1 and the line u = sm′ + b is same as L since
′ s m s
sm + b = .||m||∞ . +b= m + b.
||m||∞ ||m||∞ ||m||∞
So, we obtain direction vector of the line as we want. Now, consider the hyperplane
x1 + . . . + xn = 0 in Rn . Normal vector of this hyperplane is ⃗n = (1, . . . , 1) ∈ Rn
⃗ ′ ̸= 0. Thus, m
and ⃗n · m ⃗ ′ is not parallel to the hyperplane. In addition, dimensions
of the hyperplane (n − 1) and the line (1) are complement of n. This implies that the
hyperplane and the line u = sm′ + b intersect with each other. Therefore, there exists
b′ = (b′1 , . . . , b′n ) ∈ L such that ni=1 b′i = 0. Thus, we rewrite L as u = sm′ + b′ .
P
62
Therefore, L1 : tm + b and L2 : tm + d. Then, let t1 m + b = t2 m + d for some
t1 , t2 ∈ R. So, (t1 − t2 )m = d − b. If we consider each coordinate, we have
(t1 − t2 )mi = di − bi
n
X n
X
⇒ (t1 − t2 )mi = d i − bi
i=1 i=1
n
X n
X n
X
⇒ (t1 − t2 ) mi = di − bi
i=1 i=1 i=1
n
X
⇒ (t1 − t2 ) mi = 0.
i=1
Since we uniquely obtain lines in Λ, we take a line L only once in the computa-
tion of the bottleneck distance. However, we can ask whether the computation of
mL .dB (ML , NL ) is always same for different parametrizations of an admissible line
L. If we change constant b, it creates the same shifting in 1-parameter persistence
modules ML and NL , so the bottleneck distance does not change. Let us see this
situation in practice: Say
are two different parametrizations of the same line L. Assume that when we com-
pute the bottleneck distance between barcodes of ML1 and NL1 , the interval modules
[a1 , c1 ) and [b1 , d1 ) are matched with each other. Consider the corresponding interval
modules of [a1 , c1 ) and [b1 , d1 ) in the decompositions of ML2 and NL2 , respectively:
[a2 , c2 ) and [b2 , d2 ). So, we have
63
Similarly,
d 1 − c1 = d 2 − c2 .
Costs coming from the matchings of pairs are equal to each other:
In a similar way, if [a1 , c1 ) is matched with the zero module, then we obtain the
following relation with the corresponding interval module [a2 , c2 ):
c −a c 2 − a2
1 1
c1 − a1 = c2 − a2 ⇒ c k[a1 ,c1 ) = = = c k[a2 ,c2 ) .
2 2
Hence, costs are again equal to each other. This implies that dB gives the same num-
ber although we change the parametrization of the line with respect to b.
are two different parametrizations of the same line L. If [a, c) and [b, d) are matched
in some matching between ML1 and NL1 , then
ha c h b d
, and ,
k k k k
are matched with each other in the corresponding matching between ML2 and NL2 .
Thus, we have the following relation between costs:
c k[a,c) , k[b,d) = max{|b − a|, |d − c|}
n b−a d−c o
= k. max ,
k k
= k.c k[ ka , kc ) , k[ b , d )
k k
Similarly, if an interval module is matched with the zero module, its cost is divided
by k in the case of L2 . Therefore, the corresponding bottleneck distance is divided by
k, but mL is multiplied by k, so we have the same result for mL .dB . Thus, matching
distance does not be affected from different parametrizations of the same admissible
line.
Remark 4.3.2. As a result of above parts, the matching distance between M and N
can be found by only considering lines from Λ. From now on, L always varies in Λ
when we compute dmatch .
64
L
5
∼
= ⊕
1
1 5
M M1 M2
L
5
∼
= ⊕
1
1 5
N N1 N2
Example 4.3.1. [26] In Figure 4.2, we have two 2-parameter interval decomposable
persistence modules
M∼
= M1 ⊕ M2 and N ∼
= N1 ⊕ N2 .
We claim first that they are not isomorphic. To see this, assume otherwise. Then, by
Theorem 3.2.2 (ii), there is a bijection between index sets {1, 2} giving isomorphisms
between the corresponding modules. Thus, N1 is isomorphic to either M1 or M2 .
These are by definition just the identity maps from the field k to itself. However,
(M1 )(0,1) = 0 → (M2 )(1,1) = k and (M2 )(1,0) = 0 → (M2 )(1,1) = k are zero maps.
Hence, none of the modules M1 and M2 have both maps as identity. So, N1 is not
isomorphic to any of M1 and M2 . This gives a contradiction. Thus, M and N are not
isomorphic.
On the other hand, if we restrict M and N to lines L ∈ Λ, then it can be seen from
Figure 4.2 that corresponding 1-parameter persistence modules ML and NL are equal
to each other. Thus, we have
65
So, we have
dmatch (M, N ) = 0.
Moreover, we can also make an inference about the interleaving distance. To achieve
this, let us first observe that both of M and N are finitely presented. Let 1(i,j) show
the multiplicative identity of k at the grade (i, j) ∈ R2 .
M1 is generated by 1(1,0) , and the relations are generated by 1(1,5) and 1(5,0) . Hence,
taking F1 = Q(1,5) ⊕ Q(5,0) , and F0 = Q(1,0) in Definition 39 makes M1 finitely
presented. In the other ones, we just state the generators of F0 and F1 :
ii) For N1 , 1(1,0) , 1(0,1) generate F0 , and 1(0,5) , 1(5,0) , 1(1,1) generate F1 .
So, all of them are finitely presented. This implies that direct sums M and N are
also finitely presented. Now, let us assume that dI (M, N ) = 0. Then, since they are
also finitely presented, there exists a 0-interleaving between M and N by the Closure
Theorem. However, being 0-interleaved means that these modules are isomorphic to
each other, and we have observed that they are not. Therefore, our assumption is
wrong. Thus, dI (M, N ) > 0 = dmatch (M, N ).
66
In the next step, we want to understand whether the matching distance is stable or
not. We examine this obscurity with the help of the interleaving distance.
ϵ
Lemma 4.3.3. [20] If M and N are ϵ-interleaved, then ML and NL are mL
-
interleaved.
M (u) := Mu
ϕ(u + ⃗ϵ) := ϕu+⃗ϵ,
φM (u, u + 2⃗ϵ) := φM
u,u+2⃗ϵ.
Since M and N are ϵ-interleaved, there exist two morphisms ϕ : M → N (⃗ϵ) and
ψ : N → M (⃗ϵ) such that
and
ϕ(u + ⃗ϵ) ◦ ψ(u) = φN (u, u + 2⃗ϵ)
where u ∈ Rn , so that triangle diagrams commute. Our aim is to show the existence
of two morphisms ϕL : ML → NL mϵL and ψL : NL → ML mϵL such that
ϵ ML ϵ
ψL s + ◦ ϕL (s) = φ s, s + 2
mL mL
and
ϵ NL ϵ
ϕL s + ◦ ψL (s) = φ s, s + 2
mL mL
where s ∈ R.
Consider an arbitrary s ∈ R with points u(s) = sm + b, u′ (s) = s + ϵ
mL
m + b and
u′′ (s) = s + 2 mϵL m + b on L. Consider each coordinate:
67
since mL = min {mi }.
i=1,...,n
ϵ ϵ
Let us define ϕL : ML → NL mL
and ψL : NL → ML mL
by
ϕL (s) = ϕ(u′ − ⃗ϵ) ◦ φM (u, u′ − ⃗ϵ) and ψL (s) = ψ(u′ − ⃗ϵ) ◦ φN (u, u′ − ⃗ϵ)
Since u(s) ≤ u′ (s)−⃗ϵ for all s ∈ R, u ≤ u′ −⃗ϵ. Hence, linear maps φM (u, u′ −⃗ϵ) and
φN (u, u′ − ⃗ϵ) are well-defined. Therefore, ϕL and ψL are well-defined morphisms.
Notice that ψL (s) is defined from NL (s) = N (u) to ML s + mL = M (u′ ). Also,
ϵ
we have φM (u + ⃗ϵ, u′ ) ◦ ψ(u) from N (u) to M (u′ ) and this gives the following
equality:
ψL (s) = φM (u + ⃗ϵ, u′ ) ◦ ψ(u)
M (u + ⃗ϵ) M (u′ )
Consider
ϵ
ψL s + ◦ ϕL (s) = φM (u′ + ⃗ϵ, u′′ ) ◦ ψ(u′ ) ◦ ϕ(u′ − ⃗ϵ) ◦ φM (u, u′ − ⃗ϵ).
mL
Since φM (u′ − ⃗ϵ, u′ + ϵ) = ψ(u′ ) ◦ ϕ(u′ − ⃗ϵ),
ϵ
ψL s + ◦ ϕL (s) = φM (u′ + ⃗ϵ, u′′ ) ◦ φM (u′ − ⃗ϵ, u′ + ϵ) ◦ φM (u, u′ − ⃗ϵ)
mL
= φM (u, u′′ )
ML ϵ
=φ s, s + 2 .
mL
Similarly, ϕL (s) is defined from ML (s) = M (u) to NL s + ϵ
mL
= N (u′ ). So, we
can write ϕL (s) as
ϕL (s) = φN (u + ⃗ϵ, u′ ) ◦ ϕ(u).
Consider
ϵ
ϕL s + ◦ ψL (s) = φN (u′ + ⃗ϵ, u′′ ) ◦ ϕ(u′ ) ◦ ψ(u′ − ⃗ϵ) ◦ φN (u, u′ − ⃗ϵ).
mL
68
Since φN (u′ − ⃗ϵ, u′ + ϵ) = ϕ(u′ ) ◦ ψ(u′ − ⃗ϵ),
ϵ
ϕL s + ◦ ψL (s) = φN (u′ + ⃗ϵ, u′′ ) ◦ φN (u′ − ⃗ϵ, u′ + ϵ) ◦ φN (u, u′ − ⃗ϵ)
mL
= φN (u, u′′ )
ML ϵ
=φ s, s + 2 .
mL
ϵ
Thus, triangle diagrams commute. Therefore, ML and NL are mL
-interleaved.
Theorem 4.3.4. [20] For arbitrary p.f.d. persistence modules M and N , we have
Since M and N are pointwise finite dimensional, so are ML and NL . Thus, from the
Isometry Theorem in 1-parameter, dI (ML , NL ) = dB (B(ML ), B(NL )). So,
So, the matching distance between p.f.d. persistence modules is a lower bound for
a stable distance, namely the interleaving distance. Thus, it satisfies the comparison
in Definition 43 where M f and M g need to be p.f.d. Hence, the matching distance
defined between Rn indexed p.f.d. persistence modules is also stable.
We want to point out that computing the matching distance is not straightforward
since we need to take infinitely many lines into account. On the other hand, if we
know the interleaving distance between two persistence modules, finding the match-
ing distance can be easier in some cases.
Example 4.3.2. Recall Figure 4.1, where we have two persistence modules
69
We have already seen that dI (M, N ) = 1. To calculate the matching distance, we
consider the line L1 : u = t(1, 1) where t ∈ R. Then, mL1 = 1 and
In the last matching, all of the intervals are unmatched. Thus, its cost is
= dmatch(M,N )
≤ dI (M, N ) = 1.
In the above example, note first that the interleaving distance is an upper bound for
the matching distance since we start with p.f.d. persistence modules. Moreover, the
computation of the matching distance on the line L1 realizes this upper bound. Since
the matching distance is calculated as the supremum through all admissible lines,
we can immediately conclude that the matching distance between M and N is 1.
However, this method is impractical because calculating dI is much more difficult.
We may expect to catch the matching distance by considering only lines with slope
1, but the next example shows that, unlike the previous one, it is not sufficient to find
the matching distance.
70
Figure 4.3: 2-parameter persistence modules M and N with lines in R2 , [2].
where
On the other hand, if we consider the lines L : y = x + p where 0 < p < 4, they are
uniquely parametrized as
−p p
L : t(1, 1) + , , t∈R
2 2
so that L ∈ Λ. Let us consider barcodes of restrictions of M and N to such lines.
hp p h p p
B(ML ) = B((M1 )L ) ⊕ B((M2 )L ) = ⊕ 4 − ,7 +
,7 − ,
2 2 2 2
hp p h p p
B(NL ) = B((N1 )L ) ⊕ B((N2 )L ) = , 7 + ⊕ 4 − ,7 − .
2 2 2 2
In all of these lines, mL = 1. So, to analyze effects of these lines on the matching
distance, we only consider the corresponding bottleneck distances. Now, consider all
the matchings:
71
σ1 : (Mi )L is matched with (Ni )L for i = 1, 2. Then, c(σ1 ) = p.
σ2 : (M1 )L is matched with (N2 )L and (M2 )L is matched with (N1 )L . Then,
c(σ2 ) = 4 − p.
Remember that we take infimum of all matchings to find the bottleneck distance. If
p < 2, then minimum of costs of all the matchings is equal to p happening in σ1 . This
implies that dB is smaller than 2 for the case p < 2. If p > 2, then 4 − p coming
from σ2 is the minimum of costs of all matchings. Therefore, dB is also smaller than
2 for the case p > 2. Lastly, we consider all matchings when p = 2 and see that the
corresponding bottleneck distance is 2 which is the biggest one among them.
On the other hand, consider the line L′ passing through (0, 0) and (7, 11). Its equation
is y = 11
7
x. So, the parametrization of it making L′ an element of Λ is
7
L′ : t , 1 , t ∈ R.
11
72
Thus, we have
When the lines are in R2 , we work on the representations coming from Λ instead of
the equations of the form: y = mx + c where m, c ∈ R. The reason is that the first
way can be easily extended to n-parameter case where n ≥ 3.
73
74
REFERENCES
[3] U. Bauer and M. Lesnick. Induced matchings and the algebraic stability of per-
sistence barcodes. Journal of Computational Geometry, 6(2):162–191, 2015.
[5] Ethan D. Bloch. A First Course in Geometric Topology and Differential Geom-
etry. Birkhäuser Boston, MA, 1997.
[11] F. Chazal, D. Cohen-Steiner, M. Glisse, L.J. Guibas, and S.Y. Oudot. Proximity
of persistence modules and their diagrams. In Proceedings of the 25th Annual
Symposium on Computational Geometry, page 237–246, 2009.
75
[12] F. Chazal, V. de Silva, M. Glisse, and S. Oudot. The structure and stability of
persistence modules. Springer International Publishing, 2016.
[15] T. K. Dey and Y. Wang. Computational Topology for Data Analysis. Cambridge
University Press, 2022.
[17] R. Ghrist. Barcodes: The persistent topology of data. Bulletin of the American
Mathematical Society, 45:61–75, 2008.
[19] M. Kerber, M. Lesnick, and S. Oudot. Exact computation of the matching dis-
tance on 2-parameter persistence modules. Journal of Computational Geome-
try, 11(2):4–25, 2020.
[20] C. Landi. The rank invariant stability via interleavings. Springer International
Publishing, 2018.
[23] M. Lesnick. Lecture notes for amat 840: Multiparameter persistence. Univer-
sity at Albany, SUNY, 2023.
76
[25] N. Otter, Porter M.A., Tillmann U., Grindrod P., and Harrington H.A. A
roadmap for the computation of persistent homology. EPJ Data Science, 6(17),
2017.
77