An Econometric Study of Vine Copulas: Dominique Guegan, Pierre-André Maugis
An Econometric Study of Vine Copulas: Dominique Guegan, Pierre-André Maugis
2010.40
Maison des Sciences Économiques, 106-112 boulevard de L'Hôpital, 75647 Paris Cedex 13
http://ces.univ-paris1.fr/cesdp/CES-docs.htm
ISSN : 1955-611X
An Econometric Study Of Vine Copulas
Abstract
1 Introduction
For almost ten years now, copulas have been used in econometrics and finance.
They became an essential tool for pricing complex products, managing portfolios
and evaluating risks in banks and insurance companies. For instance, they can
be used to compute V aR (Value at Risk) and ES (Expected shortfall), Artzner
et al. (1997). Moreover, copulas appear to be a very flexible tool, allowing
for semi-parametric estimation, fast parameter optimisation and time varying
parameters. These advantages make them a very interesting tool, although one
major shortcoming is their use in high dimension. Indeed, elliptical copulas can
be expended to higher dimension, but they are unable to model for financial tail
dependences (Patton, 2009), and the Archimedean copulas are not satisfactory
∗ email: dominique.guegan@univ-paris1.fr.
† e-mail: pierre-andre.maugis@malix.univ-paris1.fr.
In this section, we introduce a new algorithm to build vine copulas. Our ap-
proach has the advantage of being able to coherently describe a large set of vine
copulas – N = n! n−3
2 ∏i=1
i! in dimension n – while also being a simple recursive
algorithm. Moreover the tree structure and the algorithm are fully recursive so
they can be easily expanded to any dimension.
2.1 Formula
Our objective is to compute c1,...,n , the copula density associated with the vector
X. This will be done by factorizing fX in the following form:
fX = ∏ fi ⋅ c1,...,n .
i=1,...n
At each step of the algorithm we associate a tree construction. The root of this
tree is the new copula density term: cα,β∣−(α,β) in expression (1), and the leaves
are the new (n − 1)-variates densities: f−α and f−β in (1). The tree associated
with expression (1) is:
cα,β∣−(α,β)
f−α f−β
This tree structure is also fully recursive. To each term f−α and f−β we could
apply (1), and produce trees. These trees would then be inserted inside the
previous tree replacing the two leaves f−α and f−β by the two new tree roots.
We explain this mechanism further in an example.
2.2 Example
• First step:
f1,2,3 .f1,2,4
f1,2,3,4 = .c3,4∣1,2 (2)
f1,2
c3,4∣1,2
f1,2,3 f1,2,4
1 This is the number of "vine" type graph with n nodes, which is also the number of vine
copulas (see Bedford and Cooke (2002, 2001)). The proof of the formula relies heavily on the
graph structure of vines.
2 Our algorithm can produce more varied decompositions, however we do not consider those
additional copulas as they are not efficient estimators, Bedford and Cooke (2002, 2001).
f1,2 f1,3
and
f1,2 .f2,4
f1,2,4 = .c1,4∣2 (4)
f2
c1,4∣2
f2,4 f1,2
• Third step: We merge formulas (2),(3) and (4), we simplify the f1,2 term
and we expand the bivariate densities using the formula: fα,β = fα .fβ .cα,β .
We also merge the trees and underline the simplified term in the tree.
f1,2,3,4 = f1 .f2 .f3 .f4 .c1,3 .c1,2 .c2,4 .c2,3∣1 .c1,4∣2 .c3,4∣1,2 (5)
c3,4∣1,2
c2,3∣1 c1,4∣2
f1 f2 f1 f3 f2 f4 f1 f2
We have now factorised the density f1,2,3,4 into a product of four univariate
densities and six bivariate copula densities. By construction, this means that
the copula density of (X1 , X2 , X3 , X4 ) can be factorised as follows:
c1,2,3,4 = c1,3 .c1,2 .c2,4 .c2,3∣1 .c1,4∣2 .c3,4∣1,2 .
The unwinding of this algorithm can yield other vine copulas if other parameters
are used. For instance the following formula is another possible vine copula.
c1,2,3,4 = c2,3 .c3,4 .c1,4 .c2,4∣3 .c1,3∣4 .c1,2∣3,4
And its associated tree:
c1,2∣3,4
c1,3∣4 c2,4∣3
f1 f4 f3 f4 f2 f3 f3 f4
̂ θ̂T ) = H −1 ⋅ C ⋅ H −1 ′ .
Cov(
2
∂2L
H = (Hi,j )i,j with Hi,j = ∣
∂θi ∂θj θ̂
and C = (Ci )i with Ci = ( ∂θ
∂L
i
) ∣ .
T θ̂T
Now we focus on the heterogeneity within the set of vine copulas in terms of
asymptotic variance.
In estimation theory when faced with the choice between two estimators we
privilege the estimator with the smallest variance. Indeed such an estimator will
require a smaller sample size T to obtain significant results. Thus we compare
the variance of two different vine copulas’ estimators.
Theorem 2 Under assumptions A1 , A2 and A3 . Let θ̂T and θ̂T′ be the esti-
mator associated with two different vine copulas, then:
The consequence of this theorem is that there is no evidence indicating that one
should favor one type of vine copula over another. This result has important
practical consequences that are discussed in Guégan and Maugis (2010).
The previous results confirm that we have a good estimators for vine copulas in
terms of rate of convergence and that all vine copulas have comparable variance,
thus their use is justified for applications. We now provide such an example.
Given the five main assets composing the CAC40, the French leading index,
using the methodology described above, we estimate their joint density in order
to compute the V aR5 of a portfolio composed of the five assets.
The dataset is taken from Datastream, daily quotes from Total, BNP - Paribas,
Sanofi - Synthelabo, GDF-Suez and France Telecom from 25/4/08 to 21/11/08.
This period is marked by the 2008 crisis, and our purpose is to test the resilience
of our model to this shock and change of regimes. We estimate the parameters
and select the vine copula based on the data ranging from 25/4/08 to 9/12/08,
while the V aR is computed from the remaining dates. The portfolio we consider
is as follows: Total (33%), BNP - Paribas (20%), Sanofi - Synthelabo (20%),
GDF-Suez (14%) and France Telecom (13%).
For each dataset a GARCH(p, q) process is selected using the AIC criterion
(Akaike, 1974) and estimated using pseudo likelihood. On the residuals we esti-
mated the vine copula parameters using maximum likelihood. The parametric
copula families used in this exercise are chosen among a panel of copulas which
take into account most features commonly found in financial time series (Patton,
2009): they are the Clayton, Gumbel, Student and Gaussian copulas.
We select the best vine copula using the methodology described in Guégan
and Maugis (2010) with the second test described in Chen et al. (2004). We
recall that this test associates to each estimated density a χ2 sample. In this
application, the retained vine copula is6 :
In our example the vine copula-V aR has a p-value of 0.96 for the Q statistic, so
it is accepted as a true V aR, while the GARCH-V aR has a p-value of 0.00 and
is rejected according to this test7 . Nevertheless, the vine copula approach fails
to predict the major drop during the crisis, but the prediction remains solid
before and after the crisis. These results make the vine copula methodology we
described an interesting approach for risk management in order to estimate the
multivariate (n > 2) density of a portfolio and to compute its associated V aR.
44
42
40
Price
38
36
34
32
0 5 10 15 20 25 30 35 40 45 50
Time
This paper focuses on the building of vines copulas using a tree-based algorithm.
We provide the asymptotic normality of the vine copula parameter estimate
under regular conditions. We show that, in the case of two competing vine
copulas, no vine copula is better than another one in terms of variance criteria.
This work provides solid statistical ground for the ideas developed in Aas et al.
(2009) and Berg and Aas (2010) and justifies the methodologies used in Czado
et al. (2009), Fischer et al. (2007), Chollete et al. (2008) and Guégan and Maugis
(2010). Moreover, proving that no vine copula or sub-family of vine copula
yields better estimator than others, justifies the use of all N vine copulas which
– as shown in Guégan and Maugis (2010) – enhances vine copulas capacity to
represent more varied distributions. Finally, an application shows vine copulas
usefulness to estimate V aR and provides new and interesting risk management
strategies for managers working with high dimensional portfolios.
Our results opens the possibility for varied uses of vines copulas. Most inter-
estingly, the use of conditional copulas as described in Patton (2006), permits
to relax the conditional independence hypothesis common throughout the vine
copulas literature. This allows for very varied and interesting applications in all
fields of economics and risk management.
6 Acknowledgment
This work has already been presented at the MIT Econometric Seminar (Cam-
bridge, USA, 2009) and at the International Symposium in Computational Eco-
nomics and Finance (Sousse, Tunisia, 2010). We thank the participants of the
seminars and our reviewers for their helpful comments. All remaining errors are
still ours. P.A. Maugis thank Arun Chandrasekhar, Victor Chernozhukov and
Miriam Sofronia for their helpful comments.
References
Aas, K., Czado, C., Frigessi, A., Bakken, H., 2009. Pair-copula constructions of
multiple dependence. Insurance: Mathematics and Economics 44, 182–198.
Akaike, H., 1974. A new look at the statistical model identification. IEEE Trans.
Automatic Control AC-19, 716–723.
Artzner, P., Delboen, F., Eber, J., Heath, D., 1997. Thinking coherency. Risk
10, 68–71.
10
Berg, D., Aas, K., 2010. Models for construction of multivariate dependence.
Forthcoming in The Europen Journal of Finance.
Chen, X., Fan, Y., Patton, A., 2004. Simple Tests for Models of Dependence
Between Multiple Financial Time Series, with Applications to U.S. Equity
Returns and Exchange Rates. London Economics Financial Markets Group
Working Paper 483, london, UK.
Chollete, L., Andrz̆as, H., Valdesogo, A., 2008. Modeling international financial
returns with a multivariate regime switching copula. Journal of Financial
Econometrics 7, 437–480.
Cooke, R. (Ed.), 1997. Markov and entropy properties of tree- and vine-
dependent variables. American Statistical Association Section on Bayesian
Statistical Science, Alexandria, VA.
Czado, C., Gartner, F., Min, A., 2009. Analysis of Australian electricity loads us-
ing joint Bayesian inference of D-Vines with autoregressive margins. Zentrum
Mathematik Technische Universitat Munchen: Working Paper Munchen, Ger-
many.
Fischer, M., Köck, C., Schluẗer, S., Weigert, F., 2007. Multivariate copula mod-
els at work: Outperforming the desert island copula? Tech. Rep. 79, Univer-
sität Erlangen-Nur̈nberg, Lehrstuhl fur̈ Statistik und Ok̈onometrie.
Guégan, D., Maugis, P. A., 2010. Prospects On Vines. Forthcoming in Insurance
Markets and Companies: Analyses and Actuarial Computations.
Joe, H., 1997. Multivariate Models and Dependence Concepts. Chapman & Hall,
London, UK.
Kupiec, P., 1995. Techniques for verifying the accuracy of risk measurement
models. Board of Governors of the Federal Reserve System (U.S.), Washington
24 (95).
Napoles, O. M. (Ed.), 2007. Number Of Vines. 1st Vine Copula Workshop,
Delft, Netherland.
11
7 Annex
cs = ci,j∣k1 ,...,kp (F (Xi ∣(Xl )l=k1 ,...,kp ), F (Xj ∣(Xl )l=k1 ,...,kp )).
For instance {2, 3∣1} is the index of c2,3∣1 (F (x2 ∣x1 ), F (x3 ∣x1 )).
12
We now construct the likelihood functions used to estimate the parameters. For
M ∈ Mn we define the likelihood functions {ψsM T }s∈M as:
Then:
θ0M = 0
[θM (i) ]i∈⟦1,∣M ∣⟧
13
To prove the convergence we make some assumptions that are verified for a vast
majority of parametric copula families:
These assumptions are weaker than assumptions A1 and A2 and are implied by
A1 and A2 .
We introduce the exponent 0 to specify the function are evaluated at θ0M , and
denote:
ϕ0i,j∣k = ϕ(Fi∣k
M
(Xi ∣Xk ; {θ0M }i,k,{i,k} ), Fj∣k
M
(Xj ∣Xk ; {θ0M }j,k,{j,k} ); ξ(Xk ; θ0M i,j∣k )).
Also:
ϕi M,0 = ϕM (Xi ; θi0 ),M ϕi,j 0t = ϕM (F M (Xi ), F M (Xj ); θi0 , θj0 , θi,j
0
),
7.1.3 Operators
Indeed if s ∈ M ∖ m(M (i)) then ψsM T is not dependent in θM (i) so that the
gradient is null.
We introduce M
GradT as the gradient of LM :
M
GradT = ∇LM = (∇θM (i) LM )i∈⟦1,∣M ∣⟧
⎛ ⎞
= ∇θM (i) ∑ ψsM T
⎝ s∈m(M (i)) ⎠
i∈⟦1,∣M ∣⟧
14
⎛ ⎞
= ∇θM (i) ,θM (j) ∑ ψsM T
⎝ s∈m(M (i))∩m(M (j)) ⎠
i,j∈⟦1,∣M ∣⟧2
√ M
We denote M
CT0 the covariance matrix of T ⋅ GradT (θ0 ):
M M M
CT0 = (ET (( ∑ ∇θM (i) logϕ0s ) ( ∑ ∇θM (j) logϕ0s )))
s∈M s∈M i,j∈⟦1,∣M ∣⟧2
⎛ ⎛ M M ⎞⎞
= ET ∑ ∇θM (i) logϕ0s ∇θM (j) logϕ0s
⎝ ⎝s∈m(M (i))∩m(M (j)) ⎠⎠
i,j∈⟦1,∣M ∣⟧2
We now √ prove that the rate of convergence for the maximum likelihood estimate
θ̂TM is T for all models M :
Using the central limit theorem and the uniform bounds on ψ’s derivatives we
obtain the convergence in distribution (we can use these bounds because θ0 is
interior to Θ). The proof of Theorem 1 is complete.
15
m ∶ M → P(M )
c2,3∣1
c1,2 c1,3
f1 f2 f1 f3
Then the map m and the indexes in M (.) defined above are:
7.2.2 Proof
The key point in this analysis is the following: as soon as the ordering of the
model M is done according to the tree structure, and this structure is the same
for all models, then:
16
Definition
• ≃ is defined by:
∀A, B ∈ E 2 (R, ∣∣.∣∣) A ≃ B ⇔ ∣∣A − B∣∣ ≤ ⇔ ∃ν ∈ E, A = B + ν, ∣∣ν∣∣ ≤ .
(r) , ψM ′ (r) are equal if the same parametric pair copula is used throughout
M M
ψM
the estimation, which is the case in most applications (Aas et al. (2009), Berg
and Aas (2010) and Czado et al. (2009)).
Assumption A3
∃M ∈ Mn , s.t < ∣∣M Hess0T ∣∣/∣∣(ki,j )i,j ∣∣
Using the previous assumptions and the definition made above we can write
M
Hess0T as follows:
M ⎛ 0⎞
Hess0T = ∇θM (i) ,θM (j) ∑ ψsM T
⎝ s∈m(M (i))∩m(M (j)) ⎠
i,j∈⟦1,∣M ∣⟧2
⎛ 0⎞
= ∇θM (i) ,θM (j) ∑ ψsM T
⎝ s∈Ki,j ⎠
i,j∈⟦1,∣M ∣⟧2
⎛ ′ ⎞
= ∇θM ′ (i) ,θM ′ (j) ∑ M ψs 0T + (νi,j ki,j )i,j∈⟦1,∣M ∣⟧2
⎝ s∈Ki,j ⎠
i,j∈⟦1,∣M ∣⟧2
′
≃⋅∣∣(ki,j )i,j ∣∣ M Hess0T .
17
⎛ ⎛ M M ⎞⎞
M
CT0 = ET ∑ ∇θM (i) logϕ0s ∇θM (j) logϕ0s
⎝ ⎝s∈m(M (i))∩m(M (j)) ⎠⎠
i,j∈⟦1,∣M ∣⟧2
⎛ ⎛ M M ⎞⎞
= ET ∑ ∇θM (i) logϕ0s ∇θM (j) logϕ0s
⎝ ⎝s∈Ki,j ⎠⎠
i,j∈⟦1,∣M ∣⟧2
⎛ ⎛ ⎞⎞
≃⋅∣∣(ki,j )i,j ∣∣ ET ∑ ∇θM ′ (i) logϕ0s M ′ ∇θM ′ (j) logϕ0s M ′
⎝ ⎝s∈Ki,j ⎠⎠
i,j∈⟦1,∣M ∣⟧2
M′
≃⋅∣∣(ki,j )i,j ∣∣ CT0
′
′
̂M
Cov = (H + ν1 K)−1 (G + ν2 K)(H + ν1 K) −1
T
′
= (I + ν1 H −1 K)−1 H −1 (G + ν2 K)H −1 (I + ν1 KH −1 ) −1
∞ ∞
= (I + ∑(−1)i (ν1 H −1 K)i )(V + ν2 V0 )(I + ∑(−1)i (ν1 H −1 K)i )′
i=1 i=1
= (I + Σν1 )(V + ν2 V0 )(I + Σν1 )′
= V + ν2 V0 + 2Σν1 (V + ν2 V0 ) + Σν1 2 (V + ν2 V0 )
< α⋅
18