103 April 2004 Solution
103 April 2004 Solution
EXAMINATIONS
April 2004
EXAMINERS REPORT
                                                       Faculty of Actuaries
                                                      Institute of Actuaries
           Subject 103 (Stochastic Modelling)                                       April 2004          Examiners Report
                                                                                       2
              df        =        f        X t dt          X t dBt           1
                                                                            2
                                                                                f (        X t2 )dt
                                           1               1        2                   1
                        =            Xt             1.
                                                    2
                                                                        X t2 dt            X t dBt
                                           Xt             X t2                          Xt
                                      1        2
                        =             2
                                                   dt          dBt
                                                    1      2
              ln X t        ln X t0                 2
                                                                 t t0                 Bt       Bt0
                                                                        1       2
               Xt       x exp             Bt       Bt0                  2
                                                                                     t t0            as required.
                                                                                           1    2
                Let f (t , Bt )            x exp               Bt       Bt0                2
                                                                                                      t t0   . Then
                                                                             2
                             f                 1      2        f               f     2
                                                           f,         f,       2
                                                                                       f , so that
                             t                 2               B             B
                    1   2                                  1 2
    df t                      ft dt         f t dBt            f t dt   f t dt   ft dBt
                    2                                      2
    We need to verify that the initial condition holds for this solution. (It clearly does, but
    the check needs to be performed.)
    Considering that this stochastic differential equation is the most frequently used of all,
    this question was on average very poorly answered.
              For invertibility, we should check that the roots of 1 0.45 z 0.34 z 2 0 are
              outside the unit circle. They are ( 0.45 1.25)/( 2*0.34) = 2.5 or 1.18, both
              OK.
              Subject 103 (Stochastic Modelling)                         April 2004         Examiners Report
                              1 = 0.63 0 + 0.0828 2
                                                       2                               2                  + 1.422 2,
                              0 = 0.63(0.63 0 + 0.0828 ) + 1.370                           = 0.3969   0
implying that
Candidates were often unsure of the procedure required to derive the equations for the k but
did rather better at solving them. In particular, many candidates did not take correct
account of the relationship between Xt and past values of et. Marks were awarded for correct
methodology when deriving the solutions, even if the equations being solved were not the
right ones.
3      (i)       It is a jump process because it remains in one state for a period of time and
                 then jumps to another state, or alternatively because it is a continuous-time
                 process with a discrete state space.
                 Given the entire past history of the process, the probability of a member
                 retiring and beginning to receive benefits in the next dt interval is dt, i.e.
                 independent of the past. The same applies to the probability of death between
                 times t and t + dt: it is dependent only on the state (the number of retired
                 members) at time t, and not on anything which happened before that. Thus the
                 Markov property holds.
                                                                                                                       Page 3
              Subject 103 (Stochastic Modelling)       April 2004   Examiners Report
       (iv)      The suggested form of pn does not depend on t, so its derivative is zero. The
                 RHS is
                                         n                  n 1            n 1
                          (    n )e          (n 1) e                e             0,
                                        n!               (n 1)!         (n 1)!
where = / .
       (v)       The implication is that Poisson( ) is a stationary distribution for the Markov
                 jump process. (In this case it is also a limiting distribution, but that is not
                 deducible from the Core Reading.)
Very few candidates suggested that the rate at which deaths take place should be
proportional to the number of retired scheme members.
In (iv) examiners awarded marks for correct attempts to carry out the verification procedure
even when the differential equation in (iii) was wrong.
In (v), terms such as limiting distribution or equilibrium distribution were given full
credit.
4      (i)       Model fitting involves first choosing a suitable family of models, then a
                 suitable state space and finally estimating the values of the model parameters
                 to isolate a single member of the family.
                 Having performed the model fitting process, we have the best-fitting model
                 from the given family. Model validation involves checking whether this is an
                 adequate explanation of the observations.
                 Simulation can be used in model validation to test that the paths typically
                 followed by a simulated version of the process are broadly similar to the paths
                 observed in practice.
       (ii)      (a)     Under the given model, the successive increments        St form a
                         sequence of i.i.d. N( , 2) random variables, so the parameters can be
                         estimated using the sample mean and sample variance of the observed
                         increments.
                 (b)     The first thing to do is to take the log of the observations, obtaining x1,
                           , xn say. Then apply the same technique as in (a).
               Subject 103 (Stochastic Modelling)     April 2004    Examiners Report
                  (c)    This is a time series model, so a time series technique (such as Box-
                         Jenkins) would be appropriate.
                  (a)    The model implies that the increments are Normally distributed, so a
                         standard test of normality, such as the Anderson-Darling test, the
                         Kolmogorov-Smirnov test or a chi-squared goodness-of-fit test can be
                         employed to test this.
                         A similar test is to inspect the sample ACF and PACF of the residuals
                         to see whether the observed values fall outside the 5% significance
                         band.
                         A turning points test, a run test or a sign change test could also be
                         used, in each case comparing a calculated statistic with tables of a
                         reference distribution. These are tests for serial independence (usually
                         applied to a sequence of residuals).
(b) Apply the same tests as in (a) to the logarithm of the data.
                  (c)    Tests can be applied to the sequence of residuals arising from the
                         fitting process. Since, if the model is accurate, these should form a
                         sequence of uncorrelated Normal observations, the same tests as in (a)
                         can be applied.
                         A procedure which might be applied to the model in part (c) but not to
                         the models in (a) or (b) is to inspect the sample partial ACF of the
                         increment process St; if the process really is a first-order
                         autoregression as stated in the model, then the sample PACF should
                         have no significant values at lags higher than 1.
This question differentiated well between candidates, with some very good answers indicating
that the candidates understood the principles and practice of modelling, and others
highlighting deficiencies of understanding.
                                                                                           Page 5
               Subject 103 (Stochastic Modelling)                April 2004          Examiners Report
5      (i)     (a)    If X = B(½) and Y = B(1), then the marginal distribution of X is N(0,0.5)
and the conditional distribution of Y given X = x is N(x, 0.5).
                                       1
                                           exp( ( y 2 2 yx 2 x 2 ))
                                                                              1
                                                        f X ,Y ( x, b)            exp( (b 2 2bx 2 x 2 ))
                                    f X |Y ( x | b)
                                                           fY (b)                   1
                                                                                        exp ½b 2
                                                                                    2
                                           2
                                               exp     2( x ½b) 2
(ii) (a) Here we seek P(Y(1) > 0 | Y(½) = 5) = P(B(1) > 0 | B(½) = 1)
= P B( 12 ) 1
= P 2 B( 12 ) 0 2
= 1 2 = 0.9213.
                                                                         0 ½
                           Using part (i)(b), this is 1                               (1) 0.8413.
                                                                           ¼
This question was designed in such a way that candidates could tackle part (ii) even if part (i)
had not worked out right, but many allowed themselves to become discouraged by difficulties
in the early part of the question and gave up.
6       (i)       Use a linear congruential generator (LCG). Specify three positive integers
                  a c m with m a m c and an initial value x0 , then generate a sequence of
                   x1 x2    xn in the range {0 1 2              m 1} by the recursive rule
       Subject 103 (Stochastic Modelling)               April 2004      Examiners Report
xn (axn 1 c) (mod m) n 12 N
(iii) (a) We use the inverse transform method. The cdf of the Pareto is
                                      x      a                   1
                           F ( x)                 a 1
                                                          1
                                      0   (1 x)               (1 x) a
so the inverse is
                           F 1 ( y ) (1 y )       1a
                                                         1
(iv)      From (iii), we see that the tail of the Pareto distribution is 1 F ( x) (1 x) a ,
          which decreases to zero much more slowly than e.g. the exponential or normal
          distributions.
          Alternatively, it is called fat-tailed because of the relatively high probability
          of producing values which are a long way from the mean/median.
                                                                                                Page 7
           Subject 103 (Stochastic Modelling)      April 2004      Examiners Report
              In general, the Pareto offers a suitable choice for portfolios where there is a
              non-negligible chance of very large claims, which is reasonable in various
              forms of general insurance, in particular insurance that deals with natural
              catastrophes (floods, earthquakes, hurricanes etc).
This question was generally well answered and caused few difficulties.
7      (i)      (a)         We will use the matrix form of the Kolmogorov DE, which states that
        P '(t ) AP (t ) .
                          d T          T dP     T
       It follows that        P(t )               AP(t ) 0T P(t ) 0T .
                          dt              dt
       [The other Kolmogorov DE, P '(t ) P(t ) A , cannot be used to complete the proof]
                      The fact that the differential of this is equal to zero implies that TP(t)
                      = TP(0) = T for all t, in other words Xt has the same distribution for
                      all t.
(ii)          (a)     The states can be labelled as 0, 1, 2:0, 2:1, 2:2, where 0 and 1 represent
                      the number of operators occupied and 2:j means that both operators are
                      occupied and j calls are on hold.
              (b)
                      1/2               1/2                  1/2               1/2
                0                1                 2:0                 2:1                 2:2
                        1/3               2/3                   2/3              2/3
                                            1/ 2            1/ 2                  0           0              0
                                            1/ 3              5/6               1/ 2          0              0
                                  A          0                2/3                 7/6    1/ 2                0
                                             0                0                 2/3       7/6            1/ 2
                                             0                0                   0       2/3                2/3
(d) We have
                                 1          1                                                        3
                                      0             1                                         1              0
                                 2          3                                                        2
                                 5          1                 2                                          9
                                      1             0                 2:0                     2:0            0
                                 6          2                 3                                          8
                                 7              1                 2                                      27
                                      2:0               1                 2:1                 2:1              0
                                 6              2                 3                                      32
                                 7              1                     2                                   81
                                      2:1               2:0                 2:2               2:2                0
                                 6              2                     3                                  128
Many candidates made a good attempt at part (i), though not many scored full marks. In part
(ii) a surprising number of candidates were unable to distinguish between states in which
the process stays for a certain length of time and events, which occur instantaneously and
trigger transitions from one state to another.
8      (i)       Suppose at time t X has just arrived in state i. The probability that X remains
                 in state i until time t + k and then leaves, giving a duration of k + 1 steps in
                 state i, is piik (1 pii ) . In other words, P ( Di ,n d ) piid 1 (1 pii ) , which is
                 the probability function of a geometric random variable.
                 The fact that Di,n is independent of previous durations follows from the
                 Markov property: What happens to X after arriving in state i is independent of
                 anything that happened before that moment.
                                                                                                                              Page 9
       Subject 103 (Stochastic Modelling)        April 2004     Examiners Report
          An alternative indication that the Markov property is dubious is the fact that
          the restorer appears to return to Bath at regular intervals, i.e. regardless of the
          time spent in each state, the path taken appears to lack randomness.
(iii)      pij   nij / ni , where nij is the number of transitions from state i to state j and
          ni+ is the total number of transitions out of state i. For example, of the 8 days
          spent in Warwick, one is followed by a trip to Caernarvon, one by a trip to
          Bath and the remaining 6 are followed by another day in Warwick, so that nWC
          = 1, nWB = 1 and nWW = 6. We therefore have (using the order B, C, S, W)
                          9     2 1
                             0
                         12    12 12
                          2 13
                                0 0
                         15 15
                    P
                             1 8 1
                          0
                            10 10 10
                          1 1      6
                                0
                          8 8      8
(iv)      The model is irreducible. Starting from Bath it is possible to visit Stratford,
          Warwick, Caernarvon and return to Bath, completing the circuit.
It is also aperiodic, since pii > 0 for some (in fact for all) state i.
(v)       We need the entry of P3 corresponding to the row and column of Warwick
          (4th row, 4th column in this case).
           1       1       0       6
             .1417   .0111   .1550   .5729 = 0.4488.
           8       8       8       8
          Each part of this question attracted some unexpected answers as well as some good
          ones. The answers to part (i) were on the whole disappointing, but the final parts were
          rather better answered in general.
           Subject 103 (Stochastic Modelling)                                                April 2004                 Examiners Report
              Noticing that Cov(Dt, Rt) =  Var(Rt), we suggest minus the ratio of the
              sample covariance of D with R divided by the sample variance of the R, i.e.
                                                                                                                                           Page 11
              Subject 103 (Stochastic Modelling)                            April 2004                Examiners Report
                         120
                               (rt     r )(dt          d)
                         t 1
                               120
                                                            , although in fact a better estimate would be obtained
                                                   2
                                      (rt     r)
                                t 1
                 if account was taken of the variable mean of the Rt.
                                                                                                           2
                 (Less appropriate is using the fact that Var( Dt )                                            Var( Rt ) to suggest
                          2
                         sD / sR2 , since the sign of                     is lost in this operation.)
       (v)        X121     X120             R121 D121 . In addition, R121                             1        ( R120         0)      e121 and
                  D121                R121 .
Most candidates made encouraging progress with this question, though part (iv) proved
difficult for most. In part (v), as well, the relationships between the variables were not
always well understood.
                                                        [u c (t s )]                Nt                    [u c (t s ) N t ]    s[ e     1]
       (iii)     (a)       E[Yt        s    | Ft ] e                   E[e               s   | Ft ] e                                        .
This is equal to Yt if c [1 e ] .
                        If, on the other hand, c > , then there is no positive crossing point; the
                        line y = c goes slowly to      as          , whereas [1 e ] tends to
                             exponentially fast as        .
(c) Yes; this is the condition for the company to remain solvent.
                                                  K       u       K
                                      E[YT ] e        e       e
                 (d)    Clearly,              K               K
                                                                      .
                                        1 e            1 e
There was an error in the final part of the question; where the question asked for a lower
bound, the quantity which could be derived from the previous part of the question was in fact
an upper bound. Candidates who reached the last part and were confused by the error were
treated generously.
It appeared that many candidates tried this question when they were short on time. Those
candidates who attempted part (iv) often did quite well on it, even if they had omitted earlier
parts of the question.
Page 13