Tutorial IR

download Tutorial IR

of 30

Transcript of Tutorial IR

  • 7/25/2019 Tutorial IR

    1/30

    An accompaniment to a course on interest rate

    modeling: with discussion of Black-76, Vasicek

    and HJM models and a gentle introduction to the

    multivariate LIBOR Market Model

    Vasily Nekrasov

    http://www.yetanotherquant.de

    July 10, 2013

    Abstract

    The goal of this paper is to help the motivated students with theircourse on interest rate modeling and/or to help them to learn the LIBORmodel by themselves. It implies the reader knows what forward rates,caps, and swap[tion]s are and has some knowledge of the quantitative fi-nance: at least Ito Calculus, Black-Scholes-[Merton] Formula, GirsanovsTheorem (in one dimension is enough) and the risk-neutral pricing. Thisstuff is usually taught in the first course on continuous financial modelingand is relatively easy. The interest rate modeling is much more compli-cated. Still those, who carefully read the wonderful Steven Shreves bookcan learn the short-rate models, change of numeraire and Heath-Jarrow-Morton framework. But not the multivariate LIBOR Model (though thereis a short section on the one factor LIBOR Model and its relation to theHJM). However, the Bond/IR market is essentially multivariate and theLIBOR Model can be introduced independently. But I could not find anytutorial, which would suit me. So I decided to write my own. It concerns

    theory only and not the calibration and computational aspects, which arethe issues for the future papers.

    1 Typical problems of a quant student

    If you read this paper you are probably attending an [advanced] course oninterest-rate modeling. If so, you likely know some short rate models and maybethe HJM framework. Though formally not necessary for studying of the LIBORModel, it is still very helpful to be aware of this stuff. Thats why if you are not, I

    Many thanks to Dmitri Semenchenko, Dr. Frank Wittemann, Dr. Michael Busch, Dr.Michael Meyer, Dr. Thomas Mazzoni.

    1

  • 7/25/2019 Tutorial IR

    2/30

    do recommend to read the respective chapters of Shreves book first. Historicaldevelopment was Black-76 Short rate models HJM LIBOR MarketModel (LIBOR with jumps and stochastic volatility). So starting directlyfrom the LIBOR would mean ignoring the historical roots. So we discuss theearlier models too trying to enlighten some principle ideas, which are frequentlymisunderstood by students.

    1.1 The Black formula (Black-76)

    The Black-Scholes-[Merton] formula was a breakthrough and still remains themain formula in quantitative finance. Black-76 is its modification for the com-modity(!) market though it is far more frequently used to price the interest ratederivatives. Surprisingly, this formula often lacks attention in the lectures. Forexample, if the lecture notes are based on Brigo and Mercurio, it is introducedwithout derivation (and even motivation) and if they are based on Shreve, itmight be ignored at all since Shreve derives it from LIBOR Model, which comesat the very end of the paragraph on the term structure models. We, in turn, willlearn it thoroughly since the motivation behind the LIBOR is its consistencewith Black-76 [for caplets].

    In 1976 Fischer Black wrote his seminal Pricing of commodity contracts.His assumptions were that the short rates are constant, the futures prices are

    lognormally distributed with known variance rate 2 and the expected changein futures prices is zero. Once again, the extension was intended for commoditymarket: Black, himself, did write about farmers and harvests but did not writeanything about options on bonds, caps and swaptions.

    But the market readily adopted his approach ignoring that e.g. bond pricesand forward rates cannot be simultaneously lognormal, let alone that the as-sumption of the constant short rate can do for the grain market but is clearlyabsurd for the bond market. But in the 70th there were few models and if onehas only a hammer at hand, everything around seems to be the nails. So themarket felt fine using the Black-76 long before it was rigorously justified withLIBOR Model. Finally, as Black himself wrote in his another famous paper1:

    In the end, a theory is accepted not because it is confirmed by conventionalempirical tests, but because researches persuade one another that the theory iscorrect and relevant.

    Black used the delta hedging to derive his formula (recall that by his timethe risk neutral pricing was not yet discovered, so the delta hedging was an onlytool to price options from no arbitrage assumption). He also appealed to theCAPM (Capital Asset Pricing Model) to motivate his assumptions empirically,stating that if the covariance of the change in the futures price with the returnon the market portfolio is zero, then the expected change in the futures price will

    1Fischer Black, Noise. The Journal of Finance, Volume 41, Issue 3 July 1986

    2

  • 7/25/2019 Tutorial IR

    3/30

    be zero and then recalling an empirical study2 confirming that this is [nearly]the case for the wheat, corn and soybean futures.Now let us quickly discuss what forward and futures contracts are.

    Definition 1.1. A forward on an assetX is a contract at time t to sellX atfuture timeT(maturity date) at the fixed priceK (strike).

    Example 1.2. Which strike makes the present value of the forward equal tozero at time t if the interest rate r is constant? (We denote such strike withF orX(t, T) and call it the fair price).

    At time t you have no money. The price of the zero T-Bond is P(t, T) =er(Tt). BorrowX(t) [at rate r] and buy one unit ofX. At time T you have

    to pay back X(t)P(t,T)

    and can sell X for X(T) whereX(T) is the spot price ofX

    at time T. So from no arbitrage

    F orX(t, T)

    =

    X(t)

    P(t, T)

    = er(Tt)X(t) (1)

    Thein1holds even if the interest rate is not constant, all we need is a tradableT-Bond at time t. But for thea constant interest rate is needed.Forwards are the OTC contracts. Note that a forward, unlike an option, is

    obligatory for both counterparties.

    There is a credit risk since one side can go bankrupt before T. Moreover,the contract must specify in details the quality and quantity ofX by delivery.Futures on X solve both problems. Futures are traded on a [commodity] ex-change and are standardized by the exchange. One can loosely consider thefutures as forwards, whose fair prices are reset every day. Denote F utX(t, T)the futures price on Xat time t with deliveryT. ClearlyF utX(T, T) =X(T).In order to enter the futures contract one does not need to pay F utX(t, T),indeed it costs nothing to enter the futures contract (fair price). One still paysthe safety margin but to the exchange, not to the counterparty. This is in noway the price of the contract but just a collateral to eliminate the default risk.If, for exampleX(t1, T) = 100,X(t2, T) = 110 and one went long in the futures

    contract at time t1, his gain at time t2 is 10, which is immedeately paid bythe counterparty. If the counterparty cannot pay, the exchange pays from thesafety margin and the contract gets closed. On maturity dateT one pays theX(t1, T) = 100 and receives a commodity. The sum of daily payments generatedby this futures contract is

    (X(t2, T)X(t1, T))+(X(t3, T)X(t2, T)) + . . .+ (X(ti, T)X(ti1, T)) + . . .

    + (X(T, T) X(tn1, T) = X(T, T) X(t1, T) = X(T) X(t1, T) (2)2Dusak K., 1973 Futures trading and investor returns: An investigation of commodity

    market risk premiums, Journal of Political Economy 81, Nov./Dec., 1387-1406

    3

  • 7/25/2019 Tutorial IR

    4/30

    Stop for a while and make sure you do understand the idea of a futures contract(check the Hulls book if you dont). Otherwise you will not understand 5.Note that normally there is a daily accrued interest on the account to(from)which X(ti, T) X(ti1, T) flows on the i-th day, so2 is nota profit(loss) of afutures contract unless the interest rate is equal to zero.

    Now we turn back to the Black-76 formula. For simplicity denote F(t) =F utX(t, T) (we can do so since T is arbitrary but fixed). Under the Blacksassumptions(lognormality and expected zero change of the futures prices) wecan write the dynamics ofF(t) as

    dF(t) =F(t)dW(t) (3)

    At time t we consider a call option with a strike Kand the maturity T on thefutures contract with delivery at time S, t < T < S. For the price of this calloption we writeC(t, F(t), r , T , S , K ) =C(t, F) sincer,T,Sand Kare arbitrarybut fixed. By Ito Formula we obtain

    dC(t, F(t)) =Ctdt+ CFdF(t) +1

    2CFFd

    2F(t)

    =Ctdt+ CFF(t)dW(t) +1

    2CFF

    2F2(t)dt (4)

    We can eliminate the random factor CFF(t)dW(t) from 4 if we go short inCF futures contract at time t. Following Black, we denote the initial value ofthe futures contract withu(t). It follows that u(t) = 0 since it costs nothing toenter the contract with futures priceF(t). But obviously as time goes tot + dt,a futures contract generates a cashflow F(t + dt) F(t), i.e.

    du(t) =dF(t) =F(t)dW(t) (5)

    So a portfolio C(t, F) u(t)CF, i.e. with a call long and CF futures short isriskless (no dW factor) and evolves according to

    d (C(t, F) u(t)CF) =Ctdt+12

    CFF2F2(t)dt= r (C(t, F) u(t)CF) (6)

    The right-hand part is due to no-arbitrage: every riskless portfolio must growaccording to d =rdt. But u(t) = 0 and finally we get

    Ctdt+1

    2CFF

    2F2(t)dt= rC(t, F) (7)

    The solution of this equation is the famous Black-76 Formula

    C(t, F(t), r , T , S , K ) =F(t)er(Tt)N(d1) Ker(Tt)N(d2) (8)where

    d1= 1

    T

    ln

    F(t)

    K

    +

    1

    22T

    4

  • 7/25/2019 Tutorial IR

    5/30

  • 7/25/2019 Tutorial IR

    6/30

    1.2 Short-rate Models and the martingale measure

    Short rate is an instantaneous time dependent interest ratert so that the bankaccount (with initial investment of 1e) grows according to

    B(t) = exp

    t

    0

    r(s)ds

    (11)

    Respectively, by the risk neutral pricing, the price of a T-Bond at time T is

    P(t, T) = EQ

    BtBT

    Ft

    = EQ

    exp

    Tt

    r(s)ds

    Ft

    (12)

    whereQ is the martingale measure. rt can be (and really is) stochastic but attimet we know how our bank account will change in an infinitesimal timespan.On the other hand the respective change of a T-Bond price is unpredictablesince it depends not only on rt but also on future short rates up to T. We canrewrite 11 as dB(t) = rBdt, so B(t) is differentiable and thus less randomthan a T-Bond. We can define a discounted T-Bond by

    Z(t, T) =D(t)P(t, T) =P(t, T)/B(t) (13)

    The dynamics ofZ(t, T) must be a martingale under Q. But the dynamics ofrt need not since bonds are tradable and rt is not.The first short rate model is due to Vasicek (1977)4

    dr(t) = (a brt)dt+ dWt (14)

    In modern books the model is often introduced directly under the risk neu-tral measure Q (so that in14actually stays dWQt ) and a student with a stockprice modeling background gets puzzled: in stock market we observed a stock(a tradable asset) under the real measure than killed the drift to make it mar-tingale. As you [should] remember, there is no arbitrage iff there is a martingalemeasure (so that the discounted prices of all tradable assets are martingales).

    And the market is complete iff the martingale measure is unique. But how toproceed with rt, which is untradable and thus need not to be a martingale?!Moreover, it is unobservable even under the real measure5). Finally, does Qexist at all and if it does, is it unique?!

    We will follow the original Vasiceks paper (which is far more often cited than

    4Indeed, Vasicek started with much more general settings and considered 14 just as aspecial case. Mertons Model (1973) was actually published earlier but still Vasicek is somehowconsidered as the first, well, at least in the most of textbooks.

    5Garatek et al (p. xiii) say that the overnight rate is a good proxy for the short rate butFilipovic (p.10) argues it is not because the motives and needs driving overnight borrowersare very different from those of borrowers who want money for a month or more.

    6

  • 7/25/2019 Tutorial IR

    7/30

    read)6 and see how Vasicek have introduced the market price of risk. There isno martingale measures in his paper since the risk neutral approach was intro-duced several years later by Harrision, Kreps and Pliska. However, as soon aswe have the market price of risk t we can introduce the risk neutral measureby Girsanovs theorem and turn back to more modern risk neutral approach.

    In a short-rate model the bond price depends on t, rt and T so we writeP(t, T) = f(t, rt, T) 7, whereas T is arbitrary but fixed. By Ito formula thedynamics of the bond price is

    df=f(t)dt + f(r)drt +1

    2f(rr)d

    2rt(14)

    = (f(t) + [a brt]f(r) +2

    2 f(rr))dt + f(r)dWt

    (15)which we can rewrite as

    df(t, rt, T) = f(t, rt, T) [(t, rt, T)dt + (t, rt, T)dWt] (16)

    where

    (t, rt, T) =f(t)+ [a brt]f(r)+

    2

    2f(rr)

    f(t, rt, T)

    (t, rt, T) = f(r)f(t, rt, T)

    The formula 16 resembles us the old good lognormal dynamics of a stockprice, where (t, rt, T) and (t, rt, T) are respectively the drift and the volatil-ity coefficients. (A reader, who studied the multivariate stock market modelmight anticipate that we are going to talk about the market price of risk[(t, rt, T) rt]/(t, rt, T))

    Now consider an investor who at time t issues an amount U1 of a bond withmaturity date T1 and simultaneously buys an amount U2 of a bond maturingat time T2. The total value of this portfolio is U = U2U1 and it evolvesaccording to

    dU= [U2(t, rt, T2)U1(t, rt, T1)]dt + [U2(t, rt, T2)U1(t, rt, T1)]dWt (17)

    We can eliminate the stochastic risk factor in17 choosing

    U1= U (t, rt, T2)

    (t, rt, T1) (t, rt, T2) U2= U+ U1= U (t, rt, T1)

    (t, rt, T1) (t, rt, T2)Such portfolio is riskless (there is only the drift term [U2(t, rt, T2)U1(t, rt, T1)]dt) and thus by no arbitrage assumption (cp. with 6)

    U[(t, rt, T2)(t, rt, T1) (t, rt, T1)(t, rt, T2)]

    (t, rt, T1) (t, rt, T2) dt= rtU dt (18)6http://www.wilmott.com/messageview.cfm?catid=4&threadid=859487This holds only ifrt is a Markov process (in our case it is), otherwise P(t, T) may addi-

    tionally depend on rs for all s [0, t]

    7

  • 7/25/2019 Tutorial IR

    8/30

    or equivalently(t, rt, T2) rt

    (t, rt, T2) =

    (t, rt, T1) rt(t, rt, T1)

    =:t (19)

    wheret can be interpreted as the market price of risk. Note that it must bethe same for all bonds and thus does note depend on bond maturity (but ofcourse may depend on t and rt). But what risk do we actually mean? Well, aswe stated before, the bank account is less random than a bond, so we expectan excess return on a bond. The longer is the maturity of a bond the larger isits volatily(t, rt, T) and respectively its expected excess return (t, rt, T) rtbut their ratio does not varies with T.Writing19 as

    (t, rt, T) rt = t(t, rt, T) (20)and substituting for (t, rt, T) and(t, rt, T) from16 we get the term structureequation

    f(t)+ [a t brt]f(r)+ 2

    2f(rr) rtf= 0 (21)

    with the terminal condition

    f(T, rT, T) = 1

    The term structure equation has a solution

    8

    so the model at least does not con-tradict the no arbitrage assumption which we engaged before. Respectively, amartingale measure exists.

    On the other hand the Girsanovs theorem tells us that dWQt = tdt+ dWtfor some processtand it only remains to show that indeed t = t . Rewriting14under Q we obtain

    dr(t) = (a brt t)dt + dWQt (22)Writing as before f =f(t, rt, T) for the price of a T-Bond at time t we obtainby the Ito product rule d[Dtf] = rtDtf+ Dtdf so

    d[Dtf] = rtDtf dt + Dt f(t)+ f(r)(a brt t) +

    2 f(rr) dt + f(r)DtdWQt(23)

    Since the discounted price is a martingale the drift (dt-term) must be zero.Comparing23 with21 we conclude thatt = t and assignt:= a t

    So we showed why and how we can model directly under the risk neutral mea-sure. However, the model does not imply any restrictions from which followsthatt must be unique. Within the model is just required that t must be thesame for all bonds but we can arbitrarily choose it. For different t we will getdifferent bond prices, all of them arbitrage free. But having only model at hand

    8see Shreves book how to use the Ansatz f(t, rt, T) = exp(rtC(t, T)A(t, T))

    8

  • 7/25/2019 Tutorial IR

    9/30

    we cannot choose the right t. Thus the model as such is not complete.What you need to understand now is that the market players, not a model,define the martingale measure. And if the market players have the same infor-mation and the same risk preference the market will be complete. Of courseit is not so in practice but we likeassuming it because if we do, the pricing ofevery bounded contingent claim is, in principle, possible. So we assume thatthe market is complete and moreover, that the market price of risk is constant,i.e. t . Next we realize that bonds were traded long before the quantita-tive models appeared :). Since the term structure equation gives us the [closedform] solution for the bond price, we can calibrate the model and choose theparameters (,b,) to fit the bond prices (and usually some liquid derivativeslike caps and swaptions) as good as possible. Then we can use the model toprice any other derivatives. Note that we can find but not directly, sincethe dynamics of the short rate is unobservable. This is the difference with e.g.the Black-Scholes model, in which we can [at least theoretically] find the driftand thus the market price of risk r

    .

    Note that the assumption of the constant parameters is too restrictive, so theVasicek model generally cannot perfectly fit the current term structure. It isthe difference between the equilibrium and the arbitrage free models. The for-mer usually assume some equilibrium in demand and supply(mean reversion isa kind of such equilibrium) and generate the term structure as an output. The

    latter can be perfectly matched to the current term structure (but not necessar-ily to the current caps and swaptions prices). Vasicek, CIR are the equilibriummodels and Ho Lee, Hull White, Black Karasinski, G2 are the arbitrage freemodels. Such fitting is achieved via a time dependent parameter, e.g. in HullWhite Model is time dependent:

    dr(t) = ((t) brt)dt+ dWQt (24)Recall that in the Vasicek Model is constant.

    1.3 Some aspects of Heath-Jarrow-Morton

    The [one factor] HJM Model is well discussed in Shreves book and it would

    have made little sense to blueprint it here. Instead, we discuss some aspectswhich may help you to go on with Shreve.

    1. Motivation. HJM models not the short rate but rather the whole forwardcurve. The model is complete and the drift under the risk neutral measure turnsout to be fully determined by the volatility. See Shreve for more details.

    2. Instantaneous forward rate. If the interest is continiously compound, theforward ratef(t , T , T + ) is defined from

    P(t, T) = ef(t,T,T+)P(t, T+ ) f(t , T , T + ) = 1

    ln

    P(t, T)

    P(t, T+ )

    9

  • 7/25/2019 Tutorial IR

    10/30

    Letting 0 we obtain

    lim0

    ln(P(t, T)) ln(P(t, T+ ))

    = ln(P(t, T))T

    =:f(t, T) (25)

    SettingT =t we get the instantaneous short rate R(t) =f(t, t). Note that theinstantaneous forward rate is unobservable too.

    3. It turns out that the affine short rate models, i.e. those for which P(t, rt, T) =exp(rtC(t, T)A(t, T)) can be embedded in HJM via the drift term condition.See Shreve for more details.

    4. In Shreves book one confronts the following expression

    d

    Tt

    f(t, s)ds

    = f(t, t)dt

    Tt

    df(t, s)ds

    If you need to check it but already forgot (as I did) the rule how to differentiatesuch integrals, you may proceed as follows

    dt

    Tt

    f(t, s)ds

    =

    dt

    t

    T

    f(t, s)ds

    = lim

    x0

    t+xT

    f(t+ x, s)ds t

    T

    f(t, s)ds

    x

    = limx0

    t+xT

    f(t + x, s)ds tT

    f(t+ x, s)ds

    +

    tT

    f(t + x, s)ds tT

    f(t, s)ds

    x

    = limx0

    f(t + x, t) +

    tT

    limx0

    [f(t + x, s)ds f(t, s)ds]x

    =f(t, t) +

    tT

    dt[f(t, s)ds]

    Here we have implicitely assumed that f(t, T) is sufficiently good, so that wecan exchange limit and integral.

    5. It is impossible to make the instantaneous forward rate lognormal sincein this case it explodes. See Shreve for more details. However, the simply com-pounded forward rates may be made lognormal. Shreve introduces one factorLIBOR model via HJM and I recommend to have a look on it.

    1.4 Change to T-Forward measure, Radon Nykodim pro-cess

    The short-rate models are usually a piece of cake (as long as you understandthat a market, not a model defines a martingale measure Q, and if we calibratea model w.r.t. the current term structure and a set of sufficiently liquid in-struments - usually Caps or Swaptions - we get the short rate dynamics under

    10

  • 7/25/2019 Tutorial IR

    11/30

    Q). The HJM is usually managable too, at least Shreve makes it so. But it isthe change of numeraire / T-Forward measures what makes troubles. Often aT-Forward measureQT is introduced as

    dQT := 1

    P(0, T)BTdQ (26)

    which is actually

    dQT := P(T, T)

    P(0, T)BTdQ (27)

    It is easy to prove that it is indeed a probability measure. (Can youdo it? If not,dont worry, we will do it later). dQT/dQ is a Radon-Nikodym derivative fromt = 0 and withT in mind. But we need a Radon-Nikodym (a.k.a. likelihood)process for al l t [0, T]. It is

    t:= P(t, T)

    P(0, T)Bt(28)

    but unfortenately it often appears just like rabbit out of the hat. Soon we willconsider its derivation in detail but so far just note the term P(t, T) in thenumerator. So you can guess for what I state that 26 is actually27.As soon as we have t we can find at time t the price (XT) of a contingentclaimXTusing the T-Forward measure since

    EQT [XT|Ft] = 1tEQ[XTT|Ft] = B(t)P(0, T)

    P(t, T) EQ

    XT

    P(T, T)

    B(T)P(0, T)

    Ft

    = 1

    P(t, T)EQ

    XT

    B(t)

    B(T)

    Ft

    = 1

    P(t, T)(XT) (29)

    UnderQ there is a quotient ofXT and BT whereas under QT we have onlyXT. So the formula29 is useful when XT and BTare dependent, which is thecase for the interest rate derivatives. However, this formula is just an elegantbut useless theoretical construction until we specify the dynamics ofX(t) underQT. To do this we need to understand the change of numeraire for which, in

    turn, we need the

    2 Multivariate Girsanovs Theorem and Choleskydecomposition

    2.1 Preliminaries

    Before we state the multivariate Girsanovs theorem, we will explain what t isin the discrete case (on a binary tree) and then prove two useful lemmas for tin continuous time.So let us recall the change from the objective measure P to the martingale

    11

  • 7/25/2019 Tutorial IR

    12/30

    measureQ on a binary tree. Qis not a T-Forward measure (Qis called the spotmeasure) but for a discussion of the Radon-Nykodim process this is irrelevant.

    (A):1

    (B): 1q1p

    (C):q

    p

    (D): (1q)2

    (1p)2

    (E): q(1q)p(1p)

    (F): q2

    p2

    (G): q3

    p3

    (H):q2(1q)

    p2(1p)

    (I): q(1q)2

    p(1p)2

    (J): (1q)3

    (1p)3

    (1

    p)

    (1

    q)

    q

    p

    I noted before that the T-Forward measure is defined from t= 0 with T inmind. In our case T corresponds to the last four nodes: (G), (H), (I) and (J).ThoughQ is not a T-Forward measure, the Radon Nykodim derivative (in thesense of the classical Real Analysis) is still defined from t = 0 and with T inmind. (Or in our case it is better to say from (A) with{(G), (H), (I), (J)} inmind). Namely, for a measurable space (, F) and two absolutely continuousmeasuresQ and Pon (, F) (i.e which agree about the measure-zero sets of )there is a unique9 functionf, called the Radon Nykodim derivative, such that

    Q(A) = A

    f()dP()

    = EP[f()1A] = EP[f()1A|F0] A F (30)The in30 is valid ifPis a probability measure (in our case it is by def-

    inition). The question is in which case is Q a probability measure as well?An obvious necessary and sufficient condition is the nonnegativity of f and

    f(w)dP() = 1 must hold true. Indeed, Q is a measure (i.e. Q() = 0 and

    Q is -additive) by the Radon Nykodim theorem. And sinceQ() = 1 it is aprobability measure10.

    9Upto a set of measure zero w.r.t. P10Now I am sure you can prove that QT defined in26 is a probability measure.

    12

  • 7/25/2019 Tutorial IR

    13/30

    In our case the Radon Nykodim derivative is defined according to Table 1:

    Table 1: Radon Nykodim derivative from (A) to{(G), (H), (I), (J)} in mind (G) (H) (I) (J)

    f() q3

    p3q2(1q)p2(1p)

    q(1q)2

    p(1p)2(1q)3

    (1p)3

    Because the measures are discrete, an integral in 30 becomes just a sum and

    Q() =

    f()dP() = q3

    p3p3 + q

    2

    (1 q)p2(1 p) 3p2(1 p)

    +q(1 q)2

    p(1 p)2 3p(1 p)2 +

    (1 q)3(1 p)3 (1 p)

    3 = 1

    Considering in 30 more generally, we want to express the expectationsw.r.t. Q as the expectations w.r.t. P. For any random variable X F it isactually

    EQ[X] =

    XdQ() =

    Xf()dP() = EP[f()X] (31)

    But we are interested not only in thesingleRadon Nykodim derivative. Ratherwe want to define the Radon Nykodim process t so that{(G), (H), (I), (J)}inmind stays but now we are looking from any node, not only from (A). It is justlike replacing P(T, T) and BT in27with P(t, T) and Bt in28. We can definethe Radon Nykodim derivative e.g. from (C) to{(G), (H), (I), (J)} in mind(Table2). Note that it is impossible to move in (J) from (C), so the respective

    Table 2: Radon Nykodim derivative from (C) to{(G), (H), (I), (J)} in mind (G) (H) (I) (J)

    f() q2

    p2q(1q)p(1p)

    (1q)2

    (1p)2 0

    probability is set to zero.

    However, such approach is indeed not what we need. What do we need toprice derivatives? Just to take expectation w.r.t. the martingale measure andsometimes express this expectation via expectation w.r.t. the real and T-forwardmeasures. Now recall31! So we do not need the Radon Nykodim process asa function, we only need it in the sense of expectation, however, conditionalexpectation, given a -algebraFt.So we define the Radon Nykodim process as

    t = EP

    Q

    P

    Ft

    = EP

    f()Ft {(G), (H), (I), (J)} (32)

    13

  • 7/25/2019 Tutorial IR

    14/30

    A conditional expectation t is, informally speaking, a random variable uptotimet but int is takes a concrete value and is no more random. Lets calculatethet fort [1,.., 4], i.e. from start to end of our binary tree

    t= 0 0= EP

    f()F0= EP[f()] = 1

    At time t = 0 the conditional expectation is equal to the unconditional expec-tation and the Randon Nykodim derivative is equal to 1 as we calculated above.Keep this point in mind, in the next section we introduce t more formally incontinuous time and will in particular start with a nonnegative random variable

    Z, s.t. EP[Z] = 1

    t= 2 2=

    p

    q3

    p3+ (1 p) q

    2(1 q)p2(1 p)

    1(F)+

    p

    q2(1 q)p2(1 p)+ (1 p)

    q(1 q)2p(1 p)2

    1(E)

    +

    p

    q(1 q)2p(1 p)2 + (1 p)

    (1 q)3(1 p)3

    1(D)=

    q2

    p21(F)+

    q(1 q)p(1 p) 1(E)+

    (1 q)2(1 p)2 1(D)

    Here 1(F), 1(E), 1(D) respectively mean the event that at time t = 2 we land inthe node (F), (E), (D). In advance we do not know which note it will be, butas time goes to t = 2 we will know it. If, e.g., we are in (D) by the timet= 2

    then2= (1q)2

    (1p)2

    t= 3 3= q3

    p3 1(G)+q2(1

    q)

    p2(1 p) 1(H)+q(1

    q)2

    p(1 p)2 1(I)+(1

    q)3

    (1 p)3 1(J)

    For t = 1 you calculate n1 as an exercise. A property of the nested conditionalexpectations EP[EP[3|F2]|F1] = EP[3|F1] will make the computation easiersince we already calculated 2= EP[3|F2].

    Now let us express the price of a contingent claim X at time t = 1 via theRadon Nykodim process (we write1(X) for this price). For [notation] simplic-ity assume the interest rate is zero. Also assume w.l.o.g. that at time t= 1 weare in node (C) Then

    1(X) = EQ[X3|F1] =q2X(G)+ 2q(1 q)X(H)+ (1 q)2X(I)+ 0X(J)

    =p2q2

    p2X(G)+ 2p(1 p) q(1 q)p(1 p) X(H)+ (1 p)

    2 (1 q)2(1 p)2 X(I)

    =p

    q

    p2

    q3

    p3X(G)+ 2p(1 p) q

    2(1 q)p2(1 p) X(H)+ (1 p)

    2 q(1 q)2p(1 p)2 X(I)

    =p

    qEP[3X3|F1] = 1

    1EP[3X3|F1] (33)

    Analogously assume that at time t = 1 we are in node (B) and show that 33holds in this case too.

    14

  • 7/25/2019 Tutorial IR

    15/30

    In general it holds

    EQ[XT|Fs] = 1s

    EP[TXT|Fs] (34)

    We made a big (though a bit boring) job to understand the idea of the RadonNykodim process by means of concrete examples. Right now we are going torigorously discuss this idea in continuous time.Consider a usual filtered probability space (, F, (F)t, P). Let Z be an a.s.positive random variable, whose value becomes known at time T, satisfyingEP[Z] = 1. Define a new probability measureQ as (cp. 30)

    Q(A) = EP [Z1A] dQdP

    =Z (35)

    Define a Radon Nykodim Process11

    Z(t) = EP[Z|F(t)], 0 t T (36)Z(t) is a martingale since by iterated conditioning

    EP[Z(t)|F(s)] = EP[EP[Z|F(t)]F(s)] = EP[Z|F(s)] =Z(s) (37)

    Now, following Shreve, we prove two lemmas in which we present further prop-erties ofZ(t)

    Lemma 2.1. Let t satisfying0 t Tbe given and let Y be anF(t)-measurablerandom variable. ThenEQ[Y] = EP[Y Z(t)]

    Proof.

    EQ[Y] = EP[Y Z] = EP[EP[Y Z|Ft]]= EP[YEP[Z|Ft]] = EP[Y Z(t)]

    where in we took Y out of the conditional expectation since Y isF(t)-measurable and thus known at t.

    Lemma 2.2. Let s and t satisfying 0 s t T be given and let Y be anF(t)-measurable random variable. Then (cp. 33)

    EQ[Y|F(s)] = 1Z(s)

    EP[Y Z(t)|F(s)]

    Proof. According to the definition of the conditional expectation, 1Z(s)

    EP[Y Z(t)|F(s)]must beFs-measurable (which is obvious sinceFs Ft) and the partial-averaging property must hold, i.e.

    A

    1

    Z(s)EP[Y Z(t)|Fs]dQ !=

    A

    Y dQ A Fs

    11Here I abuse with notation a little bit writing Z(t) instead of t in order to keep theformulae consistent with Shreves book.

    15

  • 7/25/2019 Tutorial IR

    16/30

    With some algebra, where in () we take in 1A which is known, since A FsA

    1

    Z(s)EP[Y Z(t)|Fs]dQ def= EQ

    1A

    1

    Z(s)EP[Y Z(t)|Fs]

    2.1

    =

    EP

    Z(s)1A

    1

    Z(s)EP[Y Z(t)|Fs]

    ()= EP[EP[1AY Z(t)|Fs]]2.1= EQ[1AY] def=

    A

    Y dQ

    2.2 Girsanovs Theorem

    Exercise 2.3. Take your Financial Math lecture notes or Shreves book and gothrough the proof of the Girsanovs theorem in one dimension.

    Theorem 2.4. (Girsanov, multiple dimension) Let T be an arbitrary but fixedpositive time, (t) = [1(t),..,d(t)] be a d-dimensional adapted process andWP(t) = [WP1 (t),..,W

    Pd (t)]be a d-dimensional uncorrelated

    12 Brownian Motionunder a measureP. Define

    Z(t) = exp

    t

    0

    (u)dWP(u) 12

    t

    0

    (u)2du

    (38)

    WQ(t) =WP(t) +

    t0

    (u)du

    and13 assume that

    EP

    T

    0

    (u)2Z2(u)du <

    SetZ=Z(T). ThenEP[Z] = 1 and under the probability measureQ given by

    Q(A) = A

    Z()dP() A F (39)

    the processWQ(t)is a d-dimensional Brownian motion. Moreover, if theWP(t)is the only source of uncertainty (i.e. it generates the filtration of our probabilityspace) then the converse theorem holds too, i.e. all absolutely continuous changesof measure are in form of39.

    12In a sense, the theorem holds for correlated processes too, however, there are some nuances,see below.13This is Novikov condition, which guarantees that the Radon Nykodim process Z(t) does

    not explode.

    16

  • 7/25/2019 Tutorial IR

    17/30

    Some remarks on notation:

    WQ(t) = [WQ1 (t),..,WQd (t)] W

    Qj (t) =W

    Pj (t) +

    t0

    j(u)du j = 1,..,d

    (u) = d

    j=1

    2j (u)

    t

    0

    (u)dW(u) =

    t

    0

    dj=1

    j(u)dWj(u) =

    dj=1

    t

    0

    j(u)dWj(u)

    Note thatdZ(t) =(t)Z(t)dWP(t) (40)

    and (t) is sometimes called the Girsanov kernel. If you have thoroughly donethe exercise2.3the proof of the multidimensional case shall be nothing new butsome technical details, as declared in many textbooks (Shreve p. 225, Bjorkp.165) and so it may be omitted. However, if the components of WP(t) arecorrelated, the things are not so easy: we demonstrate it by means of examplewithd= 2, = [1, 2] and dW

    P1 (t)dW

    P2 (t) =dt. In this case we have

    dZ(t) =Z(t)[1

    dWP

    1 (t) +

    2dWP

    2 (t)

    1

    2(2

    1+ 2

    2)dt]

    +1

    2Z(t)[21d

    2WP1 (t) + 22d

    2WP2 (t) + 212dWP1 (t)dW

    P2 (t)]

    =Z(t)[1dWP

    1 (t) + 2dWP

    2 (t) + 12dt] (41)

    and the drift part 12dt= 0 thus Z(t) is not a martingale. If we modify38so thatZ(t) = exp(

    t0

    (u)dWP(s) 12

    t0

    [WP]t) (it is called the Doleans-Dadeexponential, [W]t stands for the quadratic variation of W(t)) then Z(t) is a

    martingale but now one should be careful withWQj (t) =WPj (t) +

    t0

    j(u)du. . .However, we can always switch from a correlated Wiener Process to the uncor-related one by means of

    2.3 Cholesky decompositionLetW(t) = [W1(t),..,Wd(t)] be a d-dimensional correlated Wiener process. Weknow that a multivariate normal random variable is completely characterized by(, ), i.e. by its covariance matrix and the vector of expectations. It is also thecase for the multivariate Wiener process, whereas its vector of drifts is zero andso the covariance matrix (t) is only what matters14. (t) may be time depen-

    14By the Ito Isometry we can introduce the L2-norm ofW(t) defined on [0, T]:

    W2L2 =d

    i=1

    d

    j=1

    T

    0

    ij(s)i(s)j(s)ds

    Since the covariance matrix is positive definite, W2L2 is nonnegative. If you like functionalanalysis, check that all other norm properties hold true.

    17

  • 7/25/2019 Tutorial IR

    18/30

    dent and even random (but should be Ftmeasurable). Since (t) is positive def-inite, there exists a unique triangular matrix H(t) such that (t) =H(t)HT(t).(One can consider H as a square root from ). Let B(t) = [B1(t), . . ,Bd(t)]be uncorrelated standard Wiener process. Then WT(t) = H(t)BT(t) sinceH(t)BT(t) has the covariance matrix (t).We consider the simplest example with

    =

    21 1212

    22

    !=

    a 0

    b c

    a b0 c

    =

    a2 abab b2 + c2

    (42)

    We immediately see that a = 1, b = 2 and c = 21 2. So W1(t)W2(t)

    =

    1 0

    2 2

    1 2

    B1(t)B2(t)

    =

    1B1(t)

    2B1(t) + 2

    1 2B2(t)

    Exercise 2.5. Check that this process has indeed the covariance matrix

    Exercise 2.6. Consider the cases d=3 and d=4 and calculate the Choleskydecomposition with Maple (with(linalg) then cholesky()). You may also try ford=3 manually if you like exercising in identical transformations.Though the elements ofHwill be much longer, the main idea remains the same:W1 = 1B1(t), W2(t) is the weighted sum of B1(t), B2(t) and W3(t) is theweighted sum ofB1(t), B2(t), B3(t)where the weights are the respective elements

    ofH.

    Cholesky decomposition allows us to turn a correlated Wiener process to anuncorrelated one by BP(t) =H1(t)WP(t), commit Girsanovs transformationonBP(t), obtain BQ(t) and finally multiply it with H(t).

    Exercise 2.7. Do this for the from42and= [1, 2].Check that underQ the covariance matrix remains the same. Compare it to thefact that one-dimensional Girsanov transformation does not change the volatil-ity.

    Cholesky decomposition also allows us to simulate a multivariate correlatedWiener Process. For that we just need a random number generator (e.g. Box-

    Muller), which generates the i.i.d standard normal random variables.

    3 Change of Numeraire

    Now we go on with the change of numeraire, which is practically useful onlytogether with change of measure, for which, in turn, we need the multivariateGirsanovs theorem. Technically, a numeraire is just a strictly positive process,economically it is the asset, in which all other assets are denominated. To bea numeraire an asset should not pay dividends (they cause the price drop). Acommon numeraire is a bond or a bank account, so you have often considereda discounted stock price S(t)/B(t) and a discounted bond price B(t)/B(t) =

    18

  • 7/25/2019 Tutorial IR

    19/30

    1. But technically nothing stops us from using S(t) as a numeraire. ThenS(t)/S(t) = 1 and we must consider the dynamics ofB(t)/S(t). You see thatthe price of a numeraire-asset, discounted [in terms of numeraire itself] isalways one.The problem by the change of numeraire is that the risk neutral measure changestoo.

    Exercise 3.1. Find the dynamics ofB(t)/S(t)in the Black-Scholes model underthe real measure. Hint: just use Ito product rule.Also find the risk neutral measure, assosicated with the new numeraire. Hint:

    just kill the drift.Exercise 3.2. Consider a market with two correlated risky assets and a risklessbank account. Let the market evolve under the spot martingale measure Q asfollows

    dS1(t) =S1(t)[rdt+ 1dB1(t)]

    dS2(t) =S2(t)[rdt+ 2dB1(t) + 2

    1 2dB2(t)]dS0(t) =rS0(t)dt

    (43)

    Change the numeraire to S1 and calculate the market dynamics under the newnumeraire. Try to find the Girsanov kernel, which makes all assets to martin-gales under the new numeraire.

    There are plausible economic interpretations of the change of numeraire.First of all it is the case when we deal with both domestic and foreign currency,but ok, in both cases there are just bank accounts. To be closer to the exercise3.1,consider the following idea: as the Crisis2008 came, many investors decidedto turn their dollars and euros to gold. So sitting on a pile of gold they canestimate how many other assets (among them bonds) they can buy for theirgold.Exercise3.1 is also a useful trick to derive the Black-Scholes-Merton formulawith random interest rate (s. Shreve, p. 394, Theorem 9.4.2).

    Theorem 3.3. Let X0(t), X1(t),..,XN(t) be traded assets so that X0(t)X0(t)

    = 1,X1(t)X0(t)

    ,.., XN(t)X0(t)

    are martingales (under some measureQ). FixT > t and define

    a new probability measure by

    Qi(A) =X0(0)

    Xi(0)

    A

    Xi(T)

    X0(T)dQ (44)

    Then X0(t)Xi(t)

    X1(t)Xi(t)

    ,.., XN(t)Xi(t)

    are martingales underQi

    Proof. By the Radon Nykodim theorem Qi is a measure. We must showthat it is a probability measure, i.e. that Qi() = 1. It follows

    Qi() = X0(0)

    Xi(0)

    Xi(T)

    X0(T)dQ=

    X0(0)

    Xi(0)EQ

    Xi(T)

    X0(T)

    ()=

    X0(0)

    Xi(0)

    Xi(0)

    X0(0) = 1

    19

  • 7/25/2019 Tutorial IR

    20/30

    In () we used the martingale property of Xi(t)X0(t)

    under Q.

    Now we show that Xj(t)Xi(t)

    is a martingale under Qi. From 44 we can readily

    identify the Radon Nykodim derivative, it is X0(0)Xi(0)

    Xi(T)X0(T)

    . We want to obtain

    the Radon Nykodim process which is

    t = EQ

    X0(0)

    Xi(0)

    Xi(T)

    X0(T)

    Ft ()

    = X0(0)

    Xi(0)

    Xi(t)

    X0(t)

    where in () we again used the martingale property. Moreover

    EQi

    Xj(T)

    Xi(T)

    Ft

    2.2=

    Xi(0)

    X0(0)

    X0(t)

    Xi(t)EQ

    X0(0)

    Xi(0)

    Xi(T)

    X0(T)

    Xj(T)Xi(T)

    Ft

    (45)

    Of course until we specify the dynamics of Xj(t)Xi(t)

    underQi the pricing formula

    45is useless for practice. Indeed, changing the numeraire we change the volatil-ity of discounted assets (and it should be obvious since now Xi(t)/Xi(t) = 1 soits vola is zero but X0(t)/Xi(t) is volatile). You may look at details in Shrevesbook. We, however, can proceed further with LIBOR without them.

    4 The Multivariate LIBOR Market Model

    Finally, we can start with our main topic. The main advantage of the LIBORMarket Model15 (besides it provides a theoretical justification for Black-76) isthat the forward rates are either directly observable on an interbank marketor are calculable from [government] bond prices16. Let us at first recall, howthe LIBOR forward rates are calculated. One fixes a tenor, which is a yearfraction: usually one, three or six months. We consider thesimply compoundedforward ratesL(t , T , T + ) i.e.

    P(t, T+ )(1 + L(t , T , T + )) =P(t, T) (46)

    The LIBOR model is essentially discrete, i.e. T ={T0=, T1= 2,..,Tn=(n+ 1)

    }. For notation simplicity we write Lj(t) instead of L(t, Tj1, Tj) =

    L(t, Tj1, Tij+ ). From46 we immediately derive

    Lj(t) = 1

    P(t, Tj1)

    P(t, Tj1+ ) 1

    =

    1

    P(t, Tj1)

    P(t, Tj) 1

    (47)

    15Following an established tradition, we say LIBOR Model though it is actually a forwardrates model. There is a big family of IBORs, among which the LIBOR and the EURIBORare the most important.16Unfortunately not all forward rates can be uniquely derived from bond prices, since in

    reality the distance between the maturities of the long term bonds are much longer than theLIBOR tenor. E.g. if there are the bonds with matirities Ti1 = 10 years, Ti = 15 years andthe tenor is 6 months, we have to interpolate the bond prices, which is a non trivial task inpractice.

    20

  • 7/25/2019 Tutorial IR

    21/30

    Further we introduce the forward discount factor

    Pj(t) =P(t, Tj1)

    P(t, Tj) Lj(t) = 1

    (Pj(t) 1) (48)

    The equation47tells us how to reproduce the Lj(t) with a bond portfolio.So the LIBORs are tradable. On the other hand we can derive the prices of thebonds with maturitiesT1, T2,..,Tn from the LIBOR rates. Thus it may (and itdoes) make sense to model the dynamics of LIBOR rates directly.

    So in LIBOR Model we postulate that each LIBOR rate Lj has a dynamicsunder the Tj-Forward measureQTj according to

    dLj(t) =j(t)Lj(t)dWQTj (t) (49)

    WhyTj-Forward martingale measure QTj and not simply the [spot] martingalemeasureQ?! Well, first of allP(t, Tj) in the denominator in47can give us anidea about a suitable numeraire and secondly, the Black-76 is consistent underthe T-Forward measure but is not under the spot measure.If we want to price caplets/floorlets only, we do not need anything else, sincethe Black-76 Formula immediately follows from49 (prove it!). Even if we needa price of Caps/Floors,49will still do, since they are just a sum of independent

    caplets/floorlets. But if we need to price e.g. swaptions, we need to considerthecorrelationbetween different LIBOR rates and they are obviously correlated(indeed strongly but not perfectlymultifactor model). So we haven LIBORrates and thusn correlated Wiener Processes. In order to decorrelate it, we usethe Cholesky toolkit17 and write49 in vector notation as

    WQT1

    1 (t)...

    WQTnn (t)

    =

    h11(t) . . . 0...

    . . . ...

    h1n(t) . . . hnn(t)

    BQT11 (t)

    ...

    BQTnn (t)

    (50)

    If you carefully did exercise2.5you see that for each j = [1,..,n]49follows from50, i.e.

    j(t)dWQTj =

    nk=1

    hjk(t)dBQTkk (t) (51)

    Please note that I deliberately write n correlated Wiener Processes andavoid writing n-dimensional Wiener Process. As a matter of fact an n-dimensional Wiener Process should be defined under some measure, the samefor all its components. W.l.o.g. we choose it to be the measure QTj . FromGirsanovs theorem we know that changing from one measure to another simply

    17If we do not want to model time dependent volatilities and correlations, we can simplyestimate the covariance matrix from historical data and then calculate H.

    21

  • 7/25/2019 Tutorial IR

    22/30

    means changing the drift term. So we can rewrite50 as

    WQTj

    1 (t) j1(t)...

    WQTjj (t)

    ...

    WQTjn (t)

    jn(t)

    =

    h11(t) . . . 0...

    . . . ...

    h1n(t) . . . hnn(t)

    BQTj1 (t) mj1(t)

    ...

    BQTjj (t) mjj(t)

    ...

    BQTjn (t)

    mjn(t)

    (52)

    We do not yet know the values ofj1(t),..,jn(t) and must find them. However,

    we do not need to look for the explicit values ofmj1(t),..,mjn(t)

    From Theorem3.3 we obtain

    j1j (t) := dQTj1

    dQTj

    Ft= P(0, Tj)P(0, Tj1)

    P(t, Tj1)

    P(t, Tj) (53)

    Further

    d(j1j (t))48=

    P(0, Tj)

    P(0, Tj1)d(Pj(t)) =

    P(0, Tj)

    P(0, Tj1)d(1 + Lj(t))

    = P(0, Tj)

    P(0, Tj1)

    P(t, Tj1)

    P(t, Tj) j1j (t)

    P(t, Tj)

    P(t, Tj1) 1

    Pj(t)

    d(1 + Lj(t))

    =j1j (t)

    Pj(t) j(t)Lj(t)dW

    QTj =j1j (t)j(t)Lj(t)

    1 + Lj(t)dWQTj

    =j1j (t) Lj(t)

    1 + Lj(t)

    n

    k=1 hjk(t)dBQTj (t)

    =j1j (t) Lj(t)

    1 + Lj(t)[hj1(t),..,hjn(t)]

    dBQTj1 (t) dmj1(t)

    ...

    dBQTjj (t) dmjj(t)

    ...

    dBQTjn (t) dmjn(t)

    (54)

    From40 we immediately identify j(t)Lj(t)

    1+Lj(t) =

    Lj(t)1+Lj(t)

    [hj1(t),..,hjn(t)] as Gir-

    22

  • 7/25/2019 Tutorial IR

    23/30

    sanov kernel(t). Respectively

    dBQTj1 (t) dmj1(t)

    ...

    dBQTjj (t) dmjj(t)

    ...

    dBQTjn (t) dmjn(t)

    = Lj(t)

    (1 + Lj(t))

    hj1(t)...

    hjj(t)...

    hjn(t)

    +

    dBQTj11 (t) dmj1(t)

    ...

    dBQTj1j (t) dmjj(t)

    ...

    dBQTj1n (t) dmjn(t)

    (55)and

    j(t)dWQTj = [hj1(t),..,hjj (t), . . ,hjn(t)]

    Lj(t)

    (1 + Lj(t))

    hj1(t)...

    hjj(t)...

    hjn(t)

    +

    dBQTj11 (t) dmj1(t)

    ...

    dBQTj1j (t) dmjj(t)

    ...

    dBQTj1n (t) dmjn(t)

    = Lj(t)

    (1 + Lj(t))

    2j (t) + j(t)dWQTj1 (t) (56)

    Plugging this into49 yields together with51

    dLj(t) =Lj(t)

    Lj(t)

    (1 + Lj(t))2j (t) + j(t)dW

    QTj1 (t)

    (57)

    Rearranging56 we obtain

    j(t)dWQTj1 (t) =j(t)dW

    QTj (t) Lj(t)(1 + Lj(t))

    2j (t) (58)

    or equivalently

    dWQTj1 (t) =dWQTj (t) Lj(t)(1 + Lj(t))

    nk=1

    h2jk(t) (59)

    Similarly for dWQTj2 (t)

    dWQTj2 (t) =dWQTj1 (t) Lj1(t)(1 + Lj1(t))

    nk=1

    h2(j1)k(t) (60)

    23

  • 7/25/2019 Tutorial IR

    24/30

    Further

    dLj(t) =Lj(t)

    nk=1

    h2jk(t)

    dWQTj1 (t) + Lj(t)

    (1 + Lj(t))

    nk=1

    h2jk(t)

    =

    Lj(t)

    nk=1

    h2jk(t)

    dWQTj2 (t) + Lj1(t)

    (1 + Lj1(t))

    nk=1

    h2(j1)k

    (t) + Lj(t)

    (1 +Lj(t))

    nk=1

    h2jk(t)

    =Lj(t)j(t)dWQTj2 (t) +

    Lj1(t)

    (1 + Lj1(t))

    j(t)(j1)(t)(j1)j(t) + Lj(t)

    (1 + Lj(t))

    2j (t) (61)Proceeding so iteratively we obtain

    dWQTj (t) =

    dWQTp (t) p

    k=j+1

    Lk(t)1+Lk(t)

    k(t)dt j < p

    dWQTp (t) j =p

    dWQTp (t) +j1k=p

    Lk(t)1+Lk(t)

    k(t)dt j > p

    (62)

    And

    dLj(t) =

    Lj(t)j(t)dWQTp (t) Lj(t)

    p

    k=j+1Lk(t)

    1+Lk(t)j(t)k(t)jkdt j < p

    Lj(t)j(t)dWQTp (t) j = p

    Lj(t)j(t)dWQTp (t) + Lj(t)

    j1k=p

    Lk(t)1+Lk(t)

    j(t)k(t)jkdt j > p

    (63)Finally, it is useful to note that driftsdmj1(t),..,dm

    jn(t) in55cancel out thus

    we can, using62and63and fixing some measure (let it be w.l.o.g QT1), rewrite52as

    dWQT1

    1 (t)...

    dWQT1j (t)

    ...

    dWQT1n (t)

    =

    0...

    j1

    k=1Lk(t)

    1+L

    k(t

    )

    k(t)dt

    ...n1k=1

    Lk(t)1+Lk(t)

    k(t)dt

    +h11(t) . . . 0

    .

    .. .

    . . .

    ..h1n(t) . . . hnn(t)

    dBQT11 (t)

    ...

    dBQT1j (t)

    ...

    dBQT1n (t))

    (64)The formulae62 , 63 and 64 are the main result of this section. Of course it

    would be also interesting to prove that all bond prices, discounted with the Tj-bond as numeraire are indeed martingales under QTj . But it is mainly technicalquestion and we can spare it (or postpone till the next tutorial).But the last task we have to do is to derive the dynamics of LIBOR rate underthe spot martingale measure Q, i.e. the measure associated with the bank

    24

  • 7/25/2019 Tutorial IR

    25/30

    account as the numeraire. For this we need to introduce the idea of a [discrete]bank account. It is as follows: assumeTj1< t < Tj, i.e. Lj1(the interest ratefrom Tj1 to Tj) is already fixed and no more random. We define the rollingover bank account at time t as follows: invest 1e at T0 and reinvest on eachreset date until Tj1. At time t the present value of such bank account is

    B(t) =P(t, Tj)

    j1k=0

    (1 + Lk) (65)

    We derive the dynamics ofWQ

    via the dynamics ofWQT

    j+1 . Inverting28yields

    (t) = dQ

    dQTj+1 =

    B(t)P(0, Tj+1)

    P(t, Tj+1) =

    P(t, Tj)

    :=C j1k=0

    (1 + Lk)P(0, Tj+1)

    P(t, Tj+1) (66)

    Analogous to53we obtain

    d[(t)] =C d[Pj+1(t)] =C d[1 + Lj+1(t)] =C j+1(t)Lj+1(t)dWQTj+1

    =C j+1(t)Lj+1(t)

    P(t, Tj)

    P(t, Tj+1)

    P(t, Tj+1)

    P(t, Tj) dW

    QTj+1

    =(t)

    j+1(t)Lj+1(t)

    1 + Lj+1(t) dW

    QTj+1

    (67)

    So j+1(t)Lj+1(t)

    1+Lj+1(t) is the Girsanov kernel and you know how proceed further.

    5 Whats next?

    We did big (and hopefully good) job, so that now you are well prepared to yourexam in interest rate modeling. However, this theory is only the top of icebergso for your job [in order both to find and to do it] you may need to know thefollowing things:

    1). The LIBOR market model, which we have considered is actually the forwardLIBOR model. It is not compatible with the swapLIBOR model (just like Black-76 for caps is not compatible with Black76 for swaptions). However, there isa surprisingly good approximation for the swaption prices in the forward model.

    2). Implied volatilities and implied correlations. As I steadily pointed out,the LIBOR market model is consistent with Black-76. Unfortunately the mar-ket prices of caps and floors are not! This phenomenon is known as volatilitysmile (analogously correlation smile in case of swaptions). So if you put thehistorical volatility into Black-76, you will hardly get the current market priceof a cap or a floor. But you can proceed other way around and find an impliedvolatility, which makes Black-76 consistent with current price. This is known

    25

  • 7/25/2019 Tutorial IR

    26/30

    as plugging the wrong value into the wrong formula to get the right price.Implied volatility can be time dependent and even strike dependent. It reflectsthe market expectation of the future volatility. There is nothing unnatural inthe dependence on strike. Obviously, if the price of an asset goes too high ortoo low there will be a big hype and the volatility will grow.Analogously the implied correlations can be extracted from swaptions. Swap-tions and Caps(Floors) are very liquid derivatives, so one can consider them asprimary traded assets as well. We use them to calibrate our models and thento price not so liquid derivatives.

    3). Practically, it is difficult to extract the implied correlations from the market,so one introduce a parametrization. jk = exp(|Tj Tk|) is a popular choice.

    4). There are many models for stochastic implied volatility and the most pop-ular is probably the SABR model. There is a whole book on the LIBOR withSABR (see References).

    5). We have derived the dynamics of the LIBOR rates under different riskneutral measures. Due to pretty complicated drifts we should carefully choosethe measure under which to simulate - in order to achieve good numerical ac-curacy.

    6). The LIBOR model is essentially discrete but how about derivatives, whosematurities are not the multiples of the tenor ? In this case one has to interpo-late the LIBOR dynamics but this is in no way trivial task if we want to stayarbitrage free and keep the positivity of all rates.

    7). So far we have assumed that the LIBOR rates are driven by a multidimen-sional Wiener Process. To model a defaultable term structure and, in general,for better reality approximation we might need to add jump processes.

    8). In our model the number of Brownian Motions is equal to the numberof LIBOR rates. This is indeed too many. The Principle Component Analysisshows that 3 random factors are enough, they are interpreted as parallel shifts,tilting(affects the slope) and flex(affect the curvature) of the forward curve.

    9). After the Crisis2008 the world of interest rates became multicurve. In par-ticular, it means that the LIBOR or EURIBOR rates are still used as a referencefor IR Swaps but for discounting one takes another curve, see Bianchetti andCarlicchi(2012). Moreover, the swaps are usually collateralized, which makeseven a pricing of plain-vanilla swaps very challenging, see Brigo(2012).

    26

  • 7/25/2019 Tutorial IR

    27/30

    6 References

    Fischer Black (1976). The pricing of commodity contracts, Journal of FinancialEconomics, 3, 167-179.

    Oldrich Vasicek (1977). An Equilibrium Characterisation of the Term Struc-ture. Journal of Financial Economics 5 (2): 177188

    John C. Hull - Options, Futures and Other Derivatives, 7th Ed. - PrenticeHall, 2008

    Damiano Brigo and Fabio Mercurio - Interest Rate Models - Theory and Prac-tice: With Smile, Inflation and Credit, 2nd Ed. - Springer, 2006

    Damiano Brigo - Counterparty Risk FAQ: Credit VaR, PFE, CVA, DVA, Close-out, Netting, Collateral, Re-hypothecation, WWR, Basel, Funding, CCDS andMargin Lending, 2012 - http://arxiv.org/pdf/1111.1331v3.pdf

    Marco Bianchetti and Mattia Carlicchi - Interest Rates After The Credit Crunch:Multiple-Curve Vanilla Derivatives and SABR, 2012http://arxiv.org/ftp/arxiv/papers/1103/1103.2567.pdf

    Damir Filipovic - Term Structure Models: A Graduate Course - Springer, 2009

    Riccardo Rebonato, Kenneth McKay and Richard White - The SABR/LIBORMarket Model: Pricing, Calibration and Hedging for Complex Interest RateDerivatives - Wiley, 2009

    Dariusz Gatarek, Przemyslaw Bachert and Robert Maksymiuk - The LIBORMarket Model in Practice - Wiley, 2012

    Thomas Bjork - Arbitrage Theory in Continuous Time, 3rd Ed. - Oxford Fi-nance, 2009

    Steven E. Shreve - Stochastic Calculus for Finance II: Continuous-Time Models,

    2nd Ed. - Springer, 2004

    James Jardine - An Examination and Implementation of the Libor MarketModel (Master Thesis), 2006 - http://www.jimme.net/Thesis.pdf

    27

  • 7/25/2019 Tutorial IR

    28/30

    7 Solutions to Exercises

    Exercise 2.5

    cov[dW1(t), dW1(t)] =V AR[dW1(t)] =21 V AR[dB1(t)] =

    21dt

    cov[dW2(t), dW2(t)] =V AR[dW2(t)]()= 222V AR[dB1(t)]+(12)22V AR[dB2(t)] =22dt

    cov[dW1(t), dW2(t)] = E[dW1(t) dW2(t)] E[dW1(t)]E[dW2(t)]

    =0= E[12dB1(t)dB1(t) + 12

    1 2 dB1(t)dB2(t)

    =0()

    ] =12dt

    where in () we used the independence ofB1(t) and B2(t)

    Exercise 2.7

    dBQ1(t)

    dBQ2(t)

    =

    1dt+ dB

    P1(t)

    2dt+ dBP2(t)

    Thus 1 0

    2 2

    1 2

    dBQ1 (t)

    dBQ2 (t)

    =

    1dB

    Q1(t)

    2dBQ1(t) + 2

    1 2dBQ2(t)

    =

    dWQ1 (t)

    dWQ2 (t)

    =

    1[1dt+ dB

    P1(t)]

    2[1dt+ dBP1(t)] + 2

    1 2[2dt + dBP2(t)]

    =

    11dt+ dW

    P1 (t)

    2[1+

    1 22]dt+ dWP2 (t)]

    =:

    1dt+ dW

    P1 (t)

    2dt+ dWP

    2 (t)]

    Respectively, if the drifts coefficients 1 and 2 are given and you are going tokill the drifts by WP1 (t) andW

    P1 (t), you have to choose 1= 11

    and2

    =

    2

    2

    1 /1 2Looking at this solution you are probably trying to reconcile it with section5.4.3 of Shreves book. In this case, do not be confused with Example 5.4.4,note that in this example there aretwo stocks but onlyoneBrownian Motion.

    Exercise 3.1

    Recall: dBt = rBtdt und dSt =St(dt+ dWt)

    d

    BtSt

    =

    dBtSt

    + Btd

    1

    St

    =

    rBtdt

    St+ Bt

    1

    S2tdSt+

    1

    2

    2StS4t

    d2St

    28

  • 7/25/2019 Tutorial IR

    29/30

    = BtSt

    [r + 2]dt dWt

    =

    BtSt

    [r + 2]dt + dWt

    sinceW(t) is statistically indistinguishable fromW(t). We kill the drift ofBtunder numeraireSt as follows:

    [r + 2]dt+ dWt

    =

    [r

    + ]dt+ dWt

    =: dWQSt

    Now consider

    dlnBtSt

    = StBt

    dBtSt

    12

    S2t

    B2td2 Bt

    St= [r + 2

    2 ]dt + dWt

    And finally

    BtSt

    = B0S0

    exp

    [r +

    2

    2]t+ Wt

    =

    1

    S0exp

    [r +

    2

    2 ]t + Wt

    Exercise 3.2

    Here to keep the notation simpler I omit (t), writing e.g. S1 instead ofS1(t).

    dS2

    S1

    = S2d

    1S1

    + 1

    S1dS2+ dS2d

    1S1

    =S2

    1

    S21S1(rdt+ 1dB1) +

    1

    S121 dt

    +

    S2S1

    rdt+ 2dB1+ 2

    1 2dB2

    +S2S1

    (rdt 1dB1+ 21dt)(rdt+ 2dB1+ 2

    1 2dB2) 12dB1dB1remains only

    =S2S1

    (21 12)dt+ (2 1)dB1+ 2

    1 2dB2

    The dynamics of S0S1

    follows from the exercise3.1 setting r = , i.e.

    d

    S0S1

    =

    S0S1

    1(1dt+ dB1) =S0S1

    1dBQS11

    Respectively (recalling that we have already replaceddB1 with dB1 in theprevious equation and thus must do it in the following too) we obtain

    S2S1

    (21 12)dt+ (2 1)dB1+ 2

    1 2dB2

    =S2S1

    (21 12)dt+ (1 2)(1dt+ dB1) (1 2)1dt+ 2

    1 2dB2

    29

  • 7/25/2019 Tutorial IR

    30/30

    =S2S1

    (1 2)(1dt + dB1) + 2

    1 2dB2

    =

    S2S1

    (1 2)dBQS11 + 2

    1 2dBQS12

    Note that the drift under the neu numeraire S1 was completely absorbed withdB1. This is because we have already started with the risk-neutral dynamicsunder the numeraire S0. Had we started under the real world measure like inExercise3.1 it would not be the case and a part of the drift would be absorbedwithdB2.

    30