Notes by KS  /  last changed Mon Nov 26 17:43:10 EST 2012  /  Private, Not For Distribution

Black font is a paraphrase of Gali 08, blue is a direct quote, red is my comment




Gali 2008: "Monetary Policy, Inflation, and the Business Cycle"

Chapter 1 Introduction

1.1 Real Business Cycle (RBC) model

The RBC model "firmly established the use of DSGE models as a central tool" for macro. The DSGE framework developed in the 80's [Kydland & Prescott 82, Prescott 86].

Methodologically: "Behavioral equations describing aggregate variables were thus replaced by first-order conditions of intertemporal problems facing consumers and firms". "Rational expectations" replaces "ad hoc assumptions on the formation of expectations".

Conceptually: Cooley & Hansen 89 add a monetary sector to this RBC model, and this results in the classical monetary model, treated in chapter 2. It was influential in academic circles but not used by banks---because it finds that central banks don't have the "power to influence output and employment, at least in the short run" [references Friedman & Schwartz 1963 for empirical support]. Mentions the "Friedman Rule", which is to keep the nominal rate zero. Conclusion was that the classical model is incomplete.

1.2 The new Keynesian model

The New Keynesian (NK) model developed in the late 70's, parallel to the RBC theory, and then incorporated the DSGE framework.

In their simplest forms, both RBC and NK assume

Although endogenous capital accumulation is a key element in RBC theory and not in the canonical versions of the NK theory, "it is easy to incorporate and is a common feature of medium-scale versions [of NK]."

The major and important differences between the resulting NK theory and the RBC model are In this NK framework, response to shocks is generally inefficient. And, "...the non-neutrality of monetary policy...makes room for potentially welfare-enhancing interventions by the monetary authority in order to minimize the existing distortions." Also, the NK models are arguably immune from the Lucas Critique.

1.2.1 Evidence of nominal rigidities and policy non-neutralities

Two distinctive features and essential ingredients of the NK models require empirical evidence: nominal rigidities and the real effects of monetary policy.

1.2.1.1 Evidence of nominal rigidities

References empirical studies based on micro data of price and wage rigidities. The evidence points to price durations in the range 6-12 months.

1.2.1.2 Evidence of monetary policy non-neutralities

Gali says establishing this evidence is more difficult than establishing rigidities. I would add that it's more subtle, interesting, and problematic, given that it involves more complicated statistical inference.

This subject is more difficult. The crux of the matter is that in general, policy depends on measurable economic variables and vice-versa. The trick is to disentangle cause and effect "...by identifying changes in policy that are autonomous, i.e., that is, not the result of the central bank's response to movements in other variables."

Recent literature is cited, by authors such as Christiano, Eichenbaum, & Evans 99 [some graphs reproduced]; Sims 92; Gali 92; Bernanke & Mihov 98; and Uhlig 05; Peersman & Smets 03; and Romer & Romer 89.

1.3 Organization of book



Chapter 2 A classical monetary model (CMM)

This model is characterized by perfect competition and full flexible (not sticky) prices. Money plays a very limited role---in §§1-4 it is just the unit of account, and in §5 it generates utility to households.

2.1 Households

The representative household seeks to maximize the utility

(1)      E0 Σ βt U(Ct, Nt)

where C is consumption and N is labor (number of hours, but can be considered continuous), and the sum is from 0 to ∞ unless otherwise mentioned. The utility function U is assumed increasing in C and nondecreasing in N. The second derivatives of U (marginal utilities) are assumed nonincreasing in C and N. Below, Ut denotes U(Ct, Nt).

Et is the expectation operator at time t. It seems that the system will be driven exogenously with random variables, in particular through the technology coefficient that determines production. The notation Et[x] indicates the expectation of x, given that all random variables for all periods up to and including t have already been chosen. Thus, variables at time t are not random and are constant with respect to Et. That is, Et[xt] = xt. This becomes important in what follows.

The household's maximization of U is subject to the flow budget constraint

(2)      PtCt + QtBt    ≤    Bt−1 + WtNt − Tt

where P is the price of CG, W is the nominal wage. B is the quantity of one-period nominally riskless bonds, purchased in period t and maturing in period t+1. Each bond (a discount bond) costs Qt at time t and pays a unit of money at maturity. T is lump-sum additions of subtractions to income, so it can include taxes and dividends. The household knows P, Q, and W at time t when deciding on C, N at time t.

I think the sign in the Tt term should be "+", as it is in chapter 3, see below. That is, the convention is that Tt is an addition to income.

How is B determined? If the flow constraint were an equality constraint, it would be determined from the other variables as the one-period, discounted bonds bought with unspent money (positive B) or as loans (negative B). What is the interest rate on such loans? Can we assume that the loans are one-period and discounted at the same effective rate as the one-period discounted bonds that would be bought?

Notice that the income (RHS) at time t is determined by the choice of labor N at time t, along with the choice of consumption. That is, there is no delay between the choice of labor and the resultant household income. The bond maturity of one time step ties everything to the one time scale, with no allowance (in this model) for smoothing or delays.

To me it seems we should take the budget constraint (2) as an equality, which I will do in these notes unless I say otherwise.

There is also a "solvency constraint", that the limit as t goes to infinity of the expected value of B is nonnegative. This prevents the accumulation of debt.

2.1.1 Optimal C and N

First, at a fixed time t, vary C and N while respecting the budget constraint, and consider everything else in the utility function fixed. Stationarity and the budget constraint demand that

     Uc,t dC + Un,t dN = 0

where the second subscripts on Ut denote partial derivatives, and

     PtdC = WtdN

This leads to Gali's first necessary condition

(4)      −Un,t/Uc,t = Wt/Pt

Notice that if U is separable in C and N, Uc is a function of C alone, Un is a function of N alone, and (4) allows us, in principle, to find N(C) (at time t for every t). For the simple power law utility coming soon, the calculation is simple. It is enough then to look for C. (An example with nonseparable utility is given in §2.5.2.)

An intertemporal condition is next. Suppose we consider reallocating consumption between periods t and t+1, keeping everything else fixed. To balance an increase in consumption at time t+1 we must consume less at time t, investing the savings in one-period bonds at the discount rate Q, so there is the condition

     Pt+1dCt+1 = −PtdCt/Qt

The RHS is the number of bonds we can buy at time t with PtdCt units of money, and each is worth one unit of money at time t+1.

Note: This is similar to the budget constraint above but is intertemporal. Therefore a decrease in consumption at time t in general affects Pt+1. If this is taken into account it leads to the condition

     P*t+1dCt+1 = −PtdCt/Qt

where P*t+1 is the price of CG at time t+1 when consumption at time t is perturbed by dCt. But this agrees with Pt+1dCt+1 to first order in dCt.


The first-order stationarity condition is

(4-intertemporal)      Uc,tdCt + βEt[Uc,t+1dCt+1] = 0

where the expectation is over the random variables at time t, that in general affect utility at time t+1. Combining this with the intertemporal budget constraint gives Gali's second condition

(5)      Qt = βEt[(Uc,t+1/Uc,t)(Pt/Pt+1)]

The following separable form for U, with the required nonincreasing marginal utility, is assumed in "much of what follows":

     U(Ct, Nt) = Ct1−σ/(1−σ) − Nt1+φ/(1+φ)

The optimality conditions (4) and (5) can be solved explicitly with this form, yielding Gali's (6), which is an explicit intratemporal relation between C and N; and (7), which involves an intertemporal expectation.

(6)      Wt/Pt = CtσNtφ

(7)      Qt = βEt[(Ct+1/Ct)−σ(Pt/Pt+1)] = Et[ β(Ct+1/Ct)−σ(Pt/Pt+1)]

(β is a constant.) Using lower-case letters to denote natural logs of variables, yields the nice log-linear form for the optimality condition (6):

(8)      w − p = σc + φn

We'll use natural logs throughout. This can be interpreted as a competitive labor supply schedule, determining the "quantity of labor supplied as a function of the real wage, given the marginal utility of consumption (which under the assumptions is a function of consumption only)".

This can be written as

(8a)      n = (w − p)/φ − σc/φ

which means that labor increases with the difference between wage and CG price (called the "real wage" below); and decreases with consumption. Question: are these implications consistent with intuition?

The second optimality condition, Gali's (7), involves an intertemporal expectation, and requires some approximations to write in a simple form in terms of readily interpretable quantities. This is where things start to get interesting, because one approximation is around a "steady state with constant rates of inflation and consumption growth". Gali gives details of the expansion around the steady state in Appendix 2.1. I'll go slowly here.


First, the bond discount Q = 1/(1 + yield), so we can define the interest rate as −log Q, since

     i ≡ −log Q = log(1 + yield) ≈ yield

for small yield.

I'll call this Approximation 1, which might conceivably cause a problem if the interest rate is high.

Taking −log of (7) gives us

(7a)      it = − log { Et[β(Ct+1/Ct)−σ(Pt/Pt+1)] }

Now define

     ρ ≡ −log β

     Δct+1 = ct+1 − ct

and inflation (of log prices) as

     πt+1 = pt+1 − pt

and rewrite (7a) as

(7b)      it = − log { Et[ exp( −ρ − σΔct+1 − πt+1) ] }

and further, raising e to this power,

(7c)      1 = exp( it )Et[ exp( −ρ − σΔct+1 − πt+1) ]

Gali starts Appendix 2.1 with

(44)      1 = Et[ exp( it − ρ − σΔct+1 − πt+1) ]

which is described as a rewrite of (7).

Slipping the interest factor inside the expectation is OK here because it is determined at time t, and is therefore a constant with respect to the expectation Et, as noted above. Actually, it is not even a random variable if the interest rate is determined exogenously, but becomes a random variable when the central policy depends on random system variables. Were it not for this transparency the derivation would be vulnerable to the Lucas critique.

To see that the exponent in (44) is close to zero, assume a hypothetical perfect-foresight steady state with constant inflation π and constant growth γ, so that the interest rate can be written

     i = ρ + π + σγ

Substituting for ρ in (44), the exponent can be written

     (it − i) − σ(Δct+1 − γ) − (πt+1 − π)

This is close to zero by the definition of what is meant by a steady state, with γ interpreted as the steady-state increase in (log) consumption. We are therefore justified in approximating the exponential in (44) with the first two terms in its Taylor series, exp(x) ≈ 1 + x. Using this and the fact that Et[xt] = xt in (44) yields

(9)      ct = Et[ct+1] − (1/σ)(it − Ett+1] − ρ)

This is how Gali writes it. I'll call this two-term Taylor series Approximation 2. In the general, the procedure of (1) taking logs and (2) using "a first-order Taylor series expansion about a point (usually a steady state)" is called "log linearization" (Eric Sims, Notes, Graduate Macro II, Spring 2010, Notre Dame). It's a standard tool in modern macro, especially in formalisms like the NKM.

In the framework so far the household has no explicit motive motive for holding money balances. In some cases Gali will "postulate a demand for real balances" given by

(10)      mt − pt = yt − ηit

"A money demand equation similar to (10) can be derived under a variety of assumptions. For instance, in §2.5 it is derived as an optimality condition for the household when money balances yield utility."

This is an equation in log variables, what Gali calls log-linear form. mt is described in §2.5 as representing the (log of) "holdings of money in period t". Therefore mt − pt = log(Mt/Pt) is thought of as the postulated real demand for money, being normalized by price. y is production, defined below, which is equal to c in equilibrium, and η ≥ 0 "denotes the interest semi-elasticity of money demand". This can be interpreted as meaning that the demand for money balances (which yield no interest) varies inversely with the interest rate. This is because keeping a cash balance costs the interest that would be lost by investing in bonds. This demand function for real real balances is derived in §2.5.1 assuming a separable household utility function that includes dependence on mt.

2.2 Firms

A representative firm is assumed with the production function (at time t)

(11)      Yt = AtNt1 − α

Here the absence of capital as a factor of production is a significant departure from any of our models, and it will be interesting to see exactly how capital has been or could be incorporated in this and the following models in the general DSGE framework. This extension is discussed in §8.1 ("Extensions"). This is also the point at which Gali introduces an explicit random variable, A, which is assumed to "evolve exogenously".

"Each period the firm maximizes profit", with the price of CG and wage considered given:

(12)      PtYt − WtNt

And here is another departure, the firm's maximization of profit---in contrast with trying to maintain an optimal split between labor and capital with a budget established by income. Question: are these two principles of action in any way equivalent?

It's simple enough to substitute (11) in (12) and set the derivative with respect to N to zero. Gali interprets the result as the condition when the marginal cost of hiring labor to produce a unit of CG, W/YN, is equal to P.


(13)      Wt/Pt = A(1 − α)AtNt− α

"In log-linear terms"

(14)      wt − pt = at − αnt + log(1 − α)

This can be interpreted as a labor demand schedule.

This is a companion to the labor supply schedule (8). Gali calls the LHS, w − p, the "real wage".

2.3 Equilibrium

"The baseline model abstracts away from aggregate demand components like investment, government purchases, or net exports. Accordingly the goods market clearing condition is given by"

(15)      yt = ct

Combining the (intratemporal) household and firm optimality conditions (8) and (14) with the equilibrium condition (5) and the log-linear production function

(16)      yt = at + (1 − α)nt

yields the equilibrium levels of employment and output:

(17)      ntequil = ψnaat + ϑn
(18)      ytequil = ψyaat + ϑy

where the constants ψ and ϑ are simple functions of α, σ, and φ. There is no general assumption of stationarity, so the expectations can depend on t. It turns out that in equilibrium log output is an increasing function of the technology constant because ψya > 0.

I checked this algebra.

Note that the equilibrium presently computed does not depend on the investment discount Q (or i ≡ −log Q), or household utility discount β (or ρ ≡ −log β).

We can now use the intertemporal condition (9) to derive the "implied real interest rate", defined by

(21, rearranged and used here as a definition, not the "Fisherian equation")      rt ≡ it − Ett+1]

This makes sense because it is the interest rate stemming from the bond discount rate, and πt+1 is the increase in CG price from period t to period t+1, a measure of inflation. This equation defines the real interest rate in this development and is called the Fisher equation; it's actually an approximation valid for small rates, see the wikipedia entry for the Fisher equation. The exact equation is actually (1 + rt) = (1 + it)/(1 + Ett+1]), because the value one period in the future is increased at the nominal interest rate but discounted by inflation.

At equilibrium with yt = ct, (9) becomes

(19)      rtequil = ρ + σEt[Δyt+1] = ρ + σψyaEt[Δat+1]

where the latter equation uses the equilibrium value (18) for y.

The real wage ωt ≡ wt − pt is, using (14),

(20)      ωtequil = ψωat + ϑω

where the constants ψ and ϑ are simple functions of α, σ, and φ.

This is an important conclusion from the RBC model: ...the equilibrium dynamics [sic] of employment, output, and the real interest rate are determined independently of the monetary policy. In other words, monetary policy is neutral with respect to those variables. ...output and employment fluctuate in response to variations in technology, which is assumed to be the only real driving force. In particular, output always rises in the face of productivity increase...[t]he same is true for the real wage. On the other hand, the sign of the employment is ambiguous, depending on whether σ (which measures the strength of the wealth effect of labor supply) is larger or smaller than 1."

The emphasis is Gali's. On the other hand, the equilibrium values of nominal variables, like inflation, nominal interest rate, and nominal prices, are not determined uniquely by real forces---for that we need to specify the monetary policy. Adding a utility for holding money balances implies that the money supply is also indeterminate. Examples of monetary policy rules come next.

Note on the case of zero expected inflation in the CMM: zero inflation implies that the expected nominal interest rate itequil is equal to the expected real interest rate rtequil [eq. (21)]. Furthermore, the latter is determined by the real system parameters [eq. (19)], including the expected value of the first difference of the driving technology shocks. Thus, stipulating zero expected inflation determines the equilibrium nominal interest rate. In the case of zero secular growth, Et[Δat+1] = 0, and (19) shows that then this equilibrium nominal interest rate is ρ.

2.4 Monetary Policy and Price Level Determination

2.4.1 An Exogeneous Path for the Nominal Interest Rate  → indeterminate equilibrium

In this case we choose it to be an exogenous stationary process with mean ρ, "which is consistent with a steady state with zero inflation and no secular growth".

Yes, my discussion immediately above shows that the equilibrium nominal interest rate with no inflation and no growth is in fact ρ, and that justifies this claim.

Constant interest is a special case. From the definition of real interest rate

     Ett+1] = it − rt

and rt is determined independently of the monetary policy. Thus, expected inflation is determined, but not the inflation itself. Any path that satisfies

     pt+1 = pt + it − rt + ξt+1

will be consistent with the equilibrium, where ξt+1 is any zero-mean random variable, called sunspot shocks. In this system "nonfundamental factors may cause fluctuations in one or more variables" and an equilibrium with this property is called an indeterminate equilibrium. In particular, in the CMM an "exogenous nominal interest rate leads to price level indeterminacy".

2.4.2 A Simple Inflation-Based Interest Rate Rule  → sometimes determinate and sometimes indeterminate equilibrium

The central bank adjusts the nominal interest rate according to the rule

     it = ρ + φππt

where φπ ≥ 0. Substituting it = Ett+1] + rt this can be written

(22)      φππt = Ett+1] + (rt − ρ)

Summary

Case 1: φπ > 1. (22) has only one stationary solution, that is, "a solution that stays in the neighborhood of the steady state". An explicit solution for πt is obtained, and price levels are fully determined as a function of the path of the real interest rate and hence, by (19) as a function of fundamentals. The larger φπ the smaller the impact of the real shock on inflation.

Case 2: φπ < 1. This case is similar to that in §2.4.1, with an exogenous nominal rate. The path will be consistent with sunspot shocks, and price levels and inflation are not determined uniquely. This is an example of

The Taylor principle and corresponding Taylor rule: The central bank must adjust nominal interest rates more than one-for-one in response to inflation to bring about a determinate equilibrium.

The wikipedia entry for Taylor rule is very interesting, especially the section "Empirical relevance". According to the wikipedia page, Taylor proposed the rule in 1993, and Dale W. Henderson and Warwick McKibbin did so simultaneously.

2.4.3 An Exogenous Path for the Money Supply

In §2.4.3 "the central bank sets an exogenous path for money supply {mt}" using the postulated money demand (10), then determines the nominal interest rate. The results are

2.4.4 Optimal Monetary Policy

We note that in the baseline CMC monetary policy does not affect real variables, but does affect nominal variables like prices. But because the utility of the household is a function only of consumption and hours worked---two real variables---no monetary policy is better than any other (in terms of utility). This result is clearly "extreme and empirically unappealing", but can be "overcome" by adding monetary wealth (money balances) to the household utility, which is done in the next section. However, the overall assessment of the CMC "cannot be positive". It fails because That leads to the introduction of "nominal frictions" and the Basic New Keynesian Model in chapter 3. The rest of chapter 2 is concerned mainly with the introduction of money balances in the utility.

2.5 Money in the Utility Function

Optimality conditions are derived for the utility function with cash balances; this amounts to adjusting the flow budget constraint to accommodate the change in the model [eq. (27)], and the derivation of one new condition [eq. (28)]. Two examples with more specific assumptions on the form of the utility function follow in the next two sections.

2.5.1 An Example with Separable Utility

Note that the Taylor series applied to log(1−exp(−it)) in (29) is about some value i that is not defined---in order to get a linear rather than logarithmic dependence on it. It is evidently an equilibrium value for the rate it, but i don't see why such an equilibrium value must exist. In any event, this example case leads to the "conventional linear demand for real balances":

(31)      mt − pt = ct − ηit = yt − ηit

and this is used later in the book without explicit citation.

The example ends with "As in the analysis of the cashless economy, the usefulness of (30), or (31), is confined to the determination of the equilibrium values for inflation and other nominal variables whenever the description of monetary policy involves the quantity of money in circulation. Otherwise, the only use of the money demand equation is to determine the quantity of money that the central bank will need to supply in order to support, in equilibrium, the nominal interest rate implied by the policy rule."

I interpret this as meaning that when the policy rule depends on the money supply, (31) can be used to determine the rate path, and hence a derivation like that in §2.4.2, Case 1 will determine the nominal rate and inflation.

2.5.2 An Example with Nonseparable Utility

About five pages of analysis, which I will skip for now. A main conclusion, on p. 30: "Condition (39) points to a key implication of nonseparability (ω ≠ 0): Equilibrium output is no longer invariant to monetary policy, at least to the extent that the latter implies variations in the nominal interest rate."

2.5.3 Optimal Monetary Policy in a Classical Economy with Money in the Utility Function

"The problem facing a hypothetical social planner seeking to maximize the utility of the representative household is presented and solved."

This leads to Friedman's rule, it = 0 for all t, mentioned in §1.1. This "policy implies an average (steady state) rate of inflation π = −ρ < 0, i.e., prices will decline on average at the rate of time preference."

There follows some discussion of implementing Freidman's rule, then Notes on the Literature, and some exercises.



Chapter 3 The Basic New Keynesian Model

This involves two changes in the CMM: "The resulting framework...has become in recent years the workhorse for the analysis of monetary policy, fluctuations, and welfare."

3.1 Households

The utility function is identical to that in the CMM model, (2.1), where consumption is now replaced by the aggregate consumption index

     Ct ≡ (∫ Ct(i)(ε−1)/ε di)ε/(ε−1)

where we assume a continuum of goods indexed by i ∈ [0,1], and the integral is over that continuum. Note the parsimonious notation, which can be confusing. The point, though, is to define variables like Pt and Ct so that when possible the equations become formally identical to those in chapter 2. The ε is not discussed, but this seems to be a standard way to model diminishing utility of commodities wrt to other commodities (with ε > 1); when ε = 2, for example, this is

     Ct ≡ (∫ Ct(i)1/2 di)2

The household seeks to maximize this utility subject to the budget constraint corresponding to (2.2), except the cost-of-consumption is now appropriately averaged:

     ∫ Pt(i)Ct(i)di + QtBt    ≤    Bt−1 + WtNt − Tt,      t = 0, 1, 2 ...,

where P(i) is the price of good i. We also assume the same solvency constraint as in chapter 2, which is that the limit of the expected bond holdings at infinity is nonnegative for every t.

Of course there's no need for a continuum of goods. It's not realistic and it just makes the optimization steps harder to justify rigorously. For example, we want to say that

     (d/dCt(j)) ∫ Pt(i)Ct(i)di = Pt(j)

I think it would be better to make the integration a finite sum, and I'll think of it that way. In Gali 11, which extends the NKM so that it models unemployment, he introduces a two-dimensional continuum of members of a representative household. It's the convention in this field.


The household now has an additional decision to make: it "must decide how to allocate its consumption expenditures among the different goods." This is done in Appendix 3.1 by maximizing the Lagrangian

     Ct − λ(∫ Pt(i)Ct(i)di − Zt)

This is to maximize utility subject for a fixed expenditure level Zt. Differentiating wrt Ct(i) gives us the first-order conditions

     Ct−1/ε(i)Ct1/ε = λPt(i)

Thus, for every i and j

     λ = Ct−1/ε(i)Ct1/ε/Pt(i) = Ct−1/ε(j)Ct1/ε/Pt(j)      ∀ i, j

or, assuming Ct ≠ 0,

     λ = Ct−1/ε(i)/Pt(i) = Ct−1/ε(j)/Pt(j)      ∀ i, j

Writing Ct(i) in terms of Ct(j) and substituting in the constraint that expenditures ∫ Pt(i)Ct(i)di = Zt,

     Ct(j) = (Pt(j)/Pt)−ε(Zt/Pt)      ∀ j

where Pt is the aggregate price index, defined by

(A)      Pt ≡ (∫ Pt(i)1−εdi)1/(1−ε)

Substituting this in the definition of the aggregate consumption index Ct yields

(B)      Zt = ∫ Pt(i)Ct(i)di = PtCt

That is, the total expenditure for consumer goods, defined as ∫ Pt(i)Ct(i)di, is the product of the aggregate price index and the aggregate consumption index. Note that this result depends on the optimality condition maximizing utility. Combining (A), (B) and using the definition of Zt gets to the main result of Appendix 3.1, which is the demand schedule, the consumption as a function of price:

(1)      Ct(i) = (Pt(i)/Pt)−εCt

I checked all the algebra in Appendix 3.1. Note that this is the equality condition in Holder's inequality. Write (B) as

(C)      ∫ Pt(i)Ct(i)di = (∫ Pt(i)μdi)1/μ (∫ Ct(i)νdi)1/ν

where μ = 1−ε, ν = (ε−1)/ε, and 1/μ+1/ν = 1. The N&S condition for equality is the linear dependence of Pt(i)μ and Ct(i)ν, which is just equation (1).


Now the budget constraint can be written in a way formally identical to (2.2).

For some reason the sign of the term Ti in the budget constraint is "−" in in chapter and "+" here in chapter 3. It represents "lump-sum additions or subtractions to period income", and as an "addition to income" on the RHS of (2.2) should have a "+" sign, so the "− is evidently a typo in chapter 2. It hasn't been used yet, so it hardly matters.

The derivation of the optimization conditions (2.4) and (2.5) is exactly the same, and, in fact, the rest of the derivation in chapter 2 now follows, up to the treatment of firms in §2.2, in the same way, given that the starting points are identical. In particular, the same separable utility function leads to the same conditions (2.8) and (2.9).

3.2 Firms

"Assume a continuum of firms indexed" in the same way as the goods (a finite set would be better here as well, in my opinion)

(5)      Yt(i) = AtNt(i)1−α

The level of technology At is "assumed to be common to all firms and to evolve exogenously over time". "All firms face an identical isoelastic demand schedule given by (1), and take the aggregate price level Pt and the aggregate consumption index Ct as given."

Now come staggered prices, which is due to Calvo (1983) and works like this: "Each firm may reset its price only with probability 1−θ in any given period, independent of the time elapsed since the last adjustment. Thus, each period a measure 1−θ of producers reset their prices, while a fraction θ keep their prices unchanged. As a result, the average duration of a price is given by (1−θ)−1. In this context, θ becomes a natural index of price stickiness."

The convention here is that if a firm sets its price at period t, it gets its first opportunity to reset it at period t+1. Therefore the average life is 1⋅(1−θ)+2⋅θ⋅(1−θ)+3⋅θ2(1−θ)+... = (1−θ)⋅( 1+2⋅θ+3⋅θ2+...) = (1−θ)−1.

Note that the goods and firms are "differentiated" (so that they can be treated differently), but not "heterogenous", since the technology constant is the same across all firms. It is pointed out in §3.2.1 that "...all firms will choose the same price because they face an identical problem." This refers to the new price Pt* for those prices that are reset from Pt−1. As we go from step t to t+1, some prices are preserved and some are reset, so in general at any step there will be a distribution of prices.

Interpretation of the continuum of labor    The consumption and price, Ct(i) and Pt(i), are clearly functions on [0,1], and there should be no problem defining the corresponding aggregate indices in the manner above. The labor, Nt(i), however, seems to present the problem that production needs to be defined in a way consistent with the notation and equations in the CMM (Chapter 2). The aggregate labor at time t is (as on p. 46: "Market clearing in the labor market requires")

(D)      Nt = ∫ Nt(i)di

This must be the Nt used in the optimality conditions that are formally identical to those in Chapter 2 (pp. 42-43). In particular, the cost of labor must be written in the form WtNt for the balance equation corresponding to (2.2), and the form of the household utility must use this Nt. Let's examine the production that follows from Gali's definitions. From (5), the production corresponding to firm i is

(E)      Yt(i) = AtNt(i)1−α

and therefore the aggregate output of goods is

(F)      Yt ≡ (∫ Yt(i)(ε−1)/ε di)ε/(ε−1) = (∫ [AtNt(i)1−α](ε−1)/ε di)ε/(ε−1) = At(∫ [Nt(i)1−α](ε−1)/ε di)ε/(ε−1)

using as the definition the same norm as consumption, as in §3.3, p. 45. This would seem to be required by the equilibrium condition. Take as an example the case α = 1/2 and ε = 2:

(G)      Yt = At(∫ Nt(i)1/4 di)2

Without worrying for now about the interpretation of Nt(i), I'd like to try to find a clear question or problem with this formulation. A simple observation is that there can be many different Nt(i) with the same aggregate labor Nt (D), but with different aggregate outputs (F or G).

This seems to be the same point mentioned in Christiano et al. 2010 [C10], p. 294: "A notable feature of the New Keynesian model is the absence of an aggregate production function. That is, given information about aggregate inputs and technology, it is not possible to say what aggregate output, Yt, is. This is because Yt also depends on how inputs are distributed among the various intermediate good producers."

It may be that the only "labor" that matters in the analysis in Chapter 3 (NKM) is Nt, since Gali bends over backwards to press the new analysis into the same mold as Chapter 2 (CMM). Thus, there seems to be no solution given for anything like the Nt(i), and that would explain the apparent vagueness of the continuum-of-firms model---it's a fiction designed to derive the price dynamics in the next section, which seems to be good enough for Gali's purposes. I'll check this and try to confirm with other sources.


3.2.1 Aggregate Price Dynamics

Begin with Appendix 3.2. Calculate the aggregate price from the definition, letting S = the set of firms not reoptimizing:

     Pt = [∫S Pt−1(i)1−εdi + (1−θ)(P*t)1−ε ]1/(1−ε) = [θ(Pt−1)1−ε + (1−θ)(P*t)1−ε ]1/(1−ε)

The second equality,

     ∫S Pt−1(i)1−εdi = θ(Pt−1)1−ε

follows from the fact that the distribution of prices among firms not adjusting prices in period t−1 is the same as the distribution of all prices in that period. Dividing both sides by Pt−1 and defining Πt ≡ Pt/Pt−1,

(6 and 34)      Πt1−ε = θ + (1−θ)(P*t/Pt−1)1−ε

Notice that in a steady state with zero inflation (Π = 1), P*t = Pt−1 = Pt, ∀ t. (p. 62)

"Furthermore, a log-linear approximation to the aggregate price index around that steady state yields" (p. 44)

(7)      πt = (1−θ)(p*t − pt−1)

Call this Approximation 3, which, like Approximation 2, assumes we are near equilibrium. To derive eq. (7) from (6) take the log of (6)

(4)      (1−ε)πt = log(θ + (1−θ)(P*t/Pt−1)1−ε) = log[1 + (1−θ)((P*t/Pt−1)1−ε − 1) ] ≈ (1−θ)[(P*t/Pt−1)1−ε − 1]

since the last log is of the form log(1+x) for small x. Divide by (1−ε):

(H)      πt ≈ ((1−θ)/(1−ε))[(P*t/Pt−1)1−ε − 1]

Letting R = (P*t/Pt−1) and r = log R, (H) is

(I)      πt ≈ ((1−θ)/(1−ε))[R1−ε − 1] = ((1−θ)/(1−ε))[er(1−ε) − 1] ≈ ((1−θ)/(1−ε))[r(1−ε)] = (1−θ)r = (1−θ)(p*t − pt−1)

which is the same as (7). I'm not sure there isn't a simpler, one-step derivation that follows the log-linear approximation pattern in Sims more closely; this one uses two power series approximations, one after another. Note once more that this again assumes we are near equilibrium.


"[Equation (7)] makes it clear that, in the present setup, inflation results from the fact that firms reoptimizing in any given period choose a price that differs from the economy's average price in the previous period."

3.2.2 Optimal Price Setting

When a firm reoptimizes in this model, it chooses the price P*t to maximize "the current market value of the profits generated while that price remains effective. Formally, it solves the problem"

(J)      maxP*t Σ θk Et{Qt,t+k[P*tYt+k|t − Ψt+k(Yt+k|t )]}

subject to the demand constraints (from (1))

(8)      Yt+k|t(i) = (P*t(i)/Pt+k)−εCt, ∀ k = 0, 1, ...

where

(K)      Qt,t+k ≡ βk (Ct+k/Ct)−σ(Pt/Pt+k)

"is the stochastic discount factor for nominal payments."

(L)      Ψ(⋅)

"is the cost function"

and

(M)      Yt+k|t

"denotes output in period t+k for a firm that last reset its price in period t."

The fraction of firms retaining the price P* at step k is θk, which accounts for the factor θk. To explain Qt,t+k, notice the similarity between (K) and (2.7), which is derived from the intertemporal optimality condition:

(2.7)      Qt = βEt[(Ct+1/Ct)−σ(Pt/Pt+1)] = Et[ β(Ct+1/Ct)−σ(Pt/Pt+1)]

Qt,t+k is therefore the price at time t of a bond that pays off one unit of money at time t+k, and so represents the (stochastic) measure of how much future payments in nominal units should be discounted.


Substituting the constraints (8) into (J) and setting the derivative with respect to P*t to zero yields the first-order optimality condition

(9)      Σ θk Et{Qt,t+kYt+k|t[P*t − Mψt+k]} = 0

where ψt+k|t = Ψ 't+k(Yt+k|t ) is the marginal cost for a firm that last reset its price in period t, and M = ε/(ε−1). Notice that Gali suppresses the argument i of P*t and Yt+k|t ; this optimality condition must hold for all i∈[0,1]. (I checked the algebra.)

Note that in the limiting case of no price rigidities (θ = 0), the previous condition collapses to the familiar optimal price-setting condition under flexible prices (see §2.2)

     P*t = M ψt|t

which allows us to interpret M as the desired markup in the absence of constraints on the frequency of price adjustment. Henceforth, M is referred to as the desired or frictionless markup.


We linearize this around the zero-inflation steady state. To do this, rewrite it in terms of variables that have a well-defined value in that steady state. Dividing by Pt−1 gets us

(10)      Σ θk Et{Qt,t+kYt+k|t [P*t/Pt−1 − M MCt+k|tΠt−1,t+k]} = 0

where MCt+k|t ≡ ψt+k|t/Pt+k is the real marginal cost, and Πt,t+k ≡ Pt+k/Pt. I think this means that Πt−1,t+k ≡ Pt+k/Pt−1, and I use that below.

In the zero inflation steady state, P*t/Pt−1 = 1 and Πt−1,t+k = 1. Furthermore, constancy of the price level implies that P*t = Pt+k in that steady state, from which it follows that Yt+k|t = Y and MCt+k|t = MC, because all firms will be producing the same quantity of output. In addition, Qt,t+k = βk must hold in that steady state. Accordingly MC = 1/M. A first-order Taylor expansion of (10) around the zero inflation steady state yields

(11)      p*t − pt−1 = (1 − βθ) Σ (βθ)k Et{ mct+k|t + (pt+k − pt−1)}

where mct+k|t ≡ mct+k|t − mc denotes the log deviation of marginal cost from its steady state value mc = −μ, and where μ ≡ log M is the log of the desired gross markup (which, for M close to one, is approximately equal to the net markup M−1).


To derive (11) from (10), write (10) as

(N)      Σ θk Et{Qt,t+kYt+k|t f(x,y,z)} = 0

where x = P*t/Pt−1, y = Πt−1,t+k ≡ Pt+k/Pt, z = M MCt+k|t, and f(x,y,z) = Qt,t+kYt+k|t[x − zy]. From the preceding, the equilibrium is at (x*,y*,z*) = (1,1,1), and f(x*,y*,z*) = 0. We use the three-variable Taylor series like the two-variable series in Sims,

(O)      f(x,y,z) ≈ f(x*,y*,z*) + fx(x*,y*,z*)(x − x*) + fy(x*,y*,z*)(y − y*) + fz(x*,y*,z*)(z − z*) = fx(1,1,1)(x − 1) + fy(1,1,1)(y − 1) + fz(1,1,1)(z − 1) = 0

For x near 1, x−1 ≈ log(x), and similarly for y and z, and therefore

(P)      f(x,y,z) ≈ fx(1,1,1)(p*t − pt−1) + fy(1,1,1)(pt+k − pt−1) + fz(1,1,1) log M MCt+k|t

At this point we set Qt,t+k = βk and Yt+k|t = Y (the equilibrium output), The equilibrium values, as mentioned above. Note that this ignores the dependence of these variables on the price---I guess because the dependence of (10) on Q and Y near equilibrium will be second order, but I'm not sure of this reasoning right now. With this assumption, the evaluated partials in (P) are

(Q)      fx(1,1,1) = βkY,      fy(1,1,1) = −βkY,      fz(1,1,1) = −βkY,

and (P) is

(R)      f(x,y,z) ≈ βkY (p*t − pt−1) − βkY (pt+k − pt−1), − βkY log M MCt+k|t

Then (10) becomes

(S)      Σ (βθ)k Et{ (p*t − pt−1) − (pt+k − pt−1) − log M MCt+k|t } ≈ 0

or

(T)      Σ (βθ)k Et{ (p*t − pt+k) − log M MCt+k|t } ≈ 0

First, we'll ignore the ignore M part of the sums and work on the P part. Using the fact that Et{p*t} = p*t,

(U)      −p*t/(1 − βθ) + Σ (βθ)k Et{ pt+k) } ≈ 0

Adding and subtracting pt−1/(1 − βθ), and then multiplying by (1 − βθ):

(W)      −(p*t − pt−1) + (1 − βθ) Σ (βθ)k Et{ pt+k − pt−1 } ≈ 0

Which, together with

(X)      log M MCt+k|t = mct+k|t ≡ mct+k|t − mc

yields (11) from (10). Recall that mc = −μ, and μ = log M is the log of the desired gross markup.


To get some insight into the meaning of (11), rearrange it as

(Y)      p*t = μ + (1 − βθ) Σ (βθ)k Et{mct+k|t + pt+k},

which can be interpreted as "...firms resetting their prices will choose a price that corresponds to the desired markup over a weighted average of their current and expected (nominal) marginal costs, with the weights being proportional to the probability of the price remaining effective at each horizon θk."

3.3 Equilibrium

"Market clearing in the good market requires condition is Yt(i) = Ct(i) for all i∈[0,1] and all t." Note that, with aggregate output index as already defined above,

(F)      Yt ≡ (∫ Yt(i)(ε−1)/ε di)ε/(ε−1)

market clearing then implies that Yt = Ct for all t. Combining this with the "consumer's Euler equation" (2.9) (just replace c by y) yields

(12)      yt = Et[yt+1] − (1/σ)(it − Ett+1] − ρ)

As mentioned above, "Market clearing in the labor market requires"

(D)      Nt = ∫ Nt(i)di

Using (5) (the Cobb-Douglas production function), this becomes

(Z)      Nt = ∫ (Yt(i)/At)1/(1−α) di

and using goods market clearing, Yt(i) = Ct(i), and eq. (1),

(AA)      Nt = (Yt/At)1/(1−α) ∫ (Pt(i)/Pt)−ε/(1−α) di.

Taking logs,

(BB)      (1 − α)nt = yt − at + dt

where

(CC)      dt ≡ (1 − α) log ∫ (Pt(i)/Pt)−ε/(1−α) di

"is a measure of price (and, hence, output) dispersion across the firms." Appendix 3.3 is devoted to showing that dt is zero up to a first approximation in a neighborhood of the steady state. To go over this, rewrite the definition of price index, eq. (A), and use a second-order Taylor's series assuming that we are near the zero-inflation steady state:

(DD)      1 = ∫ (Pt(i)/Pt)1−ε di = ∫ exp[ (1−ε) (pt(i) − pt)] di ≈ 1 + (1−ε) ∫ (pt(i) − pt) di + (1/2)(1−ε)2 ∫ (pt(i) − pt)2 di

or, to second order,

(EE)      pt ≈ Ei[pt(i)] + (1/2)(1−ε) ∫ (pt(i) − pt)2 di.

Notice the notation Ei[pt(i)], which is defined to be ∫ pt(i) di, the "cross-sectional mean of (log) prices".

This approximation requires that (pt(i) − pt) be small. Why is this true? Continue with appendix 3.3.




Here is a start at making a table comparing the FM and NKM.

Flow Model Basic New Keynesian Model
Household, consume/save split savings ∝ i optimizes expected utility, disutility for working
Household, consume/work split fixed labor resource employed optimizes expected utility, disutility for working
CG Firm, capital/labor split optimizes productivity for a given budget no capital factor, optimizes profit for given price and wage
CG Firm, price competitive market monopolistic competition, posted price optimizes expected utility, sticky prices
nominal interest rate i endogenous, iteratively adjusted based on demand and supply of available funds exogenous, determined by central bank policy
financing for capital credit determined by available funds from savings no financing of capital
shocks in basic model monetary policy technology, monetary policy


Christiano, Trabandt, Walentin (2010) mention four possible orthogonal shocks in their review: technology, labor supply, Phillips curve, and monetary policy.