
Time Series Regression - Econometrics - Lecture Notes, Study notes of Econometrics and Mathematical Economics

Time Series Regression, Serial Correlation, Unit Root, Time Trend, Unit Root Test with Linear Trend, Prediction, Prediction of Trend, Spurious Regression, and Residuals are the topics covered in this Econometrics lecture.

Typology: Study notes
2011/2012
Uploaded on 11/10/2012 by uzman


Time Series Regression

Serial Correlation and Unit Root

Time series data are obtained from observations for different time periods. Examples are the unemployment rate, the inflation rate, GDP, stock prices, etc. Another common type of data is cross-sectional data, which is obtained from observations on different individuals, states, cities, or countries for the same time period.

Many time series are readily available from various websites, so we have many chances to use time series in regression analysis.

Time series data are denoted by { y1, y2, ..., yT }. The subscript denotes the time period, and T is the total number of observations. Although this is the more popular notation, sometimes n is used instead of T to denote the sample size.

The most distinctive characteristic of time series data is serial correlation: the value of the next period's observation depends on the value of the current period's observation. This is very different from cross-sectional data, where all the observations are independent of each other. (There is no such thing as an order of observations in cross-sectional data.)

For example, if individual income data are obtained for different people, they form a cross-sectional data set, and it is safe to assume that the individual incomes are independent of each other. But we can instead imagine the income data of the same person observed over time. Then the next period's income will certainly depend on the current period's income --- namely, the data are serially correlated.

It is useful to model the serial correlation of time series data. The simplest model, but still a very popular one, is the autoregressive process:

yt = ρ yt − 1 + ε t ,

where εt is the unobserved disturbance term. We typically assume that the error terms { ε1, ε2, ε3, ..., εT } are an independent, mean-zero process.

In the above model, the value of ρ indicates the degree of serial correlation. A higher ρ value signifies higher correlation across time periods. For example, if ρ = 0, then yt = εt, so the series is serially independent. If ρ is positive, the series is positively correlated: if yt-1 is big, yt tends to be big. If ρ is negative, the series is negatively correlated: if yt-1 is big, yt tends to be small.

The following picture shows the movement of four series with ρ = 0, 0.5, 1, and -0.5. In any time series that I know of, the value of ρ does not exceed 1 in absolute value; if ρ were greater than one, the series would explode as the time period extends. As is seen in the picture, a time series is more jagged for smaller values of ρ: the series is most jagged when ρ = -0.5 and smoothest when ρ = 1.
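Series like the four in the picture can be simulated directly from the recursion yt = ρ yt-1 + εt. A minimal sketch, assuming standard normal errors, y0 = 0, and an arbitrary seed and length (none of these values come from the notes):

```python
import numpy as np

def simulate_ar1(rho, T=100, seed=0):
    """Simulate y_t = rho * y_{t-1} + eps_t with y_0 = 0 and N(0, 1) errors."""
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal(T)
    y = np.zeros(T)
    for t in range(1, T):
        y[t] = rho * y[t - 1] + eps[t]
    return y

series = {rho: simulate_ar1(rho) for rho in (0.0, 0.5, 1.0, -0.5)}

# The sample first-order autocorrelation is close to rho in the stationary cases
# and close to 1 for the unit root case.
for rho, y in series.items():
    r1 = np.corrcoef(y[1:], y[:-1])[0, 1]
    print(f"rho = {rho:+.1f}: first-order autocorrelation = {r1:+.2f}")
```

Plotting these four paths reproduces the qualitative pattern described above: jagged for ρ = -0.5, smooth and wandering for ρ = 1.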

Two cases are practically the most interesting: ρ = 0, the serially independent case, and ρ = 1, which is called the unit root process.

The serially independent case is important because in regression analysis we hope that the errors are serially independent; and after we model a series in autoregressive form, as above, what we hope for again is that the error term (or disturbance term) is serially independent.

The unit root case is interesting because many time series in practice resemble a unit root process. The following is the scatter plot of the unemployment data we used in class.

[Figure: four simulated series with rho = 0, 0.5, 1, and -0.5; horizontal axis t = 0 to 100, vertical axis 0 to 8.]

Dependent Variable = D_UnRate
R Square: 0.
Observations: 435

             Coefficients   Standard Error   t Stat
Intercept    0.0613243      0.035216318      1.
UnRate(-1)   -0.009407      0.005618552      -1.

This is a lower-tail test because we exclude the possibility of β being greater than 1.

This regression is called the Dickey-Fuller regression, and the t-test based on it is called the Dickey-Fuller test, after Dickey and Fuller, who first conducted it. It is important to note that the distribution of the t-statistic in the Dickey-Fuller regression is not normal. It follows the Dickey-Fuller distribution, so named in their honor. The critical value of the Dickey-Fuller distribution at the 5% significance level is -2.86, which is much smaller than the -1.64 lower-tail critical value of the standard normal distribution.

In the above example, the t-statistic in the Dickey-Fuller regression is greater than -2.86, so we cannot reject the null hypothesis of a unit root, and we conclude that the unemployment rate follows a unit root process.
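The Dickey-Fuller regression in the table (D_UnRate on a constant and UnRate(-1)) is an ordinary least squares regression; only the critical value changes. A minimal numpy sketch of the t-statistic computation, run on a simulated random walk since the actual unemployment series is not reproduced here:

```python
import numpy as np

def dickey_fuller_t(y):
    """t-statistic on y_{t-1} in the regression of dy_t on (1, y_{t-1})."""
    dy = np.diff(y)
    X = np.column_stack([np.ones(len(dy)), y[:-1]])
    b, *_ = np.linalg.lstsq(X, dy, rcond=None)
    resid = dy - X @ b
    s2 = resid @ resid / (len(dy) - X.shape[1])
    cov = s2 * np.linalg.inv(X.T @ X)
    return b[1] / np.sqrt(cov[1, 1])

rng = np.random.default_rng(1)
walk = np.cumsum(rng.standard_normal(435))   # simulated unit root series

# Reject the unit root null at 5% only if t < -2.86 (Dickey-Fuller),
# not if t < -1.64 (standard normal).
print(f"DF t-statistic: {dickey_fuller_t(walk):.2f}")
```

For a true random walk, the statistic is usually above -2.86, so the unit root null is typically (and correctly) not rejected.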

The above autoregressive process is called autoregressive of order one, or AR(1), in the sense that yt depends only on yt-1. We may imagine a case where yt depends on more than one lagged value of y, namely yt-1, yt-2, ..., yt-p, so we need to include yt-1, yt-2, ..., yt-p in the regression. Let the number of lags be p = 3 in the unemployment example, so we have

unempt = α + β 1 unempt − 1 + β 2 unempt − 2 + β 3 unempt − 3 +ε t

Now the unit root hypothesis is β 1 + β 2 + β 3 = 1, against β 1 + β 2 + β 3 < 1. Through re- parameterization, we have

unempt = α + ( β 1 + β 2 + β 3 ) unempt − 1 − β 2 ( unempt − 1 − unempt − 2 ) − β 3 ( unempt − 1 − unempt − 3 ) +ε t

Further re-parameterization gives

∆ unempt = α + ( β 1 + β 2 + β 3 − 1) unempt − 1 − ( β 2 + β 3 ) ∆ unempt − 1 − β 3 ∆ unemp t − 2 +ε t

where ∆ unempt = unempt − unempt − 1. Therefore, we run the regression of ∆unempt on the regressors (1, unempt-1, ∆unempt-1, ∆unempt-2), and use the t-statistic on the slope of unempt-1 for the unit root test. We obtain the following regression results:

Dependent Variable = D_UnRate
R Square: 0.
Observations: 436

              Coefficients   Standard Error   t Stat
Intercept     0.0729676      0.034271633      2.
UnRate(-1)    -0.011569      0.005463024      -2.
D_UnRte(-1)   0.1245459      0.04668557       2.
D_UnRte(-2)   0.2286078      0.046794642      4.

Note that if β 2 and β 3 are not zero, then the AR(1) model is misspecified, and the unit root test based on it is not reliable.

In the AR(3) model, the coefficients on the two lagged differences are estimated to be 0.12 and 0.23, and both are significant. Therefore, the new test based on the AR(3) model is more reliable. We now have a unit root test statistic of -2.12, which is still greater than -2.86, so we cannot reject the unit root hypothesis for the unemployment rate series.

For more conclusive evidence on whether the unemployment rate follows a unit root process, we should proceed to include more lags until the additional coefficients are estimated to be insignificant.
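The lag alignment is the only fiddly step in the augmented regression above. A sketch of the augmented Dickey-Fuller t-statistic with p lagged differences, again using a simulated series in place of the unemployment data:

```python
import numpy as np

def adf_t(y, p=2):
    """t-statistic on y_{t-1} in
    dy_t = a + b*y_{t-1} + c_1*dy_{t-1} + ... + c_p*dy_{t-p} + e_t."""
    dy = np.diff(y)
    n = len(dy)
    # Start at t = p+1 so that all p lagged differences exist.
    cols = [np.ones(n - p), y[p:-1]]
    for j in range(1, p + 1):
        cols.append(dy[p - j:n - j])          # dy_{t-j}
    X = np.column_stack(cols)
    target = dy[p:]
    b, *_ = np.linalg.lstsq(X, target, rcond=None)
    resid = target - X @ b
    s2 = resid @ resid / (len(target) - X.shape[1])
    se = np.sqrt(np.diag(s2 * np.linalg.inv(X.T @ X)))
    return b[1] / se[1]

rng = np.random.default_rng(2)
walk = np.cumsum(rng.standard_normal(436))
print(f"ADF t-statistic (2 lagged differences): {adf_t(walk, p=2):.2f}")
```

As in the notes, the resulting statistic is compared against -2.86, not -1.64.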

One important implication of the unit root hypothesis is that any shock (or error, or disturbance) to the time series, in this case to the unemployment rate, will remain in the series. We can see this from

yt = yt − 1 + ε t = yt − 2 + ε t − 1 + ε t = ... = ε t + ε t − 1 + ... + ε 1 (taking y0 = 0).

Namely, the current unemployment rate is the accumulation of all past shocks. This may not seem very plausible, but as we have seen from the autoregression results, the unemployment rate process is near unit root.
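With y0 = 0 this accumulation can be verified directly: building the series recursively and summing the shocks give exactly the same path.

```python
import numpy as np

rng = np.random.default_rng(0)
T = 200
eps = rng.standard_normal(T)

# Build y_t = y_{t-1} + eps_t recursively, starting from y_0 = 0 ...
y = np.zeros(T)
for t in range(1, T):
    y[t] = y[t - 1] + eps[t]

# ... and compare with the running sum of all past shocks.
assert np.allclose(y[1:], np.cumsum(eps[1:]))
print("y_t equals eps_1 + eps_2 + ... + eps_t for every t")
```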

Time Trend

Many time series are trended: as time goes on, they grow. Examples are GDP, stock prices, etc. For such series, it may be more appropriate to add a time trend to the model. The simplest model of a trended time series would be:

yt = α + γ t + ρ yt − 1 +ε t

Here, “t” is the time trend, so we expect the time series to grow by γ per period on average.
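A quick simulation of this model (the parameter values α = 1, γ = 0.05, ρ = 0.5 are illustrative assumptions, not from the notes). With |ρ| < 1, the long-run slope of the series works out to γ/(1 − ρ) per period:

```python
import numpy as np

def simulate_trended_ar1(alpha, gamma, rho, T=200, seed=0):
    """Simulate y_t = alpha + gamma*t + rho*y_{t-1} + eps_t with N(0, 1) errors."""
    rng = np.random.default_rng(seed)
    y = np.zeros(T)
    for t in range(1, T):
        y[t] = alpha + gamma * t + rho * y[t - 1] + rng.standard_normal()
    return y

y = simulate_trended_ar1(alpha=1.0, gamma=0.05, rho=0.5)

# The fitted linear trend slope should be near gamma / (1 - rho) = 0.1.
slope = np.polyfit(np.arange(len(y)), y, 1)[0]
print(f"fitted trend slope: {slope:.3f}")
```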

Example:

GDP grows exponentially. The following graph shows historical U.S. GDP data in 1996 U.S. dollars.

[Figure: U.S. GDP in billions of chained 1996 dollars, 1920 to 2020; vertical axis 0 to 10000.]

Suppose, for example, that we run a regression of the heights of tree A on the heights of tree B. Although they are independent, we will easily obtain a significant result. The reason is that the heights of the two trees share a common trend. The significance will disappear after we include the time trend as a regressor, provided both trees grow linearly in time. We may view this as an omitted variable problem, where the omitted variable is the time trend.

We include a time trend in the regression when we use a time series that shows a linear time trend.

Prediction:

Suppose that unemployment follows the unit root process unempt = unempt − 1 + ε t, and we want to predict future unemployment rates based on past unemployment rates. If we let the current period be T, unemployment in the next period, T+1, will be unempT + 1 = unempT + ε T + 1. Here, the error term (or disturbance term) ε T + 1 is something that occurs in the future, and it is the source of uncertainty. To predict the future, we need to model ε T + 1. The simplest assumption is that { ε 1, ε 2, ε 3, ..., ε T, ε T + 1, ... } follows an independent normal distribution with mean zero and variance σ^2. Then we can construct a confidence interval for the period T+1 unemployment rate.

We need to estimate σ^2. Since ε t = unempt − unempt − 1 = ∆unempt, the variance of ε can be estimated from the data { ∆unemp2, ∆unemp3, ..., ∆unempT } as

s^2 = [1/(T − 1)] Σ_{t=2}^{T} (∆unempt)^2

From the data, we have s = 0.18. The unemployment rate of March 2004 is 5.7%. Therefore, the 95% confidence interval for the April unemployment rate is (5.7 − 1.96×0.18, 5.7 + 1.96×0.18) = (5.35, 6.05).

What about the May unemployment rate? The error term for the May unemployment rate is ε T + 1 + ε T + 2, which is normally distributed with mean zero and variance 2σ^2, so the standard deviation is √2 σ. It is estimated by √2 s = 0.255. Therefore, the 95% confidence interval is

(5.7 − 1.96×0.255, 5.7 + 1.96×0.255) = (5.2, 6.2). Adding a few more periods, we can tabulate the 95% confidence intervals for the future unemployment rates as follows:

DATE         Low    High
3/1/2004     5.70   5.70
4/1/2004     5.35   6.05
5/1/2004     5.20   6.20
6/1/2004     5.09   6.31
7/1/2004     4.99   6.41
8/1/2004     4.91   6.49
9/1/2004     4.84   6.56
10/1/2004    4.77   6.63
11/1/2004    4.70   6.70
12/1/2004    4.64   6.76

Following the AR(1) unit root model, this December's unemployment rate will lie in the interval (4.64%, 6.76%) with 95% confidence.
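The table can be reproduced from the two numbers quoted above, since the h-step-ahead error εT+1 + ... + εT+h has estimated standard deviation √h·s:

```python
import math

s = 0.18         # estimated std. dev. of monthly changes, from the notes
current = 5.7    # March 2004 unemployment rate, in percent

for h in range(1, 10):                     # April (h = 1) ... December (h = 9)
    half_width = 1.96 * s * math.sqrt(h)   # 95% half-width grows like sqrt(h)
    print(f"h = {h}: ({current - half_width:.2f}, {current + half_width:.2f})")
```

For h = 1 this gives (5.35, 6.05), and for h = 9 it gives (4.64, 6.76), matching the intervals in the text.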

We can present the above prediction in a graph:

[Figure: Unemployment Rate Prediction (95% Interval); horizontal axis Time, mid-2002 through early 2005; vertical axis Unemployment Rate (%), 0 to 8.]

The prediction provided above is very preliminary. Government officials and researchers use more complicated models to predict the unemployment rate.

However, it is important to remember that we have to deal with future error terms that grow bigger the farther into the future we look. All we did was estimate the standard deviation of these errors using the historical data. We then assumed that these errors follow a normal distribution, so we used 1.96 as the 95% z-score.

We may find some variables that are helpful in predicting these errors. If we find relevant information or data that are correlated with these future errors, we can reduce the variance of our prediction, and so better predict the future. It is rather an art to find information and data that are correlated with the future error term of the unemployment rate. But we can verify that they are useful in predicting the errors using the historical data. What Alan Greenspan and other government economists do is find the variables that are helpful in prediction and incorporate them into the prediction.

Prediction of Trend

For trended time series, it is much more important to predict the trend than the stochastic error terms.

As we saw in the graph for the log of GDP, ln(GDP) has a linear trend. The average growth rate is about 3.36% and the standard deviation is 5%. The log of 2003 U.S. GDP (in billions of dollars) is 9.25. The log of 2004 GDP is predicted to be 9.25 + 0.0336 = 9.2836, and the 95% confidence interval can be given as (9.2836 − 1.96×0.05, 9.2836 + 1.96×0.05) = (9.1856, 9.3816), which becomes, after taking exponents, (9756, 11867) billion dollars.
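The arithmetic above can be checked directly, using the 3.36% mean growth and 5% standard deviation quoted in the notes:

```python
import math

log_gdp_2003 = 9.25    # log of 2003 U.S. GDP in billions of dollars, from the notes
mean_growth = 0.0336   # average growth rate of ln(GDP)
sd_growth = 0.05       # standard deviation of the growth rate

point = log_gdp_2003 + mean_growth          # 9.2836
lo = point - 1.96 * sd_growth               # 9.1856
hi = point + 1.96 * sd_growth               # 9.3816

# Exponentiating converts the log interval back to billions of dollars.
print(f"log-scale interval: ({lo:.4f}, {hi:.4f})")
print(f"dollar interval: ({math.exp(lo):,.0f}, {math.exp(hi):,.0f}) billion")
```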

[Figure: Growth Rate of GDP, 1930 to 2010; horizontal axis Time, vertical axis Growth Rate.]

We can see that the series is positively correlated, implying that a higher growth rate tends to be followed by a higher growth rate.

When we run a regression of ∆ln(GDPt) on ∆ln(GDPt-1), we have

Dependent Variable = ∆ln(GDPt)
R Square: 0.
Observations: 73

              Coefficients   Standard Error   t Stat     P-value
Intercept     0.01777004     0.005813254      3.056814   0.
∆ln(GDPt-1)   0.52095072     0.095413549      5.459924   6.63E-07

Unlike the unit root test, the t-statistic here follows the standard normal distribution under the null hypothesis that the growth rate, ∆ln(GDPt), is serially independent. The null hypothesis is rejected, leading to the conclusion that the growth rates are positively correlated.

The 95% confidence interval for the prediction of the future growth rate is based on the assumptions of normality and serial independence. Since the growth rate is serially correlated, the 95% confidence interval reported above is not valid. There is a way to handle this, but it is beyond the scope of this course.

Spurious Regression

Suppose both the dependent variable and the independent variable follow unit root processes, and they are independent of each other. Then, if we run the regression, the true slope is zero. However, the least squares estimator of the slope is not estimated appropriately: the t-statistic tends to get bigger as the sample size grows, and the R-square gets close to one. In other words, estimation and inference based on least squares will lead to a very wrong conclusion. The following result is obtained from 1000 computer-generated observations (in Excel), where X and Y are independent but both follow unit root processes.

R Square: 0.
Standard Error: 17.
Observations: 1000

            Coefficients   Standard Error   t Stat        P-value
Intercept   -26.353208     1.21434859       -21.7015182   7.54E-86
X           1.20822339     0.075305836      16.04421988   1.04E-51

As is seen from the t-statistic, the null hypothesis of β = 0 is decisively rejected. This regression is called a spurious regression in the sense that the regression result shows that X and Y are correlated while they are in fact independent.

The question is how we can know whether a time series regression is spurious or not. A simple way is to check whether the residuals from the regression follow a unit root: whenever the residuals follow a unit root, the regression is spurious.
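Both the spurious regression and the residual check can be simulated. A sketch: regress one random walk on an independent one, then run a Dickey-Fuller-type regression on the residuals (the exact critical values for a test on residuals differ slightly from the Dickey-Fuller ones, a detail ignored in this sketch):

```python
import numpy as np

def ols(y, X):
    """OLS coefficients, t-statistics, and residuals (X must include a constant)."""
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ b
    s2 = resid @ resid / (len(y) - X.shape[1])
    se = np.sqrt(np.diag(s2 * np.linalg.inv(X.T @ X)))
    return b, b / se, resid

rng = np.random.default_rng(0)
T = 1000
x = np.cumsum(rng.standard_normal(T))   # unit root process
y = np.cumsum(rng.standard_normal(T))   # independent unit root process

X = np.column_stack([np.ones(T), x])
b, tstats, resid = ols(y, X)
print(f"spurious slope t-statistic: {tstats[1]:.1f}")

# Residual check: a Dickey-Fuller-type regression of d(resid) on lagged resid.
dr = np.diff(resid)
R = np.column_stack([np.ones(T - 1), resid[:-1]])
_, r_tstats, _ = ols(dr, R)
print(f"residual unit-root t-statistic: {r_tstats[1]:.2f}")
```

The slope t-statistic is typically far from zero even though the true slope is zero, while the residual statistic typically fails to reject a unit root, flagging the regression as spurious.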

Example

A regression was run to see whether the per-child tax exemption has an effect on the birth rate. Yearly data for 1913–1984 were collected on gfr = births per 1000 women aged 15–44 and pe = the per-child tax exemption in real dollars. In the regression, WW2 is a World War 2 dummy equal to 1 for 1941–1945, year denotes the year of the observation (so it is the trend variable), and pill is a dummy variable for the birth control pill, which is one from 1963 on and zero before 1963. They obtained:

R Square: 0.
Observations: 72

            Coefficients    Standard Error   t Stat     P-value
Intercept   2310.32472      361.4199         6.392355   1.82E-08
Pe          0.278877806     0.04002          6.968485   1.72E-09
Year        -1.149872015    0.187904         -6.11947   5.49E-08
Pill        0.997446981     6.26163          0.159295   0.
WW2         -35.59228379    6.297377         -5.65192   3.53E-07

From the regression we see that the tax exemption has a significant effect on the birth rate, a very nice result. However, when we plot the residuals against time, we have:

[Figure: residuals from the fertility regression plotted against time, 1910 to 1990.]

What is the problem? Obviously, the residuals show a highly persistent pattern. Therefore, it is instructive to check whether the residuals follow a unit root.

Regression Using Time Series Data

The underlying problem behind spurious regression is high serial correlation in the errors. But even when the errors do not follow a unit root and are only more moderately correlated, problems remain. The most difficult problem is the estimation of the standard errors. There are a few different ways of estimating standard errors in the presence of moderate serial correlation, but the details should be left to a more advanced level.
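One common remedy, offered here only as a pointer beyond these notes, is to keep the least squares coefficients but compute heteroskedasticity- and autocorrelation-consistent (Newey-West) standard errors. A minimal numpy sketch, using the Bartlett kernel and an arbitrary lag truncation:

```python
import numpy as np

def newey_west_se(y, X, lags=4):
    """OLS coefficients with Newey-West (HAC) standard errors."""
    T, k = X.shape
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    u = y - X @ b
    Xu = X * u[:, None]                 # score contributions x_t * u_t
    S = Xu.T @ Xu / T                   # lag-0 term
    for l in range(1, lags + 1):
        w = 1 - l / (lags + 1)          # Bartlett kernel weight
        G = Xu[l:].T @ Xu[:-l] / T
        S += w * (G + G.T)
    XtX_inv = np.linalg.inv(X.T @ X / T)
    V = XtX_inv @ S @ XtX_inv / T       # sandwich covariance of b
    return b, np.sqrt(np.diag(V))

# Illustration: a linear model with AR(1) errors (rho = 0.8, an assumption).
rng = np.random.default_rng(0)
T = 500
x = np.linspace(0.0, 1.0, T)
e = np.zeros(T)
shocks = 0.5 * rng.standard_normal(T)
for t in range(1, T):
    e[t] = 0.8 * e[t - 1] + shocks[t]
y = 1.0 + 2.0 * x + e

X = np.column_stack([np.ones(T), x])
b, se = newey_west_se(y, X, lags=8)
print(f"slope = {b[1]:.2f}, Newey-West SE = {se[1]:.3f}")
```

With positively autocorrelated errors, the HAC standard errors are typically noticeably larger than the usual OLS ones, which is exactly the correction the last paragraph alludes to.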