Prepare for your exams
Get points
Guidelines and tips

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search Store documents

The best documents sold by students who completed their studies

Search through all study resources

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

University Rankings

Discover the best universities in your country according to Docsity users

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

From our blog

Exams and Study

Go to the blog

Selection Models, Models for Counts - Econometric Analysis of Panel Data - Lecture Slides, Slides of Econometrics and Mathematical Economics

Veer Bahadur Singh Purvanchal University Econometrics and Mathematical Economics

Selection Models, Models for Counts, Hazard Models, Dynamic Models, Canonical Sample Selection Model, Marginal Effects, Extensions of the Poisson Model, Overdispersion are points which describes this lecture importance in Econometric Analysis of Panel Data course.

Typology: Slides

2011/2012

Uploaded on 11/10/2012

uzman 🇮🇳

4.8

(12)

148 documents

1 / 47

This page cannot be seen from the preview

Don't miss anything!

Econometric Analysis of Panel Data

23. Selection Models, Models for Counts,

Hazard Models, Dynamic Models

Docsity.com

Partial preview of the text

Download Selection Models, Models for Counts - Econometric Analysis of Panel Data - Lecture Slides and more Slides Econometrics and Mathematical Economics in PDF only on Docsity!

Econometric Analysis of Panel Data

23. Selection Models, Models for Counts,

Hazard Models, Dynamic Models

Canonical Sample Selection Model

Regression Equation

y*=x +

Sample Selection Mechanism

d=z +u; d=1[d > 0] (probit)

y = y* if d = 1; not observed otherwise

Is the sample 'nonrandomly selected?'

E[y*|x,d=1] = x +E[ | x, d 1]

= x +E[ | x,u z ]

= x something if Cor[ ,u|x] 0

A left out variable problem (again)

Incidental truncation

Two Step Estimation

i i i i

Step 1: Estimate the probit model

d *= +u ; d =1[d * > 0] (probit).

Estimation of by. Now compute

Step 2: Estimate the regression model with estimated re

λ = 

z γ

γ γ

z γ

i i

i i i

i i i i i i

i i i

gressor

y *= +

y = y * if d = 1; not observed otherwise

E[y *|x ,d =1] = +E[ | x , d 1]

Linearly regress y on x ,.

Step2a. Fix standard errors (Murphy

ε =

θλ

x β

and Topel). Estimate

and using and /n

σ θ e'e

The “LAMBDA”

FIML Estimation

d 0

i i i

2 d 1

i i

d 0

i i i

logL log

log exp

Let

logL log

log exp y ( 1 ) ( y )

= Φ − γ

−ε γ + ρε σ

 σ π 

− ρ  

ε = −

θ = σ δ β σ τ

= Φ − γ

 θ   

θ + Φ + τ γ + τ θ +

π  

 

i i

x β

x δ z x δ

Note : no inverse Mills ratio appears anywhere in the model.

Panel Data Extensions

 Mundlak Treatment: Zabel, Economics Letters, 1992

 Two step treatments: Wooldridge, 1995, etc. (See text)

 Both Fixed Effects: Greene, 2002- ‘Brute force’ (WIP)

 Random Parameters: Greene, 2003- (WIP), classical

simulation based

 Interesting survey: Jensen, Rosholm, Verner, CIM/CLS,

“A Comparison of Different Estimators…”

Models for Counts

German Health Care Usage Data , 7,293 Individuals, Varying Numbers of Periods

Variables in the file are

Data downloaded from Journal of Applied Econometrics Archive. This is an unbalanced panel with 7,

individuals. They can be used for regression, count models, binary choice, ordered choice, and bivariate binary

choice. This is a large data set. There are altogether 27,326 observations. The number of observations ranges

from 1 to 7. (Frequencies are: 1=1525, 2=2158, 3=825, 4=926, 5=1051, 6=1000, 7=987). Note, the variable

NUMOBS below tells how many observations there are for each person. This variable is repeated in each row of

the data for the person. (Downlo0aded from the JAE Archive)

DOCTOR = 1(Number of doctor visits > 0)

HSAT = health satisfaction, coded 0 (low) - 10 (high)

DOCVIS = number of doctor visits in last three months

HOSPVIS = number of hospital visits in last calendar year

PUBLIC = insured in public health insurance = 1; otherwise = 0

ADDON = insured by add-on insurance = 1; otherswise = 0

HHNINC = household nominal monthly net income in German marks / 10000.

(4 observations with income=0 were dropped)

HHKIDS = children under age 16 in the household = 1; otherwise = 0

EDUC = years of schooling

AGE = age in years

MARRIED = marital status

EDUC = years of education

Hospital Visits

Histogram for Variable HOSPITAL

Frequency

HOSPITAL

694

1388

2082

2776

0 1 2 3 4 5 6 7 8 9 10

Choice Based Sample: Censored at Y=10, then 90% of the zeros were deleted.

Hospital Visits

+---------------------------------------------+

| Poisson Regression |

| Number of observations 4916 |

| Iterations completed 7 |

| Log likelihood function -5967.059 |

| Restricted log likelihood -5995.100 |

| Chi squared 56.08026 |

| Degrees of freedom 5 |

| Prob[ChiSqd > value] = .0000000 |

| Chi- squared = 10292.78230 RsqP= .0144 |

| G - squared = 6704.29865 RsqD= .0083 |

| Overdispersion tests: g=mu(i) : 7.283 |

| Overdispersion tests: g=mu(i)^2: 7.358 |

+---------------------------------------------+

+---------+--------------+----------------+--------+---------+----------+

+---------+--------------+----------------+--------+---------+----------+

Constant -.01097644 .12877669 -.085.

AGE .00492571 .00168005 2.932 .0034 44.

HHNINC .18287767 .09558999 1.913 .0557.

HHKIDS .01073511 .04023519 .267 .7896.

EDUC -.05292805 .00860326 -6.152 .0000 11.

MARRIED -.04487271 .04372825 -1.026 .3048.

Extensions of the Poisson Model

 Overdispersion

 Zero Inflation (As already discussed in class)

 Sample Selection

 Panel Data

 Endogenous RHS variables

 Semiparametric Approaches: GMM Estimators

 (The literature is vast)

Overdispersion

 In the Poisson model, Var[y|x]=E[y|x]

 Equidispersion is a strong assumption

 Negbin II: Var[y|x]=E[y|x] + σ

E[y|x]

 How does overdispersion arise:

 NonPoissonness

 Omitted Heterogeneity

exp( )

Prob[y=j|x,u]= , exp(x u)

j!

Prob[y=j|x]= Prob[y=j|x,u]f(u)du

exp( u)u

If f(exp(u))= (Gamma with mean 1)

Then Prob[y=j|x] is negative binomial.

α α−

∫

Testing for Overdispersion

 Regression based test: Regress (y-mean)

on

mean

 Neyman – Pearson tests in NegBin regression

| Overdispersion tests: g=mu(i) : 7.283 |

| Overdispersion tests: g=mu(i)^2: 7.358 |

Dispersion parameter for count data model

Alpha .63363306 .03061167 20.699.

Sample Selection

An approach modeled on Heckman's model

Regression Equation:

Prob[y=j|x,u]=P(λ); λ=exp(x β+θu)

Selection Equation:

d=1[z δ+ε>0] (The usual probit)

[u,ε]~n[0,0,1,1,ρ] (Var[u] is

absorbed in θ)

Estimation:

Nonlinear Least Squares: [Terza (1998, see cite in text).]

Φ(z δ+ρ)

E[y|x,d=1]=exp(x β+θρ )

Φ(z δ)

FIML using Hermite quadrature: [Greene (Stern wp, 97-02, 1997)]

Modeling Duration

 Time until business failure

 Time until exercise of a warranty

 Length of an unemployment spell

 Length of time between children

 Time between business cycles

 Time between wars or civil insurrections

 Time between policy changes

 Etc.

Hazard Models for Duration

Selection Models, Models for Counts - Econometric Analysis of Panel Data - Lecture Slides, Slides of Econometrics and Mathematical Economics

Related documents

Partial preview of the text

Download Selection Models, Models for Counts - Econometric Analysis of Panel Data - Lecture Slides and more Slides Econometrics and Mathematical Economics in PDF only on Docsity!

Econometric Analysis of Panel Data

23. Selection Models, Models for Counts,

Hazard Models, Dynamic Models

Regression Equation

y*=x +

Sample Selection Mechanism

d=z +u; d=1[d > 0] (probit)

y = y* if d = 1; not observed otherwise

Is the sample 'nonrandomly selected?'

E[y*|x,d=1] = x +E[ | x, d 1]

= x +E[ | x,u z ]

= x something if Cor[ ,u|x] 0

A left out variable problem (again)

Incidental truncation

FIML Estimation

 Overdispersion

 Zero Inflation (As already discussed in class)

 Sample Selection

 Panel Data

 Endogenous RHS variables

 Semiparametric Approaches: GMM Estimators

 (The literature is vast)

 In the Poisson model, Var[y|x]=E[y|x]

 Equidispersion is a strong assumption

 Negbin II: Var[y|x]=E[y|x] + σ

E[y|x]

 How does overdispersion arise:

exp( )

Prob[y=j|x,u]= , exp(x u)

j!

Prob[y=j|x]= Prob[y=j|x,u]f(u)du

exp( u)u

If f(exp(u))= (Gamma with mean 1)

Then Prob[y=j|x] is negative binomial.

 Regression based test: Regress (y-mean)

on

mean

 Neyman – Pearson tests in NegBin regression

An approach modeled on Heckman's model

Regression Equation:

Prob[y=j|x,u]=P(λ); λ=exp(x β+θu)

Selection Equation:

d=1[z δ+ε>0] (The usual probit)

[u,ε]~n[0,0,1,1,ρ] (Var[u] is

absorbed in θ)

Estimation:

Nonlinear Least Squares: [Terza (1998, see cite in text).]

Φ(z δ+ρ)

E[y|x,d=1]=exp(x β+θρ )

Φ(z δ)

FIML using Hermite quadrature: [Greene (Stern wp, 97-02, 1997)]

 Time until business failure

 Time until exercise of a warranty

 Length of an unemployment spell

 Length of time between children

 Time between business cycles

 Time between wars or civil insurrections

 Time between policy changes

 Etc.

 Basic hazard rate model

 Parametric models

 Duration dependence

 Censoring

 Time varying covariates

 Sample selection