Videos and questions for Chapter 5 of the course "Empirical Economics with R" at Ulm University (taught by Sebastian Kranz)
Make a guess which percentage of the job seekers who got the option for intensive counselling accepted it?
Consider the regression \[\text{job_6m}_i = \beta_0 + \beta_1 \text{treated}_i + u_i\] Is our estimator \(\hat \beta_1\) likely a consistent estimator of the effect of job counseling on those who received treatment?
In the following two quizzes let us consider two stories that could yield a bias in \(\hat \beta_1\) in the regression:
\[\text{job_6m}_i = \beta_0 + \beta_1 \text{treated}_i + u_i\]
Assume the that only those job seekers reject intensive counseling who can easily find themselves a good new job.
Assume the that only those job seekers reject intensive counseling who are demotivated and generally have low chances to find a new job.
Consider again our regression \[\text{job_6m}_i = \beta_0 + \beta_1 \text{treated}_i + u_i\]
and let the dummy variable \(\text{treat_option}_i\), which indicates whether subject \(i\) had the option for intensive counseling, be the instrument for \(\text{treated}_i\).
What is the exogeneity condition for the instrument in our example?
Is the exogeneity condition for the instrument \(\text{treat_option}_i\) satisfied in our application if we assume that subjects' assignment to treatment and control group was perfectly randomized?
What can we say about the relevance condition for the instrument \(\text{treat_option}_i\) in our application?
We found an IV estimator of \(\hat \beta_1 = 0.102\) and a lower OLS estimator of just \(\hat \beta_1 = 0.079\).
Comparing the OLS estimator, which story seems on average more consistent with some subjects' decision to reject the intensive counseling?
Note: At around 5 minutes 30 seconds, I say "increase by 10 percent", where of course it should be "increase by 10 percentage points", one should always be careful with this distinction...
Which condition would our instrument violate in the short IV regression given the problem explained in the video?
Could we, in principle, solve the problem by adding region specific fixed effects to our IV regression?
Great, you have finished the video lectures for Chapter 5. Now would be a good time to start with the RTutor problem set in order to perform some IV estimation yourself.