Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Effects Random effect: (1) different statistical model of Random effect: Intercept only models in
regression or ANOVA model which assumes that an random effects MLR are equivalent to
independent variable is random; (2) generally used if ANOVA, random random effects ANOVA
the levels of the independent variable are thought to be effects regression and inclusion of one or
a small subset of the possible values which one wishes Fixed effect: fixed more level-1 predictors
to generalize to; (3) will probably produce larger effects ANOVA, makes the model
standard errors (less powerful). Fixed effect: (1) fixed effects equivalent to a random
statistical model typically used in regression and regression effects ANCOVA when
ANOVA assuming independent variable is fixed; (2) slopes do not vary
generalization of the results apply to similar values of across groups.
independent variable in the population or in other
studies; (3) will probably produce smaller standard
errors (more powerful).
Coefficients Random coefficient: term applies only to MLR Random Both used in MLR.
analyses in which intercepts, slopes, and variances can coefficient: the Slopes and intercept
be assumed to be random. MLR analyses most level-2 predictor, values can be
typically assume random coefficients. One can average income, is considered to be fixed or
conceptualize the coefficients obtained from the level-1 used to predict random, depending on
regressions as a type of random variable which comes school performance researchers'
from and generalizes to a distribution of possible in each school. assumptions and how
values. Groups are conceived of as a subset of the Intercept values for the model is specified.
possible groups. school performance The average intercept or
are assumed to be a slope is referred to as a
Fixed coefficient: a coefficient can be fixed to be non- sample of the "fixed effect." Variances
varying (invariant) across groups by setting the intercepts from a of the slopes and
between-group variance to zero. larger population of intercepts (if allowed to
schools. vary across groups) are
Random coefficients must be variable across groups. called “random
Conceptually, fixed coefficients may be invariant or Fixed coefficient: coefficients."
varying across groups. slopes or intercepts
constrained to be
equal over different
schools.
1. PART 3 – Long Answer Type (IE)
1. (1) What is the model that you fit to this data? Write the regression equation for this
model, define each variable and explain what would be the sign of the coefficients and
why? (10 M)
Ans.
m1
Strength
m2
II
I
Regression is done over Strength and Water-Cement Ratio. Here, the regressor
is X = Water- Cement Ratio while the Regressand is Y = Strength
When D = 0,
The equation is of the form
Y = A + B*X
This represents the first part of the graph. From visual inspection of graph, we
can see that A is positive and B is negative. This equation will give strength when
the ratio is below threshold ratio.
When D = 1,
Y = A + (B – C) *X – C*X’
This represents the second part of graph. Here, (B-C) shall be negative, as evident
from the graph. Since, B is -ve already (from the previous equation) and slope of
this part is more negative than the first part, we conclude that C also is -ve. Here
X’ is 70% in this case
1. (2) Suppose you ignore the scatter plot as shown above and fit the following regression
equation: Strength= α+β water/cement +u. (5 M) How will you interpret? What is wrong
with this model?
Ans.
Given,
Strength = A + B * Water-Cement Ratio + u
This is a binary regression model between strength and W-C ratio. To fit this
equation to the scatter plot, A should be positive, and B will be negative. While u refers
to the residual between the actual value and estimated values.
• In such scenario, value of u shall be higher such that summation of absolute
value of u or squared values of u will be higher in this case compared to previous.
Which in turn will affect the total error and error due to regression such that R-
square value will reduce. Thus, this model will less of a good fit to the scenario.
In the above model, threshold value will not be observable. One of the aims of the
experiment is to determine the critical value of water-cement ratio and use it as a
specification for mixing cement. Hence the whole purpose of experiment is failed. A
simple regression will not incorporate the sharp change. So, a piecewise linear function
is used for this type of regression for better R-square value. Maximum Likelihood model
can also be is to plot higher degree of regression to trace mean value of samples more
accurately.
2. (1) What econometric technique should be used by the team to solve this problem?
And why? What would be the dependent variable? (6 M)
Ans. As banks want to identify that whether they should off for loan or not. Therefore
regression is of qualitative nature. That is yes or no
We have three models to solve these kind of problems
1) LPM (linear probability model)
2) Lokgit
3) Probit
LPM has many problems like
1. Non normality of error term
2. Heteroscedasticity of error term etc
Many researchers use either logit or probit model
Logit is preferred for its mathematical simplicity.
The equation will look like
𝑝𝑖
𝑙𝑛 ( ) = 𝛽1 + 𝛽2 𝑥𝑖 + ⋯
1 − 𝑝𝑖
Pi is the no. of yes of loan offer accepted.
1-Pi is the number of loan offers rejected in a sample.
𝑝𝑖
In logit 𝑙𝑛 ( ) is used as a dependent variable.
1−𝑝𝑖
2.(2) What specific characteristics of the credit card history and personal details would
affect the dependent variable and how? Explain (6 M)
Ans. Specific characteristics of the credit card history that affect loan offers are
1. Number of times default to pay
This is normally calculated by C I B I L score. Better the score better is the person to
offer loan.
2. Number of times loan already taken and payment done on time
Specific characteristics of personal details that would affect dependent variables are
1. Salary - higher the salary more probability to pay bills on time
2. Age - lower the age more is the time left for retirement high paying potential
3. Number of dependents - more number of dependents mod is a chance of
defaulting payment.
2.(3) Suppose one of the independent variables is income of the customer. How can we
obtain the marginal effect of income on the dependent variable? (3 M)
𝑝𝑖
𝑙𝑛 ( ) = 𝛽1 + 𝛽2 𝐼 + ⋯ (where I is the Income)
1−𝑝𝑖
Marginal effect of income is a change in the dependent variable per unit change in
income.
Therefore,
Taking I to increase by one unit change in
𝑝𝑖
𝑙𝑛 ( ) = 𝛽2
1 − 𝑝𝑖
Taking antilog
𝑝𝑖
( ) = 𝑒 𝛽2
1 − 𝑝𝑖
𝑝𝑖 = 𝑒 𝛽2 (1 − 𝑝𝑖 )
𝑒 𝛽2
𝑝𝑖 =
1 + 𝑒𝛽2