Добавил:

Upload Опубликованный материал нарушает ваши авторские права? Сообщите нам.

Вуз:

Московский государственный физико-технический университет (МФТИ)

Предмет:

[НЕСОРТИРОВАННОЕ]

Файл:

EViews Guides BITCH / EV72.pdf

Скачиваний:

670

Добавлен:

03.06.2015

Размер:

8.25 Mб

Скачать

☆

<<< < Предыдущая 29 30 31 32 33 34 35 36 37 38 39 4041 / 11941 42 43 44 45 46 47 48 49 50 51 52 53 > Следующая >>>

266—Chapter 26. Discrete and Limited Dependent Variable Models

Then run the artificial regression by clicking on Quick/Estimate Equation…, selecting

Least Squares, and entering:

brmr_y brmr_x (psi*(-xb)*fac)

You can obtain the fitted values by clicking on the Forecast button in the equation toolbar of this artificial regression. The LM test statistic is the sum of squares of these fitted values. If the fitted values from the artificial regression are saved in BRMR_YF, the test statistic can be saved as a scalar named LM_TEST:

scalar lm_test=@sumsq(brmr_yf)

which contains the value 1.5408. You can compare the value of this test statistic with the critical values from the chi-square table with one degree of freedom. To save the p-value as a scalar, enter the command:

scalar p_val=1-@cchisq(lm_test,1)

To examine the value of LM_TEST or P_VAL, double click on the name in the workfile window; the value will be displayed in the status line at the bottom of the EViews window. The p-value in this example is roughly 0.21, so we have little evidence against the null hypothesis of homoskedasticity.

Ordered Dependent Variable Models

EViews estimates the ordered-response model of Aitchison and Silvey (1957) under a variety of assumptions about the latent error distribution. In ordered dependent variable models, the observed y denotes outcomes representing ordered or ranked categories. For example, we may observe individuals who choose between one of four educational outcomes: less than high school, high school, college, advanced degree. Or we may observe individuals who are employed, partially retired, or fully retired.

As in the binary dependent variable model, we can model the observed response by considering a latent variable yi that depends linearly on the explanatory variables xi :

yi = xi¢b + ei

(26.18)

where is ei are independent and identically distributed random variables. The observed yi is determined from yi using the rule:

						Ordered Dependent Variable Models—267

			£ g
	0	if y	£ g		1
		i		< y	1	£ g
	1	if g	1	< y		£ g	2
yi =			1	i			2
yi =	2	if g	2	< y		£ g	(26.19)
			2	i			2
	M	M
	M	if gM < yi

It is worth noting that the actual values chosen to represent the categories in y are completely arbitrary. All the ordered specification requires is for ordering to be preserved so that yi < yj implies that yi < yj .

It follows that the probabilities of observing each value of y are given by

Pr(yi = 0		xi, b, g) = F(g1 – xi¢b)

Pr(yi = 1		xi, b, g) = F(g2 – xi¢b)–F(g1 – xi¢b)	(26.20)

Pr(yi = 2		xi, b, g) = F(g3 – xi¢b)–F(g2 – xi¢b)

	º

Pr(yi = M xi, b, g) = 1–F(gM – xi¢b)

where F is the cumulative distribution function of e .

The threshold values g are estimated along with the b coefficients by maximizing the log likelihood function:


N	M
l(b, g) = Â Â log(Pr(yi = j		xi, b, g)) 1(yi = j)	(26.21)
			(26.21)

i = 1	j = 0

where 1(.) is an indicator function which takes the value 1 if the argument is true, and 0 if the argument is false. By default, EViews uses analytic second derivative methods to obtain parameter and variance matrix of the estimated coefficient estimates (see “Quadratic hillclimbing (Goldfeld-Quandt)” on page 757).

Estimating Ordered Models in EViews

Suppose that the dependent variable DANGER is an index ordered from 1 (least dangerous animal) to 5 (most dangerous animal). We wish to model this ordered dependent variable as a function of the explanatory variables, BODY, BRAIN and SLEEP. Note that the values that we have assigned to the dependent variable are not relevant, only the ordering implied by those values. EViews will estimate an identical model if the dependent variable is recorded to take the values 1, 2, 3, 4, 5 or 10, 234, 3243, 54321, 123456.

268—Chapter 26. Discrete and Limited Dependent Variable Models

(The data, which are from Allison, Truett, and D.V. Cicchetti (1976).“Sleep in Mammals: Ecological and Constitutional Correlates,” Science, 194, 732-734, are available in the “Order.WF1” dataset. A more complete version of the data may be obtained from StatLib: http://lib.stat.cmu.edu/datasets/sleep).

To estimate this model, select Quick/Estimate Equation… from the main menu. From the Equation Estimation dialog, select estimation method ORDERED. The standard estimation dialog will change to match this specification.

There are three parts to specifying an ordered variable model: the equation specification, the error specification, and the sample specification. First, in the Equation specification field, you should type the name of the ordered dependent variable followed by the list of your regressors, or you may enter an explicit expression for the index. In our example, you will enter:

danger body brain sleep

Also keep in mind that:

•A separate constant term is not separately identified from the limit points g , so EViews will ignore any constant term in your specification. Thus, the model:

danger c body brain sleep

is equivalent to the specification above.

•EViews requires the dependent variable to be integer valued, otherwise you will see an error message, and estimation will stop. This is not, however, a serious restriction, since you can easily convert the series into an integer using @round, @floor or @ceil in an auto-series expression.

Next, select between the ordered logit, ordered probit, and the ordered extreme value models by choosing one of the three distributions for the latent error term.

Lastly, specify the estimation sample.

You may click on the Options tab to set the iteration limit, convergence criterion, optimization algorithm, and most importantly, method for computing coefficient covariances. See “Technical Notes” on page 296 for a discussion of these methods.

Now click on OK, EViews will estimate the parameters of the model using iterative procedures.

Once the estimation procedure converges, EViews will display the estimation results in the equation window. The first part of the table contains the usual header information, including the assumed error distribution, estimation sample, iteration and convergence information, number of distinct values for y , and the method of computing the coefficient covariance matrix.

Ordered Dependent Variable Models—269

Dependent Variable: DANGER
Method: ML - Ordered Probit (Quadratic hill climbing)
Date: 08/12/09	Time: 00:13
Sample (adjusted): 1 61
Included observations: 58 after adjustments
Number of ordered indicator values: 5
Convergence achieved after 7 iterations
Covariance matrix computed using second derivatives

Variable	Coefficient	Std. Error	z-Statistic	Prob.

BODY	0.000247	0.000421	0.587475	0.5569
BRAIN	-0.000397	0.000418	-0.950366	0.3419
SLEEP	-0.199508	0.041641	-4.791138	0.0000

Below the header information are the coefficient estimates and asymptotic standard errors, and the corresponding z-statistics and significance levels. The estimated coefficients of the ordered model must be interpreted with care (see Greene (2008, section 23.10) or Johnston and DiNardo (1997, section 13.9)).

The sign of ˆ j shows the direction of the change in the probability of falling in the endpoint b

rankings (y = 0 or y = 1 ) when xij changes. Pr(y = 0 ) changes in the opposite direc-
ˆ			ˆ
tion of the sign of bj and Pr(y		= M ) changes in the same direction as the sign of bj . The
effects on the probability of falling in any of the middle rankings are given by:
∂Pr(y = k)	=	∂F(gk + 1 – xi¢b)	∂F(gk – xi¢b)
----------------------------∂bj		----------------------------------------- –	---------------------------------- (26.22)
		∂bj	∂bj

for k = 1, 2, º, M – 1 . It is impossible to determine the signs of these terms, a priori.

The lower part of the estimation output, labeled “Limit Points”, presents the estimates of the g coefficients and the associated standard errors and probability values:

Limit Points

LIMIT _2:C(4)	-2.798449	0.514784	-5.436166	0.0000
LIMIT _3:C(5)	-2.038945	0.492198	-4.142527	0.0000
LIMIT _4:C(6)	-1.434567	0.473679	-3.028563	0.0025
LIMIT _5:C(7)	-0.601211	0.449109	-1.338675	0.1807


Pseudo R-squared	0.147588	Akaike info criterion		2.890028
Schwarz criterion	3.138702	Log likelihood		-76.81081
Hannan-Quinn criter.	2.986891	Restr. log likelihood		-90.10996
LR statistic	26.59830	Avg. log likelihood		-1.324324
Prob(LR statistic)	0.000007

Note that the coefficients are labeled both with the identity of the limit point, and the coefficient number. Just below the limit points are the summary statistics for the equation.

270—Chapter 26. Discrete and Limited Dependent Variable Models

Estimation Problems

Most of the previous discussion of estimation problems for binary models (“Estimation Problems” on page 254) also holds for ordered models. In general, these models are wellbehaved and will require little intervention.

There are cases, however, where problems will arise. First, EViews currently has a limit of 750 total coefficients in an ordered dependent variable model. Thus, if you have 25 righthand side variables, and a dependent variable with 726 distinct values, you will be unable to estimate your model using EViews.

Second, you may run into identification problems and estimation difficulties if you have some groups where there are very few observations. If necessary, you may choose to combine adjacent groups and re-estimate the model.

EViews may stop estimation with the message “Parameter estimates for limit points are nonascending”, most likely on the first iteration. This error indicates that parameter values for the limit points were invalid, and that EViews was unable to adjust these values to make them valid. Make certain that if you are using user defined parameters, the limit points are strictly increasing. Better yet, we recommend that you employ the EViews starting values since they are based on a consistent first-stage estimation procedure, and should therefore be quite well-behaved.

Views of Ordered Equations

EViews provides you with several views of an ordered equation. As with other equations, you can examine the specification and estimated covariance matrix as well as perform Wald and likelihood ratio tests on coefficients of the model. In addition, there are several views that are specialized for the ordered model:

•Dependent Variable Frequencies — computes a one-way frequency table for the ordered dependent variable for the observations in the estimation sample. EViews presents both the frequency table and the cumulative frequency table in levels and percentages.

•Prediction Evaluation— classifies observations on the basis of the predicted response. EViews performs the classification on the basis of the category with the maximum predicted probability.

The first portion of the output shows results for the estimated equation and for the constant probability (no regressor) specifications.

Ordered Dependent Variable Models—271

Prediction Evaluation for Ordered Specification

Equation: EQ_ORDER

Date: 08/12/09 Time: 00:20

Estimated Equation

Dep. Value	Obs.	Correct	Incorrect	% Correct		% Incorrect


1	18	10	8	55.	556	44.444
2	14	6	8	42.	857	57.143
3	10	0	10	0.	000	100.000
4	9	3	6	33.	333	66.667
5	7	6	1	85.	714	14.286
Total	58	25	33	43.	103	56.897

		Constant Probability Spec.

Dep. Value	Obs.	Correct	Incorrect	% Correct		% Incorrect

1	18	18	0	100.	000	0.000
2	14	0	14	0.	000	100.000
3	10	0	10	0.	000	100.000
4	9	0	9	0.	000	100.000
5	7	0	7	0.	000	100.000
Total	58	18	40	31.	034	68.966

Each row represents a distinct value for the dependent variable. The “Obs” column indicates the number of observations with that value. Of those, the number of “Correct” observations are those for which the predicted probability of the response is the highest. Thus, 10 of the 18 individuals with a DANGER value of 1 were correctly specified. Overall, 43% of the observations were correctly specified for the fitted model versus 31% for the constant probability model.

The bottom portion of the output shows additional statistics measuring this improvement

Gain over Constant Prob. Spec.

		Equation	Constant
Dep. Value	Obs.	% Incorrect	% Incorrect		Total Gain*		Pct. Gain**


1	18	44.444	0.	000	-44.444		NA
2	14	57.143	100.	000	42.	857	42.857
3	10	100.000	100.	000	0.	000	0.000
4	9	66.667	100.	000	33.	333	33.333
5	7	14.286	100.	000	85.	714	85.714
Total	58	56.897	68.	966	12.	069	17.500

Note the improvement in the prediction for DANGER values 2, 4, and especially 5 comes from refinement of the constant only prediction of DANGER=1.

272—Chapter 26. Discrete and Limited Dependent Variable Models

Procedures for Ordered Equations

Make Ordered Limit Vector/Matrix

The full set of coefficients and the covariance matrix may be obtained from the estimated equation in the usual fashion (see “Working With Equation Statistics” on page 16). In some circumstances, however, you may wish to perform inference using only the estimates of the g coefficients and the associated covariances.

The Make Ordered Limit Vector and Make Ordered Limit Covariance Matrix procedures provide a shortcut method of obtaining the estimates associated with the g coefficients. The first procedure creates a vector (using the next unused name of the form LIMITS01, LIMITS02, etc.) containing the estimated g coefficients. The latter procedure creates a symmetric matrix containing the estimated covariance matrix of the g . The matrix will be given an unused name of the form VLIMITS01, VLIMITS02, etc., where the “V” is used to indicate that these are the variances of the estimated limit points.

Forecasting using Models

You cannot forecast directly from an estimated ordered model since the dependent variable represents categorical or rank data. EViews does, however, allow you to forecast the probability associated with each category. To forecast these probabilities, you must first create a model. Choose Proc/Make Model and EViews will open an untitled model window containing a system of equations, with a separate equation for the probability of each ordered response value.

To forecast from this model, simply click the Solve button in the model window toolbar. If you select Scenario 1 as your solution scenario, the default settings will save your results in a set of named series with “_1” appended to the end of the each underlying name. See Chapter 34. “Models,” beginning on page 511 for additional detail on modifying and solving models.

For this example, the series I_DANGER_1 will contain the fitted linear index xi¢ˆ . The fitted b

probability of falling in category 1 will be stored as a series named DANGER_1_1, the fitted probability of falling in category 2 will be stored as a series named DANGER_2_1, and so on. Note that for each observation, the fitted probability of falling in each of the categories sums up to one.

Make Residual Series

The generalized residuals of the ordered model are the derivatives of the log likelihood with respect to a hypothetical unit- x variable. These residuals are defined to be uncorrelated with the explanatory variables of the model (see Chesher and Irish (1987), and Gourieroux, Monfort, Renault and Trognon (1987) for details), and thus may be used in a variety of specification tests.

<<< < Предыдущая 29 30 31 32 33 34 35 36 37 38 39 4041 / 11941 42 43 44 45 46 47 48 49 50 51 52 53 > Следующая >>>

Соседние файлы в папке EViews Guides BITCH

#
03.06.20158.25 Mб670EV72.pdf
#
03.06.201513.69 Mб244EViews_Illustrated.pdf
#
03.06.20151 Mб196EViews_tutorial.pdf