
- •Preface
- •Part IV. Basic Single Equation Analysis
- •Chapter 18. Basic Regression Analysis
- •Equation Objects
- •Specifying an Equation in EViews
- •Estimating an Equation in EViews
- •Equation Output
- •Working with Equations
- •Estimation Problems
- •References
- •Chapter 19. Additional Regression Tools
- •Special Equation Expressions
- •Robust Standard Errors
- •Weighted Least Squares
- •Nonlinear Least Squares
- •Stepwise Least Squares Regression
- •References
- •Chapter 20. Instrumental Variables and GMM
- •Background
- •Two-stage Least Squares
- •Nonlinear Two-stage Least Squares
- •Limited Information Maximum Likelihood and K-Class Estimation
- •Generalized Method of Moments
- •IV Diagnostics and Tests
- •References
- •Chapter 21. Time Series Regression
- •Serial Correlation Theory
- •Testing for Serial Correlation
- •Estimating AR Models
- •ARIMA Theory
- •Estimating ARIMA Models
- •ARMA Equation Diagnostics
- •References
- •Chapter 22. Forecasting from an Equation
- •Forecasting from Equations in EViews
- •An Illustration
- •Forecast Basics
- •Forecasts with Lagged Dependent Variables
- •Forecasting with ARMA Errors
- •Forecasting from Equations with Expressions
- •Forecasting with Nonlinear and PDL Specifications
- •References
- •Chapter 23. Specification and Diagnostic Tests
- •Background
- •Coefficient Diagnostics
- •Residual Diagnostics
- •Stability Diagnostics
- •Applications
- •References
- •Part V. Advanced Single Equation Analysis
- •Chapter 24. ARCH and GARCH Estimation
- •Basic ARCH Specifications
- •Estimating ARCH Models in EViews
- •Working with ARCH Models
- •Additional ARCH Models
- •Examples
- •References
- •Chapter 25. Cointegrating Regression
- •Background
- •Estimating a Cointegrating Regression
- •Testing for Cointegration
- •Working with an Equation
- •References
- •Binary Dependent Variable Models
- •Ordered Dependent Variable Models
- •Censored Regression Models
- •Truncated Regression Models
- •Count Models
- •Technical Notes
- •References
- •Chapter 27. Generalized Linear Models
- •Overview
- •How to Estimate a GLM in EViews
- •Examples
- •Working with a GLM Equation
- •Technical Details
- •References
- •Chapter 28. Quantile Regression
- •Estimating Quantile Regression in EViews
- •Views and Procedures
- •Background
- •References
- •Chapter 29. The Log Likelihood (LogL) Object
- •Overview
- •Specification
- •Estimation
- •LogL Views
- •LogL Procs
- •Troubleshooting
- •Limitations
- •Examples
- •References
- •Part VI. Advanced Univariate Analysis
- •Chapter 30. Univariate Time Series Analysis
- •Unit Root Testing
- •Panel Unit Root Test
- •Variance Ratio Test
- •BDS Independence Test
- •References
- •Part VII. Multiple Equation Analysis
- •Chapter 31. System Estimation
- •Background
- •System Estimation Methods
- •How to Create and Specify a System
- •Working With Systems
- •Technical Discussion
- •References
- •Vector Autoregressions (VARs)
- •Estimating a VAR in EViews
- •VAR Estimation Output
- •Views and Procs of a VAR
- •Structural (Identified) VARs
- •Vector Error Correction (VEC) Models
- •A Note on Version Compatibility
- •References
- •Chapter 33. State Space Models and the Kalman Filter
- •Background
- •Specifying a State Space Model in EViews
- •Working with the State Space
- •Converting from Version 3 Sspace
- •Technical Discussion
- •References
- •Chapter 34. Models
- •Overview
- •An Example Model
- •Building a Model
- •Working with the Model Structure
- •Specifying Scenarios
- •Using Add Factors
- •Solving the Model
- •Working with the Model Data
- •References
- •Part VIII. Panel and Pooled Data
- •Chapter 35. Pooled Time Series, Cross-Section Data
- •The Pool Workfile
- •The Pool Object
- •Pooled Data
- •Setting up a Pool Workfile
- •Working with Pooled Data
- •Pooled Estimation
- •References
- •Chapter 36. Working with Panel Data
- •Structuring a Panel Workfile
- •Panel Workfile Display
- •Panel Workfile Information
- •Working with Panel Data
- •Basic Panel Analysis
- •References
- •Chapter 37. Panel Estimation
- •Estimating a Panel Equation
- •Panel Estimation Examples
- •Panel Equation Testing
- •Estimation Background
- •References
- •Part IX. Advanced Multivariate Analysis
- •Chapter 38. Cointegration Testing
- •Johansen Cointegration Test
- •Single-Equation Cointegration Tests
- •Panel Cointegration Testing
- •References
- •Chapter 39. Factor Analysis
- •Creating a Factor Object
- •Rotating Factors
- •Estimating Scores
- •Factor Views
- •Factor Procedures
- •Factor Data Members
- •An Example
- •Background
- •References
- •Appendix B. Estimation and Solution Options
- •Setting Estimation Options
- •Optimization Algorithms
- •Nonlinear Equation Solution Methods
- •References
- •Appendix C. Gradients and Derivatives
- •Gradients
- •Derivatives
- •References
- •Appendix D. Information Criteria
- •Definitions
- •Using Information Criteria as a Guide to Model Selection
- •References
- •Appendix E. Long-run Covariance Estimation
- •Technical Discussion
- •Kernel Function Properties
- •References
- •Index
- •Symbols
- •Numerics

266—Chapter 26. Discrete and Limited Dependent Variable Models
Then run the artificial regression by clicking on Quick/Estimate Equation…, selecting
Least Squares, and entering:
brmr_y brmr_x (psi*(-xb)*fac)
You can obtain the fitted values by clicking on the Forecast button in the equation toolbar of this artificial regression. The LM test statistic is the sum of squares of these fitted values. If the fitted values from the artificial regression are saved in BRMR_YF, the test statistic can be saved as a scalar named LM_TEST:
scalar lm_test=@sumsq(brmr_yf)
which contains the value 1.5408. You can compare the value of this test statistic with the critical values from the chi-square table with one degree of freedom. To save the p-value as a scalar, enter the command:
scalar p_val=1-@cchisq(lm_test,1)
To examine the value of LM_TEST or P_VAL, double click on the name in the workfile window; the value will be displayed in the status line at the bottom of the EViews window. The p-value in this example is roughly 0.21, so we have little evidence against the null hypothesis of homoskedasticity.
Ordered Dependent Variable Models
EViews estimates the ordered-response model of Aitchison and Silvey (1957) under a variety of assumptions about the latent error distribution. In ordered dependent variable models, the observed y denotes outcomes representing ordered or ranked categories. For example, we may observe individuals who choose between one of four educational outcomes: less than high school, high school, college, advanced degree. Or we may observe individuals who are employed, partially retired, or fully retired.
As in the binary dependent variable model, we can model the observed response by considering a latent variable yi that depends linearly on the explanatory variables xi :
yi = xi¢b + ei |
(26.18) |
where is ei are independent and identically distributed random variables. The observed yi is determined from yi using the rule:

|
|
|
|
|
|
|
|
Ordered Dependent Variable Models—267 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
£ g |
|
|
|
||
|
|
0 |
if y |
1 |
|
|
|||
|
|
|
i |
< y |
|
£ g |
|
||
|
|
1 |
if g |
1 |
|
2 |
|||
yi = |
|
|
|
i |
|
|
|
||
|
2 |
if g |
2 |
< y |
|
£ g |
(26.19) |
||
|
|
|
|
i |
|
|
|
2 |
|
|
|
M |
M |
|
|
|
|
|
|
|
|
M |
if gM < yi |
|
|
||||
|
|
|
|
|
|
|
|
|
|
It is worth noting that the actual values chosen to represent the categories in y are completely arbitrary. All the ordered specification requires is for ordering to be preserved so that yi < yj implies that yi < yj .
It follows that the probabilities of observing each value of y are given by
Pr(yi = 0 |
|
xi, b, g) = F(g1 – xi¢b) |
|
|
|
||
Pr(yi = 1 |
|
xi, b, g) = F(g2 – xi¢b)–F(g1 – xi¢b) |
(26.20) |
|
|||
Pr(yi = 2 |
|
xi, b, g) = F(g3 – xi¢b)–F(g2 – xi¢b) |
|
|
|||
|
º |
|
Pr(yi = M xi, b, g) = 1–F(gM – xi¢b)
where F is the cumulative distribution function of e .
The threshold values g are estimated along with the b coefficients by maximizing the log likelihood function:
N |
M |
|
||
l(b, g) = Â Â log(Pr(yi = j |
|
xi, b, g)) 1(yi = j) |
(26.21) |
|
|
||||
|
|
|||
i = 1 |
j = 0 |
|
where 1(.) is an indicator function which takes the value 1 if the argument is true, and 0 if the argument is false. By default, EViews uses analytic second derivative methods to obtain parameter and variance matrix of the estimated coefficient estimates (see “Quadratic hillclimbing (Goldfeld-Quandt)” on page 757).
Estimating Ordered Models in EViews
Suppose that the dependent variable DANGER is an index ordered from 1 (least dangerous animal) to 5 (most dangerous animal). We wish to model this ordered dependent variable as a function of the explanatory variables, BODY, BRAIN and SLEEP. Note that the values that we have assigned to the dependent variable are not relevant, only the ordering implied by those values. EViews will estimate an identical model if the dependent variable is recorded to take the values 1, 2, 3, 4, 5 or 10, 234, 3243, 54321, 123456.

268—Chapter 26. Discrete and Limited Dependent Variable Models
(The data, which are from Allison, Truett, and D.V. Cicchetti (1976).“Sleep in Mammals: Ecological and Constitutional Correlates,” Science, 194, 732-734, are available in the “Order.WF1” dataset. A more complete version of the data may be obtained from StatLib: http://lib.stat.cmu.edu/datasets/sleep).
To estimate this model, select Quick/Estimate Equation… from the main menu. From the Equation Estimation dialog, select estimation method ORDERED. The standard estimation dialog will change to match this specification.
There are three parts to specifying an ordered variable model: the equation specification, the error specification, and the sample specification. First, in the Equation specification field, you should type the name of the ordered dependent variable followed by the list of your regressors, or you may enter an explicit expression for the index. In our example, you will enter:
danger body brain sleep
Also keep in mind that:
•A separate constant term is not separately identified from the limit points g , so EViews will ignore any constant term in your specification. Thus, the model:
danger c body brain sleep
is equivalent to the specification above.
•EViews requires the dependent variable to be integer valued, otherwise you will see an error message, and estimation will stop. This is not, however, a serious restriction, since you can easily convert the series into an integer using @round, @floor or @ceil in an auto-series expression.
Next, select between the ordered logit, ordered probit, and the ordered extreme value models by choosing one of the three distributions for the latent error term.
Lastly, specify the estimation sample.
You may click on the Options tab to set the iteration limit, convergence criterion, optimization algorithm, and most importantly, method for computing coefficient covariances. See “Technical Notes” on page 296 for a discussion of these methods.
Now click on OK, EViews will estimate the parameters of the model using iterative procedures.
Once the estimation procedure converges, EViews will display the estimation results in the equation window. The first part of the table contains the usual header information, including the assumed error distribution, estimation sample, iteration and convergence information, number of distinct values for y , and the method of computing the coefficient covariance matrix.

Ordered Dependent Variable Models—269
Dependent Variable: DANGER |
|
|
|
|
Method: ML - Ordered Probit (Quadratic hill climbing) |
|
|
||
Date: 08/12/09 |
Time: 00:13 |
|
|
|
Sample (adjusted): 1 61 |
|
|
|
|
Included observations: 58 after adjustments |
|
|
||
Number of ordered indicator values: 5 |
|
|
|
|
Convergence achieved after 7 iterations |
|
|
|
|
Covariance matrix computed using second derivatives |
|
|||
|
|
|
|
|
Variable |
Coefficient |
Std. Error |
z-Statistic |
Prob. |
|
|
|
|
|
BODY |
0.000247 |
0.000421 |
0.587475 |
0.5569 |
BRAIN |
-0.000397 |
0.000418 |
-0.950366 |
0.3419 |
SLEEP |
-0.199508 |
0.041641 |
-4.791138 |
0.0000 |
|
|
|
|
|
Below the header information are the coefficient estimates and asymptotic standard errors, and the corresponding z-statistics and significance levels. The estimated coefficients of the ordered model must be interpreted with care (see Greene (2008, section 23.10) or Johnston and DiNardo (1997, section 13.9)).
The sign of ˆ j shows the direction of the change in the probability of falling in the endpoint b
rankings (y = 0 or y = 1 ) when xij changes. Pr(y = 0 ) changes in the opposite direc- |
|||
ˆ |
|
|
ˆ |
tion of the sign of bj and Pr(y |
= M ) changes in the same direction as the sign of bj . The |
||
effects on the probability of falling in any of the middle rankings are given by: |
|||
∂Pr(y = k) |
= |
∂F(gk + 1 – xi¢b) |
∂F(gk – xi¢b) |
----------------------------∂bj |
----------------------------------------- – |
---------------------------------- (26.22) |
|
|
∂bj |
∂bj |
for k = 1, 2, º, M – 1 . It is impossible to determine the signs of these terms, a priori.
The lower part of the estimation output, labeled “Limit Points”, presents the estimates of the g coefficients and the associated standard errors and probability values:
Limit Points
LIMIT _2:C(4) |
-2.798449 |
0.514784 |
-5.436166 |
0.0000 |
LIMIT _3:C(5) |
-2.038945 |
0.492198 |
-4.142527 |
0.0000 |
LIMIT _4:C(6) |
-1.434567 |
0.473679 |
-3.028563 |
0.0025 |
LIMIT _5:C(7) |
-0.601211 |
0.449109 |
-1.338675 |
0.1807 |
|
|
|
|
|
|
|
|
|
|
Pseudo R-squared |
0.147588 |
Akaike info criterion |
2.890028 |
|
Schwarz criterion |
3.138702 |
Log likelihood |
|
-76.81081 |
Hannan-Quinn criter. |
2.986891 |
Restr. log likelihood |
-90.10996 |
|
LR statistic |
26.59830 |
Avg. log likelihood |
-1.324324 |
|
Prob(LR statistic) |
0.000007 |
|
|
|
|
|
|
|
|
Note that the coefficients are labeled both with the identity of the limit point, and the coefficient number. Just below the limit points are the summary statistics for the equation.

270—Chapter 26. Discrete and Limited Dependent Variable Models
Estimation Problems
Most of the previous discussion of estimation problems for binary models (“Estimation Problems” on page 254) also holds for ordered models. In general, these models are wellbehaved and will require little intervention.
There are cases, however, where problems will arise. First, EViews currently has a limit of 750 total coefficients in an ordered dependent variable model. Thus, if you have 25 righthand side variables, and a dependent variable with 726 distinct values, you will be unable to estimate your model using EViews.
Second, you may run into identification problems and estimation difficulties if you have some groups where there are very few observations. If necessary, you may choose to combine adjacent groups and re-estimate the model.
EViews may stop estimation with the message “Parameter estimates for limit points are nonascending”, most likely on the first iteration. This error indicates that parameter values for the limit points were invalid, and that EViews was unable to adjust these values to make them valid. Make certain that if you are using user defined parameters, the limit points are strictly increasing. Better yet, we recommend that you employ the EViews starting values since they are based on a consistent first-stage estimation procedure, and should therefore be quite well-behaved.
Views of Ordered Equations
EViews provides you with several views of an ordered equation. As with other equations, you can examine the specification and estimated covariance matrix as well as perform Wald and likelihood ratio tests on coefficients of the model. In addition, there are several views that are specialized for the ordered model:
•Dependent Variable Frequencies — computes a one-way frequency table for the ordered dependent variable for the observations in the estimation sample. EViews presents both the frequency table and the cumulative frequency table in levels and percentages.
•Prediction Evaluation— classifies observations on the basis of the predicted response. EViews performs the classification on the basis of the category with the maximum predicted probability.
The first portion of the output shows results for the estimated equation and for the constant probability (no regressor) specifications.

Ordered Dependent Variable Models—271
Prediction Evaluation for Ordered Specification
Equation: EQ_ORDER
Date: 08/12/09 Time: 00:20
Estimated Equation
Dep. Value |
Obs. |
Correct |
Incorrect |
% Correct |
% Incorrect |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
18 |
10 |
8 |
55. |
556 |
44.444 |
2 |
14 |
6 |
8 |
42. |
857 |
57.143 |
3 |
10 |
0 |
10 |
0. |
000 |
100.000 |
4 |
9 |
3 |
6 |
33. |
333 |
66.667 |
5 |
7 |
6 |
1 |
85. |
714 |
14.286 |
Total |
58 |
25 |
33 |
43. |
103 |
56.897 |
|
|
|
|
|
||
|
|
Constant Probability Spec. |
|
|
||
|
|
|
|
|
|
|
Dep. Value |
Obs. |
Correct |
Incorrect |
% Correct |
% Incorrect |
|
|
|
|
|
|
|
|
1 |
18 |
18 |
0 |
100. |
000 |
0.000 |
2 |
14 |
0 |
14 |
0. |
000 |
100.000 |
3 |
10 |
0 |
10 |
0. |
000 |
100.000 |
4 |
9 |
0 |
9 |
0. |
000 |
100.000 |
5 |
7 |
0 |
7 |
0. |
000 |
100.000 |
Total |
58 |
18 |
40 |
31. |
034 |
68.966 |
|
|
|
|
|
|
|
Each row represents a distinct value for the dependent variable. The “Obs” column indicates the number of observations with that value. Of those, the number of “Correct” observations are those for which the predicted probability of the response is the highest. Thus, 10 of the 18 individuals with a DANGER value of 1 were correctly specified. Overall, 43% of the observations were correctly specified for the fitted model versus 31% for the constant probability model.
The bottom portion of the output shows additional statistics measuring this improvement
Gain over Constant Prob. Spec.
|
|
Equation |
Constant |
|
|
|
|
Dep. Value |
Obs. |
% Incorrect |
% Incorrect |
Total Gain* |
Pct. Gain** |
||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
18 |
44.444 |
0. |
000 |
-44.444 |
NA |
|
2 |
14 |
57.143 |
100. |
000 |
42. |
857 |
42.857 |
3 |
10 |
100.000 |
100. |
000 |
0. |
000 |
0.000 |
4 |
9 |
66.667 |
100. |
000 |
33. |
333 |
33.333 |
5 |
7 |
14.286 |
100. |
000 |
85. |
714 |
85.714 |
Total |
58 |
56.897 |
68. |
966 |
12. |
069 |
17.500 |
Note the improvement in the prediction for DANGER values 2, 4, and especially 5 comes from refinement of the constant only prediction of DANGER=1.

272—Chapter 26. Discrete and Limited Dependent Variable Models
Procedures for Ordered Equations
Make Ordered Limit Vector/Matrix
The full set of coefficients and the covariance matrix may be obtained from the estimated equation in the usual fashion (see “Working With Equation Statistics” on page 16). In some circumstances, however, you may wish to perform inference using only the estimates of the g coefficients and the associated covariances.
The Make Ordered Limit Vector and Make Ordered Limit Covariance Matrix procedures provide a shortcut method of obtaining the estimates associated with the g coefficients. The first procedure creates a vector (using the next unused name of the form LIMITS01, LIMITS02, etc.) containing the estimated g coefficients. The latter procedure creates a symmetric matrix containing the estimated covariance matrix of the g . The matrix will be given an unused name of the form VLIMITS01, VLIMITS02, etc., where the “V” is used to indicate that these are the variances of the estimated limit points.
Forecasting using Models
You cannot forecast directly from an estimated ordered model since the dependent variable represents categorical or rank data. EViews does, however, allow you to forecast the probability associated with each category. To forecast these probabilities, you must first create a model. Choose Proc/Make Model and EViews will open an untitled model window containing a system of equations, with a separate equation for the probability of each ordered response value.
To forecast from this model, simply click the Solve button in the model window toolbar. If you select Scenario 1 as your solution scenario, the default settings will save your results in a set of named series with “_1” appended to the end of the each underlying name. See Chapter 34. “Models,” beginning on page 511 for additional detail on modifying and solving models.
For this example, the series I_DANGER_1 will contain the fitted linear index xi¢ˆ . The fitted b
probability of falling in category 1 will be stored as a series named DANGER_1_1, the fitted probability of falling in category 2 will be stored as a series named DANGER_2_1, and so on. Note that for each observation, the fitted probability of falling in each of the categories sums up to one.
Make Residual Series
The generalized residuals of the ordered model are the derivatives of the log likelihood with respect to a hypothetical unit- x variable. These residuals are defined to be uncorrelated with the explanatory variables of the model (see Chesher and Irish (1987), and Gourieroux, Monfort, Renault and Trognon (1987) for details), and thus may be used in a variety of specification tests.