Economics Network CHEER Virtual Edition

Volume 15, Issue 1, 2002

Estimation of technical efficiency: a review of some of the stochastic frontier and DEA software.

Inés Herrero
Department of Applied Economics, University of Huelva, and
Sean Pascoe
CEMARE, Department of Economics, University of Portsmouth

The level of technical efficiency of a particular firm is characterised by the relationship between observed production and some ideal or potential production (Greene 1993). The measurement of firm specific technical efficiency is based upon deviations of observed output from the best production or efficient production frontier. If a firm's actual production point lies on the frontier it is perfectly efficient. If it lies below the frontier then it is technically inefficient, with the ratio of the actual to potential production defining the level of efficiency of the individual firm.

Farrell's (1957) definition of technical efficiency led to the development of methods for estimating the relative technical efficiencies of firms. The common feature of these estimation techniques is that information is extracted from extreme observations from a body of data to determine the best practice production frontier (Lewin and Lovell 1990). From this the relative measure of technical efficiency for the individual firm can be derived. Despite this similarity the approaches for estimating technical efficiency can be generally categorised under the distinctly opposing techniques of parametric and non-parametric methods (Seiford and Thrall 1990).

Stochastic estimations incorporate a measure of random error. This involves the estimation of a stochastic production frontier, where the output of a firm is a function of a set of inputs, inefficiency and random error. An often quoted disadvantage of the technique, however, is that they impose an explicit functional form and distribution assumption on the data. In contrast, the linear programming technique of data envelopment analysis (DEA) does not impose any assumptions about functional form, hence it is less prone to mis-specification. Further, DEA is a non-parametric approach so does not take into account random error. Hence, it is not subsequently subject to the problems of assuming an underlying distribution about the error term. However, since DEA cannot take account of such statistical noise, the efficiency estimates may be biased if the production process is largely characterised by stochastic elements.

The purpose in this paper is to present a range of methods for estimating technical efficiency, and to review the software that is available to do so.

Economic, allocative and technical efficiency

Technical efficiency is just one component of overall economic efficiency. However, in order to be economically efficient, a firm must first be technically efficient. Profit maximisation requires a firm to produce the maximum output given the level of inputs employed (i.e. be technically efficient), use the right mix of inputs in light of the relative price of each input (i.e. be input allocative efficient) and produce the right mix of outputs given the set of prices (i.e. be output allocative efficient) (Kumbhaker and Lovell 2000). These concepts can be illustrated graphically using a simple example of a two input (x1, x2)-two output (y1, y2) production process (Figure 1). Efficiency can be considered in terms of the optimal combination of inputs to achieve a given level of output (an input-orientation), or the optimal output that could be produced given a set of inputs (an output-orientation).

In Figure 1(a), the firm is producing a given level of output (y1*, y2*) using an input combination defined by point A. The same level of output could have been produced by radially contracting the use of both inputs back to point B, which lies on the isoquant associated with the minimum level of inputs required to produce (y1*, y2*) (i.e. Iso(y1*, y2*)). The input-oriented level of technical efficiency (TEI(y,x)) is defined by 0B/0A. However, the least-cost combination of inputs that produces (y1*, y2*) is given by point C (i.e. the point where the marginal rate of technical substitution is equal to the input price ratio w2/w1). To achieve the same level of cost (i.e. expenditure on inputs), the inputs would need to be further contracted to point D. The cost efficiency (CE(y,x,w)) is therefore defined by 0D/0A. The input allocative efficiency (AEI(y,w,w)) is subsequently given by CE(y,x,w)/TEI(y,x), or 0D/0B in Figure 1.1(a) (Kumbhaker and Lovell 2000).

The production possibility frontier for a given set of inputs is illustrated in Figure 1(b) (i.e. an output-orientation). If the inputs employed by the firm were used efficiently, the output of the firm, producing at point A, can be expanded radially to point B. Hence, the output oriented measure of technical efficiency (TEO(y,x)), can be given by 0A/0B. This is only equivalent to the input-oriented measure of technical efficiency under conditions of constant returns to scale. While point B is technically efficient, in the sense that it lies on the production possibility frontier, a higher revenue could be achieved by producing at point C (the point where the marginal rate of transformation is equal to the price ratio p2/p1). In this case, more of y1 should be produced and less of y2 in order to maximise revenue. To achieve the same level of revenue as at point C while maintaining the same input and output combination, output of the firm would need to be expanded to point D. Hence, the revenue efficiency (RE(y,x,p)) is given by 0A/0D. Output allocative efficiency (AEO(y,w,w)) is given by RE(y,x,w)/TEI(y,x), or 0B/0D in Figure 1(b) (Kumbhaker and Lovell 2000).



Figure 1: Input (a) and output (b) oriented efficiency measures

Stochastic production frontier software

Stochastic frontiers can be estimated using a different range of multi-purpose econometric software which can be adapted for the desired estimation. This software includes known statistical packages such as LIMDEP, TSP, Shazam, GAUSS, SAS, etc. The two most commonly used packages for estimating of stochastic production frontiers and inefficiency are FRONTIER 4.1 (Coelli 1996a) and LIMDEP (Greene 1995). A recent review of both packages is provided by Sena (1999).

FRONTIER 4.1 is a single purpose package specifically designed for the estimation of stochastic production frontiers (and nothing else), while LIMDEP is a more general package designed for a range of non-standard (i.e. non-OLS) econometric estimation. An advantage of the former model (FRONTIER) is that estimates of efficiency are produced as a direct output from the package. The user is able to specify the distributional assumptions for the estimation of the inefficiency term in a program control file. In LIMDEP, the package estimates a one-sided distribution, but the separation of the inefficiency term from the random error component requires additional programming.

FRONTIER is able to accommodate a wider range of assumptions about the error distribution term than LIMDEP (Table 1), although it is unable to model exponential distributions. Neither package can include gamma distributions. Only FRONTIER is able to estimate an inefficiency model as a one-step process. An inefficiency model can be estimated in a two-stage process using LIMDEP. However, this may create bias as the distribution of the inefficiency estimates is pre-determined through the distributional assumptions used in its generation.

Table 1. Distributional assumptions allowed by the software

Distribution LIMDEP FRONTIER
Time invariant firm specific inefficiency
· Half-normal distribution Yes Yes
· Truncated normal distribution Yes Yes
· Exponential distribution Yes No
Time variant firm specific inefficiency
· Half-normal distribution No Yes
· Truncated normal distribution No Yes
One step inefficiency model No Yes

Source: Sena (1999)

FRONTIER 4.1

The most commonly used package for estimation of stochastic production frontiers in the literature is FRONTIER 4.1 (Coelli, 1996a). This incorporates the maximum likelihood estimation of the parameters. The estimation process consists on three main steps. At the first step OLS is applied to estimate the production function. This provides unbiased estimators for the b's (except for the intercept term and the variance estimate). The OLS estimates are used as starting values to estimate the final ML model. First, the value of the likelihood function is estimated for different values of g between 0 and 1 given the values for the b's derived in the OLS. Finally an interative Davidon-Fletcher-Powell algorithm calculates the final parameter estimates, using the values of the b's from the OLS and the value of g from the intermediate step as starting values.

FRONTIER 4.1 has been created specifically for the estimation of production frontiers. As such, it is a relatively easy tool to use in estimating stochastic frontier models. It is flexible in the way that it can be used to estimate both production and cost functions, can estimate both time-varying and invariant efficiencies, or when panel data is available, and it can be used when the functional form have the dependent variable both in logged or in original units.

FRONTIER solves two general models. The error components model can be formulated as

Yit = Xit b + (Vit - Uit)

Where Yit is the (logged) output obtained by the i-th firm in the t-th time period; Xit is a (kx1) vector of (transformation of the) input quantities of the i-th firm in the t-th time period; b is a (kx1) vector of unknown parameters; and Vit are assumed to be iid N(0, sv2) random errors, and Uit = Ui exp (-h(t-T)), where Ui are assumed to be iid as truncations at zero of the N(mi, su2).

This is the Battese and Coelli (1992) model. However some other models can be summarized as special cases of this one and can also be solved using FRONTIER. Setting h=0, the time invariant model of Battese, Coelli and Colby (1989) is obtained. The Battese and Coelli (1988) model results from the previous one for the particular case of problems in which balanced data is available. If we add m=0 to the aforementioned assumptions, the Pitt and Lee (1981) model results. And if we finally set T=1 in the Pitt and Lee model, we obtain the original cross-sectional data model of Aigner, Lovell and Schmidt (1977).

If h>0, the inefficiency term, Uit, is always decreasing with time, whereas h<0 implies that Uit is always increasing with time. That could be one of the main problems when using this model, technical efficiency is forced to be a monotonous function of time.

The second model included in the FRONTIER package is the Technical Efficiency (TE) effects model (Battese and Coelli 1995). It can be expressed as Yit = Xit b + (Vit - Uit), where Yit, Xit, b and Vit are as defined earlier and Uit ~N(mit, su2), where mit = Zitd, Zit is the vector of firm-specific variables which may influence the firms' efficiency. FRONTIER offers also the solution of the model of Stevenson (1980) which is a particular case of the previous model that can be obtained for the cases in which T is equal to 1 (for cross-sectional data).

There are two approaches to estimating the inefficiency models. These may be estimated with either a one step or a two step process. For the two-step procedure the production frontier is first estimated and the technical efficiency of each firm is derived. These are subsequently regressed against a set of variables, Zit, which are hypothesised to influence the firms' efficiency. A problem with the two-stage procedure is the inconsistency in the assumptions about the distribution of the inefficiencies. In the first stage, the inefficiencies are assumed to be independently and identically distributed (iid) in order to estimate their values. However, in the second stage, the estimated inefficiencies are assumed to be a function of a number of firm specific factors, and hence are not identically distributed unless all the coefficients of the factors are simultaneously equal to zero (Coelli, Rao and Battese, 1998). FRONTIER uses the ideas of Kumbhakar, Ghosh and McGuckin (1991) and Reifschneider and Stevenson (1991) and estimates all of the parameters in one step to overcome this inconsistency. The inefficiency effects are defined as a function of the firm specific factors (as in the two-stage approach) but they are then incorporated directly into the MLE. This is something that should be taken into consideration when programming in some of the general statistical packages.

FRONTIER offers a wide variety of tests on the different functional forms of the models that can be conducted easily by placing restrictions on the models and testing the significance of the restrictions using the likelihood ratio test. The FRONTIER program is easy to use. A brief instruction file and a data file have to be created. The executable file and the start-up file can be downloaded from the Internet free of charge at the CEPA (University of New England) web page: http://www.uq.edu.au/economics/cepa/frontier.htm.

Other packages

LIMDEP is a multi-purpose econometric package that can be programmed to estimate several stochastic frontier models using some of the stochastic frontier estimation routines that are already implemented in the package. The estimation process can be carried out when the inefficiency variable is assumed to have the half-normal and truncated normal distribution but also when the exponential distribution is assumed. Hence, when there seems to be strong evidence to assume an exponential distribution, LIMDEP may be a better option than FRONTIER. Among other features, LIMDEP offers the possibility of including random effects for panel data and estimates of one-sided disturbance components. The package includes the optimisation of non-linear regression with two-step least squares and user defined maximum likelihood estimation. There is a LIMDEP Web page at http://www.limdep.com/

The flexibility feature of GAUSS allows the possibility to be adapted for the estimation of production frontiers. GAUSS includes some non-linear optimisation routines which can be used for this purpose. Being a very general statistical package also has some disadvantages and the programming structure is not as easy as some of the other software packages. The GAUSS program can be ordered at http://www.aptech.com/prod.html. There is also a Web page that provides answers to questions frequently asked about the program: http://www.indiana.edu/~statmath/gauss/index.html

The TSP package can be specifically adapted for the estimation of the most basic models of production frontiers, using the ML user programmed likelihood functions. The program also includes non-linear three-stage least squares. The program can be ordered and more information can be obtained at: http://www.tspintl.com/products/tsp

The statistical program Shazam can also be adapted for the estimation of production frontiers. It allows the optimisation of non-linear functions either by Maximum Likelihood or least squares and it includes some hypothesis testing. Information about the program can be found at: http://shazam.econ.ubc.ca/

DEA software

Most of the general-purpose mathematical optimisation software can be adapted to solve Data Envelopment Analysis problems. Examples of program code for DEA models have already been published and are readily adaptable (e.g. Olesen and Petersen (1995) present the GAMS code for a DEA model that can be adapted to suit most analyses). Emrouznejad (2000) developed a SAS program for different DEA models, including different options as input/output orientation and CRS/VRS. The Emrouznejad (2000) program and comments can be downloaded from: http://www.deazone.com/software/index.htm#sasdea.

These general programs offer the possibility of a wide range of applications using non-specialist DEA software. However, there are several DEA-specific programs that provide a variety of interesting facilities. Seven of the most common ones are listed below, including the sources where further information can be found:

  1. EMS 1.3 (Efficiency measurement system) by H. Scheel. University of Dortmund. Germany. This can be downloaded from the net. http://www.wiso.uni-dortmund.de/
  2. DEAP 2.1 (Coelli, 1996b). Centre for Efficiency and Productivity Analysis (CEPA). University of New England, NSW, Australia. It is now free and can also be downloaded from the net, as well as its manual from the CEPA Web pages http://www.uq.edu.au/economics/cepa/deap.htm
  3. Frontier Analyst, version 3. Professional Edition. Banxia Software Ltd. Glasgow. Scotland. http://www.banxia.com/
  4. WARWICK DEA software. Aston Business School. University of Aston. Birmingham, England, UK. http://www.warwick.ac.uk/~bsrlu/
  5. OnFront 2. Economic Measurement and Quality I Lund AB Lund. Sweden. http://www.emq.se/software.html
  6. DEA-Solver Professional 2.0. Saitech Inc. New Jersey. U.S.A. This is the most recent package. Its manual is the Cooper, Seiford and Tone (2000) book (Data Envelopment Analysis. Kluwer Academic Publishers). http://www.saitech-inc.com/
  7. IDEAS 6.1. Professional version. Consulting Inc., Amherst, MA, USA. http://www.ideas2000.com/

Package Facilities

The key features of the different software packages have been summarised in Table 2.

Table 2. Key features of the packages

Window
analysis
Malmq.
index
Weight
restr.
Input/
output
orient.
A&P
Super-
Effic.
Exog.
Fixed
Vbles.
Catego-
rical
vbles.
Multi
Stage
DEA
CRS
VRS
NIRS Addit.
Model
RTS
info
EMS XXXXXX   XXX 
DEAP  X X    XX  X
BANXIA   XX X   X  X
WARWICK (*)(*)XXXX   X XX
OnFront 2 (*)X X(*)X   XX X
DEA Solver X XXXX X XX X
IDEAS (*)  X X X X XX

(*) This can be carried out by doing some modifications to the program

EMS and DEA-solver appear to offer the most options. Of the software examined, only EMS and DEA-Solver can be used to carry out window analysis when time series data are available, although if this is the case, DEAP and OnFront can be used for the calculation of Malmquist indices. OnFront2 also includes many different options, some of them not offered by any other package, such as the ability to carry out input and output congestion measures and Capacity Utilization measures. Warwick software can be adapted to compute Malmquist indices but only by running the DOS version in batch mode to compute efficiencies. These efficiency scores are needed later in order to compute the indices; similarly it can be adapted to carry out window analysis.

The additive model can be estimated using EMS and Warwick software, and the Slacks-Based-Measure (SBM) model using either EMS or DEA-solver.

All the programs under study - except DEAP - allow the possibility of choosing between an input or output orientation, the assumption of either constant or variable returns to scale, the possibility of including non discretionary variables (i.e. exogenously fixed variables), and offer some scale information. This latter possibility has recently been added in the new version of OnFront under the name of vector analysis.

Banxia lacks some of the options that other programs offer, such as window analysis, but it presents the advantage of offering many graphs relating efficiency to the different inputs and outputs. It also allows the introduction and/or elimination of some of the decision-making units (DMUs). It is a very user-friendly package and very easy to work with even for those who do not have a wide knowledge of DEA. A thorough user´s guide is also provided. One-day workshops introducing some of the theory of DEA, its practical applications and how to handle the program take place regularly in London. Details regarding contents and cost information can be found at: http://www.banxia.com/training/bxfanalyst.html

Regarding the use of categorical variables, only IDEAS and DEA-Solver offer their inclusion as an option in the different models. The possibility of including weak or strong disposability of inputs has been added in the new version of OnFront.

EMS, Warwick software and the new version of OnFront are the only packages that compute Andersen and Petersen super-efficiency scores. EMS and DEA-solver include the possibility of working with non-convex technologies and to compare different programs' efficiencies.

EMS and DEAP offer a good on-line guide for users. Banxia and OnFront provide a reference guide to users together with the program. DEA-Solver program just includes a very brief reference guide in the package, while more detailed explanations about the options can be found in the associated reference book (Cooper et al. 2000).

Operating System and PC Requirements.

Most of the software packages run under Windows though some of them, like OnFront and Warwick DEA, offer both DOS and Windows versions. There are some others like PIONEER (not included in the above table) that run under Unix (for additional information contact barr@seas.smu.edu). DEAP is the only package among those presented here that does not offer a Windows version. It is based on MS-DOS and the input files have to be text files. It also lacks some of the options that other programs offer. In spite of these disadvantages, DEAP presents many desirable technical characteristics. Foremost of these is that DEAP allows a multi-step procedure to be used in solving the DEA models. This can be either a one-step, two-step or a multi-stage procedure. This is important as using a one-step procedure can produce instabilities. Many of the other programs, such as the Warwick Software, avoid instabilities using a two-step procedure but wrong peers can be obtained for some of the inefficient units. Both problems can be overcome by the use of the DEAP multi-step procedure. DEAP does not requires a large memory PC and being currently free of charge is another good reason for using it.

The operating System used by all programs, with the exception of DEAP, is Windows 95 or later(note) and NT. EMS accepts Excel97 or text data files. Warwick and OnFront require input in the form of text files (although this restriction can be lessened using the Windows clipboard). The new version of Banxia offers a direct data import from Excel and SPSS and the reports produced there can be directly exported to RTF (as Word), Excel and html files. The DEA-solver is based in Excel 97. Access, Dbase, Excel and Text files are all accepted files for IDEAS. Requirements regarding the capacity of the computer are highly related to how sophisticated the software is. The most demanding are Banxia and OnFront. Banxia software requires 50MB of hard disk space, 32 MB of RAM and they strongly recommend using a Pentium PC. OnFront requires at least 16 MB of RAM and 8Mb of disk space. DEAP only requires 4 MB of RAM and it can even be run using a 386 PC. However it is not as user-friendly as the others.

Software Prices

The prices in 2001 (excluding VAT, shipping and handling) for the software under analysis are presented in Table 3.

Table 3. Price for main DEA software, 2001 (excludes VAT)

Academic price Commercial price Additional licence
(academic)
Additional licence
(commercial)
EMS 1.3 Free Free Free Free
DEAP 2.1 Free Free Free Free
BANXIA 3 £195-595 £395-2395 35% discount 35% discount
WARWICK £200/500 £200/500 - -
OnFront2 $750 $1750 $450 $900
DEAsolver 2.0 $800 $1600 $400 $800
IDEAS $140/1400 $280/2800 - -

The prices are for individual licences either for academic or business purposes. There are some other prices for Campus licences, educational purposes, etc. Some of the software presented here, such as Banxia, Warwick or IDEAS, can be obtained at different prices if the number of DMUs under study is limited. Banxia software offers a wide variety of prices depending on the maximum number of units it analyses. The lower level is for 75 units (£195) then 250 (£295), 500 units (£395) 1500 units (£495) and 2500 units (£595). The same range can be bought for commercial purposes and prices vary from £395 to £2395. Warwick software offers a limited version for analysis undertaking less than 50 units (£200) and a full version for an unlimited number of DMUs (£500). The same applies for the DOS Warwick version (£150/300). IDEAS also offers two versions, the standard version which allows analyses of up to 500 DMUs, 20 variables and 20 price ratios and the professional version for analyses of up to 2,000 DMUs, 25 variables and 120 price ratios. The rest of the programs can be used for any number of units (within the capacity of the memory of the computer). Banxia has recently launch some extra packages at different prices: Cross efficiency module (£95), 500 unit three-month-trial licence programme (£295), Support and maintenance for a year (£229) and the upgraded from the previous version is offered by obtaining the CD and new manual (£169) or by obtaining an electronic copy (£129, with no physical products).

Acknowledgement

This review was undertaken as part of the EU funded project 'Technical efficiency in EU fisheries: implications for monitoring and management through effort controls' (QLK5-CT1999-01295).

Note

Warwick software also allows for Windows 3.1

References

Aigner, D., C. A. K. Lovell and P. Schmidt (1977). Formulation and estimation of stochastic frontier production function models. Journal of Econometrics 6: 21-37.

Banxia Software (1998). Banxia Frontier Analyst. User's Guide. Professional Edition. Frontier Analyst.

Battese, E. and T. J. Coelli (1988). Prediction of firm level technical inefficiencies with a generalised frontier production function and panel data, Journal of Econometrics, 38, 387-399.

Battese, G. E., T. J. Coelli and T. C. Colby, (1989) Estimation of Frontier Production Functions and the Efficiencies of Indian Farms Using Panel Data from ICRISAT's Village Level Studies. Journal of Quantitative Economics, 5(2): 327-48

Battese, G. E. and T. J. Coelli (1992). Frontier Production Function, technical efficiency and panel data: with application to Paddy Farmers in India. Journal of Productivity Analysis 3: 153-169.

Battese, G. E. and T. J. Coelli (1995). A model for technical inefficiency effects in a stochastic frontier production function for panel data. Empirical Economics 20: 325-332.

Coelli, T. (1996a). A guide to FRONTIER version 4.1: a computer program for frontier production function estimation. CEPA Working Paper 96/07, Department of Econometrics, University of New England, Armidale, Australia.

Coelli, T. J. (1996b). 'A guide to DEAP version 2.1: A Data Envelopment Analysis (Computer) program'. CEPA Working Papers No.8/96. ISSN 1327-435X. ISBN 1 86389 4969.

Coelli, T. J., D. S. P. Rao and G. E. Battese (1998). An Introduction to Efficiency and Productivity Analysis, Kluwer Academic Publishers: USA.

Cooper, W. W., Seiford, L. M. and Tone, K. (2000)., Data Envelopment Analysis. Kluwer Academic Publishers. ISBN 0792386930.

Emrouznejad, A. (2000). An Extension to SAS/OR for Decision System Support, Proceeding of the 25 SAS User Group International Conference, April 2000, Indianapolis pp1456-1461.

Färe, R. and Grosskopf, S. (1998). 'Reference Guide to OnFront'. Economic Measurement and Quality I Lund AB Lund. Sweden.

Farrell, M. J. (1957). The measurement of productive efficiency, Journal of the Royal Statistical Society, 120, 252-90.

Greene, W. (1995). LIMDEP (Version 7): User's Manual and Reference Guide, Econometric Software Inc., New York.

Greene, W. H. (1993). "Frontier Production Functions", EC-93-20. Stern School of Business, New York University.

Hollingsworth, B., (1997). 'A review of Data Envelopment Analysis Software'. The Economic Journal, 1997, 1268-1270

Hollingsworth, B., (1999). Data Envelopment Analysis and Productivity Analysis: a review of the options. The Economic Journal, 458-462.

Kumbhaker, S. C. and Lovell C. A. K. (2000). Stochastic Frontier Analysis. Cambridge: Cambridge University Press.

Kumbhakar, S., S. Ghosh and J. McGuckin. (1991). A generalized Production Frontier Approach for Estimating Determinants of Inefficiency in U.S. Dairy Farms. Journal of Business and Economic Statistics, 9:279-286.

Lewin, A. Y. and C. A. K. Lovell (1990). Editors Introduction, Journal of Econometrics, 46, 3-5.

Olesen and Petersen (1995). A presentation of GAMS for DEA. Computers & Operations Research 23(4) pp 323-339

Pitt, M. M. and L. F. Lee. (1981). The measurement and sources of technical inefficiency in the Indonesian weaving industry. Journal of Development Economics, 9:43-64.

Reifschneider, D. and R. Stevenson. (1991). Systematic departures from the frontier: A framework for the analysis of firm inefficiency. International Economic Review, 32 (3):715-723.

Scheel, H. (2000). EMS: Efficiency Measurement System User´s Manual. Version 1.3

Seiford, L. M. and R. M. Thrall (1990) Recent developments in DEA: the mathematical programming approach to frontier analysis, Journal of Econometrics, 46, 7-38.

Sena, V. (1999). Stochastic frontier estimation: a review of the software options. Journal of Applied Econometrics, 14: 579-586.

Stevenson, R. E., (1980). Likelihood function for generalized stochastic frontier estimation, Journal of Econometrics, 13, 57-66.

Top | CHEER Home

Copyright 1989-2007