Note On Two
Estimators For The Polynomial Regression With Errors In The Variables
Department
of Mathematics, College of Science,
King
Khalid University,
Abha,
Saudi Arabia
دراسة عن تقدرين لدالة
الانحدار متعددة الحدود ذات الأخطاء
في المتغيرات
مقدري معالم دالة بعلاقة انحدار تربيعية ذات الأخطاء في المتغيرات لحالتين حين وجود الخطأ، أو عدمه في المعادلة، ثم تعميمه في أ نموذج بدالة ذات علاقة متعددة الحدود بأي درجة، وبإرتباط أخطاء الاستجابة، ومتغيرات دالة الانحدار، وبأخطاء ليست بالضرورة طبيعية، كماتم اشتقاق مصفوفة التغاير لهذه المقدرات.
The estimators for the parameters of the quadratic regression relationship with errors in the variables for two different cases regarding the presence or absence of errors in the equation are generalized to the model of a polynomial functional relationship of any degree with correlated errors of response and regressor variables which is not necessarily normal error. The asymptotic covariance matrix of these estimators is also derived.
Keywords: Polynomial regression, functional relationship, errors in variables, adjusted least squares.
![]()
INTRODUCTION
Many investigators (Fuller
1987; Wolter & Fuller 1982) have considered the quadratic functional relationships
with errors in the variables for two different cases regarding the presence
(case 1) or absence (case 2) of errors in the equation. They developed
estimators for the parameters of the quadratic relationship in both cases,
assuming that, in case 1, the error variance of the regressor variable or, in
case 2, the error variances of dependent and regressor variables be known. In
case 1, the errors of dependent and regressor variables were assumed to be
uncorrelated. In both cases the errors were taken to be normally distributed.
Here, both these
estimators will be generalized to the model of a polynomial functional
relationship of any degree and with correlated errors of dependent and
regressor variables and with not necessarily normal errors. In case 1, the
resulting estimators seem to be
the same as the one developed by Cheng & Schneeweiss (1996). They
derived the asymptotic covariance
matrix of the estimators of case 1.
The same will be done here
for the estimator of case 2. For a discussion of the practical relevance of the
two cases (see Cheng & Schneeweiss (1996)) and the literature cited e.g. in Carroll & Ruppert (1996)
and Cheng & Van Ness (1994).
CASE
1: ERRORS IN THE EQUATION
Consider a polynomial
functional relationship with errors in the equation:
(1)
![]()
where
are i.i.d. random
errors with expectation 0 and covariance matrix
The
are unobservable
(latent) nonstochastic variables. The regressor error variance
and the covariance
of regressor
error
and dependent
variable error
are assumed to
be known. The variance
, which contains the error-in-the-equation variance component,
is unknown. If the error variables are jointly normally distributed we have
(N)
It is well-known that replacing
the latent variable
by its
observable counterpart x in the polynomial relationship and estimating the
parameters
in the resulting
polynomial regression by OLS (Ordinary Least Squares) yields inconsistent
estimates (Grilliches & Ringstad 1970). As a first step to remove this
inconsistency, Fuller (1987) suggests to view the powers of
as k+1 different latent regressor variables, for which their
observable counterparts unbiased estimates
computable from
the data are available, so that a linear functional relationship results:
![]()
![]()
where the
are the new
measurement errors, with
. Let
, then the model can be written as
![]()
(2)
For this linear functional
relationship a consistent estimator of
can now be constructed if unbiased estimates
and
of, respectively,
the covariance matrix
and the
covariance vector
are available.
The error adjusted least squares normal equations are given by (all summations
are for
):
,
or
,
(3)
where the - denotes averages over i = 1, …, n; for the quadratic
relationship (Fuller 1987). Following the idea of Chan & Mak (1985), the
estimates
of
are easily
constructed as certain polynomials in
of degree r.
Their coefficients depend on higher moments of
up to the order
of 2r, which are assumed to be known. In case of (N) only
needs to be
known, and the
can be computed
by the recursive relation
with
. For non-normal errors
the variables
,
can be computed
by solving the triangular linear system
,
,
, ![]()
where the higher moments
of
are assumed to be known.
The covariance matrix
is given by
![]()
(4)
The elements of
are powers of
and can
therefore be estimated by the variables
. Let
,
then
. Thus, an unbiased estimate of
is given by
.
Similarly,
.
Cheng & Schneeweiss
(1996) derived an unbiased estimate of
in terms of a
linear combination of the
, the coefficients of which depend on
and
only,
, and which they denoted by
. Thus
. In case of (N),
. The normal equation (1) can now be written as
![]()
with
,
, and
.
This is exactly the normal
equations system for the ALS estimator of Cheng & Schneeweiss (1996). Its
solution
is consistent
for
and under
general conditions asymptotically normally distributed. The asymptotic
covariance matrix of
can be estimated
by Cheng & Schneeweiss (1996).
,
where
![]()
CASE
2: NO ERRORS IN THE EQUATION
Wolter & Fuller (1982)
construct an estimate of the quadratic functional relationship when the whole
of
is known. This
corresponds to the case where there is no error in the equation but an only
measurement error is known. The estimator can be computed without any
iterations. It can be generalized to the case of a polynomial functional
relationship. Let
and let
be the error covariance
matrix of
and
its estimate, as
given below:

Furthermore, let
,
,
,
, so that
![]()
Then a generalization of
Wolter and Fuller’s estimator is given by
,
(5)
where
is the smallest positive
root (eigenvalue) of
;
see also Moon & Gunst
(1995) for the special case (N).
THE ASYMPTOTIC COVARIANCE
MATRIX OF
IN (CASE 2)
Under general conditions
is
asymptotically normally distributed with an asymptotic covariance matrix
which can be
computed as follows. First note that with
and
the estimating
equation (5) for
can be written
as:
,
(6)
where
is the smallest
positive eigenvalue and
the
corresponding eigenvector.
Let
, where
,
(7)
,
, and
. For large n all these differences will be small in
probability and the estimating equation (6) can be expanded as
![]()
This can be simplified
with the help of equation (7) and
using the fact that
to:
Deleting the first
equation of this system we get the following system for
:
(8)
Now by equation (6):
,
and taking differences
![]()
Thus

Substituting this
expression for
in (8) we get:
(9)
with
and

(10)
Obviously
, and by the central limit theorem
converges in
distribution to a normal distribution with covariance matrix
![]()
Thus by equation (9),
also converges
to a normal distribution with covariance matrix

An estimate of the
asymptotic covariance matrix of
is given by
,
with
![]()
The author is greatly
indebted to the referees and editorial board for their useful comments and
suggestions.
REFERENCES
Carroll, R.J., &
Ruppert, D. 1996. The use and misuse
of orthogonal regression in linear errors-in-variables models. The American
Statistician 50: 1-6.
Chan, L.K., & Mak,
M.K. 1985. On the polynomial
functional relationship. Journal of the Royal Statistical Society 47: 510-518.
Cheng, C.L., &
Schneeweiss, H. 1996. The polynomial
Regression with errors in the variables. Research report 386, University of
Munich, Munich, Germany.
Cheng, C.L., & Van
Ness, J.W. 1994. On estimating Linear
Relationship when both variables are subject to error. Journal of the Royal
Statistical Society 56 (1): 167-183.
Fuller, W.A. 1987. Measurement error models, John Wiley, New York.
Grilliches, Z. &
Ringstad, V. 1970.
Errors-in-variables bias in nonlinear context. Econometrica 38: 368-370.
Moon, M.S. & Gunst,
R. 1995. Polynomial measurement error
modeling. Computational Statistics and Data Analysis 19: 1-21.
Wolter, K.H., &
Fuller, W.A. 1982. Estimation of the
quadratic errors-in-variables model. Biometrica 69: 175-182.
(Received 1/10/1419; 18th January 1999, accepted
12/8/1420; 20th November 1999)