Fitting straight lines when both variables are subject to error.

Thomas A. Jones1
1Exxon Production Research Company, Houston, USA

Tóm tắt

Usual methods for fitting a straight line, Y =α + βX,to data fail if the “independent” variable Xis subject to error. The problem is further complicated if there is no strong reason for selecting one of the two variables as independent; neither of the two lines may be correct. This review article discusses the maximum likelihood estimators of α and β under functional and structural models. These models involve differing assumptions about the statistical distributions of the dependent and independent variables. In addition, least-squares procedures are also considered. All these methods lead to the same result, a quadratic equation which can be solved to give an estimate of β. This result requires knowledge of the ratio of the error variances, λ = φ 2/τ2, where φ2 is the variance of the Yresiduals and τ 2 is the variance of the X residuals. If φ 2 and τ2 are unknown, estimates of λ can be difficult to obtain. If replicate sampling was employed, estimates of the variances can be made, and then of λ.

Tài liệu tham khảo

Acton, F. S., 1959, Analysis of straight-line data: Dover Publications, Inc., New York, 267 p. Barnett, V. D., 1970, Fitting straight lines—the linear functional relationship with replicated observations: Appl. Stat., v. 19, p. 135–144. Bartlett, M. S., 1949, Fitting a straight line when both variables are subject to error: Biometrics, v. 5, p. 207–212. Berkson, J., 1950, Are there two regressions?: Jour. Amer. Stat. Assoc., v. 45, p. 164–180. Berner, R. A., 1976, The solubility of calcite and aragonite in seawater at atmospheric pressure and 34.5‰ salinity: Amer. Jour. Sci., v. 276, p. 713–730. Birch, M. W., 1964, A note on the maximum likelihood estimate of a linear structural relationship: Jour. Amer. Stat. Assoc., v. 59, p. 1175–1178. Brooks, C., Hart, S. R., and Wendt, I., 1972, Realistic use of two-error regression treatments as applied to rubidium-strontium data: Rev. Geophys. Space Phys., v. 10, p. 551–577. Brown, R. L., 1957, Bivariate structural relation: Biometrika, v. 44, p. 84–96. Creasy, M. A., 1956, Confidence limits for the gradient in the linear functional relationship: Jour. Royal Stat. Soc., Ser. B, v. 18, p. 65–69. Davies, R. B., and Hutton, B., 1975, The effects of errors in the independent variables in linear regression: Biometrika, v. 62, p. 383–391. Deming, W. E., 1943, Statistical adjustment of data: John Wiley and Sons, Inc., New York, 261 p. Dolby, G. R., 1972, Generalized least squares and maximum likelihood estimation of nonlinear functional relationships: Jour. Royal Statistical Society, Series B, v. 34, p. 393–400. Dolby, G. R., 1976a, A note on the linear structural relation when both residual variances are known: Jour. Amer. Stat. Assoc., v. 71, p. 352–353. Dolby, G. R., 1976b. The ultrastructural relation: a synthesis of the functional and structural relations: Biometrika, v. 63, p. 39–50. Goldberger, A. S., 1964, Econometric theory: John Wiley and Sons, Inc., New York, 399 p. Graybill, F. A., 1961, An introduction to linear statistical models: McGraw-Hill Book Co., New York, 463 p. Hodges, S. D., and Moore, P. G., 1972, Data uncertainties and least squares regression: Appl. Stat., v. 21, p. 185–195. Hogg, R. V., and Craig, A. T., 1965, Introduction to mathematical statistics: The Macmillan Company, New York, 383 p. Imbrie, J., 1956, Biometrical methods in the study of invertebrate fossils: Bull. Amer. Museum Natural History, v. 108, p. 211–252. Jones, H. E., 1937, Some geometrical considerations in the general theory of fitting lines and planes: Metron, v. 13, p. 21–30. Kendall, M. G., 1951, Regression, structure, and functional relationship. I: Biometrika, v. 38, p. 11–25. Kendall, M. G., 1952, Regression, structure, and functional relationship. II: Biometrika, v. 39, p. 96–108. Kendall, M. G., and Stuart, A., 1961, The advanced theory of statistics, vols. 1 and 2: Hafner Publishing Co., New York, 676 p. Kermack, K. A., and Haldane, J. B. S., 1950, Organic correlation and allometry: Biometrika, v. 37, p. 30–41. Kruskal, W. H., 1953, On the uniqueness of the line of organic correlation: Biometrics, v. 9, p. 47–58. Lindgren, B. W., 1961, Statistical theory: Macmillan Co., New York, 427 p. Lindley, D. V., 1947, Regression lines and the linear functional relationship: Jour. Royal Stat. Soc. London Suppl. Ser. B, v. 9, p. 218–244. Madansky, A., 1959, The fitting of straight lines when both variables are subject to error: Jour. Amer. Stat. Assoc., v. 54, p. 173–205. Mark, D. M., and Church, M., 1977, On the misuse of regression in earth science: Jour. Math. Geology, v. 9, p. 63–75. McIntyre, G. A., Brooks, C., Compston, W., and Turek, A., 1966, The statistical assessment of Rb-Sr isochrons: Jour. Geophys. Res., v. 71, p. 5429–5468. Miller, R. L., and Kahn, J. S., 1962, Statistical analysis in the geological sciences: John Wiley and Sons, Inc., New York, 483 p. Moran, P. A. P., 1971, Estimating structural and functional relationships: Jour. Multivariate Analysis, v. 1, p. 232–255. Pearson, K., 1901, On lines and planes of closest fit to systems of points in space: Philosophical Magazine, v. 2, series 6, p. 559–572. Richardson, D. H., and Wu, D., 1970, Alternative estimators in the errors in variables model: Jour. Amer. Stat. Assoc., v. 65, p. 724–748. Riersol, O., 1950, Identifiability of a linear relation between variables which are subject to error: Econometrika, v. 18, p. 375–389. Solari, M. E., 1969, The “maximum likelihood solution” of the problem of estimating a linear functional relationship: Jour. Royal Statistical Soc., Series B, v. 31, p. 372–375. Sprent, P., 1966, A generalized least-squares approach to linear functional relationships, with discussion: Jour. Royal Statistical Soc., Series B, v. 28, p. 278–297. Teissier, G., 1948, La relation d'allometric, sa signification statistique et biologique: Biometrics, v. 4, p. 14–52. Villegas, C., 1961, Maximum likelihood estimation of a linear functional relationship: Annals of Math. Stat., v. 32, p. 1048–1062. Villegas, C., 1969, Confidence region for a linear relation: Annals of Math. Stat., v. 35, p. 780–788. Wald, A., 1940, The fitting of straight lines if both variables are subject to error: Annals of Math. Stat., v. 11, p. 284–300. Worthing, A. G., and Geffner, J., 1946, Treatment of experimental data: John Wiley and Sons, Inc., New York, 342 p. York, D., 1966, Least-squares fitting of a straight line: Canadian Jour. of Physics, v. 44, p. 1079–1086. York, D., 1967, The best isochron: Earth and Planetary Science Letters, v. 2, p. 479–482. York, D., 1969, Least-squares fitting of a straight line with correlated errors: Earth & Planetary Science Letters, v. 5, p. 320–324.