Jump to content

Proportionate reduction of error

From Wikipedia, the free encyclopedia

Proportionate reduction of error (PRE) is the gain in precision of predicting dependent variable from knowing the independent variable (or a collection of multiple variables). It is a goodness of fit measure of statistical models, and forms the mathematical basis for several correlation coefficients.[1] The summary statistics is particularly useful and popular when used to evaluate models where the dependent variable is binary, taking on values {0,1}.

Example

[edit]

If both and vectors have cardinal (interval or rational) scale, then without knowing , the best predictor for an unknown would be , the arithmetic mean of the -data. The total prediction error would be .

If, however, and a function relating to are known, for example a straight line , then the prediction error becomes . The coefficient of determination then becomes and is the fraction of variance of that is explained by . Its square root is Pearson's product-moment correlation .

There are several other correlation coefficients that have PRE interpretation and are used for variables of different scales:

predict from coefficient symmetric
nominal, binary nominal, binary Guttman's λ[2] yes
ordinal nominal Freeman's θ[3] yes
cardinal nominal η[4] no
ordinal binary, ordinal Wilson's e [5] yes
cardinal binary point biserial correlation yes

References

[edit]
  1. ^ Freeman, L.C.: Elementary applied statistics, New, York, London, Sidney (John Wiley and Sons) 1965
  2. ^ Guttman, L. The quantification of a class of attributes: A theory and method of scale construction. In: The prediction of personal adjustment. Horst, P.; Wallin, P.; Guttman, L. et al. (eds.) New York (Social Science Research Council) 1941, pp. 319–348.
  3. ^ Freeman, L.C.: Elementary applied statistics, New, York, London, Sidney (John Wiley and Sons) 1965
  4. ^ anonymous. Fehlerreduktionsmaße [web-site, accessed 2017-07-29]. 2016. Available from: https://de.wikipedia.org/wiki/Fehlerreduktionsma%C3%9Fe#.CE.B72.
  5. ^ Freeman, L.C.: Order-based statistics and monotonicity: A family of ordinal measures of association. J. Math. Sociol. 1986, vol. 12, no. 1, pp. 49–69. Available from: http://moreno.ss.uci.edu/41.pdf.