Prediction Bias

From BIF Guidelines Wiki

Bias

Let u be the true progeny difference (TPD) and u^ be our estimate (EPD). From this we could estimate the degree of bias in our estimate by determining the difference in the mean u and mean u^. However, we never observe the TPD. Instead we estimate it using pedigree, performance, and genomic data.

We can approximate the degree of bias and under/over dispersion of EPD by using regression techniques. One such way to do this is to regress the EPD with more information (e.g., genomic EPD) on the EPD with less information (e.g, pedigree-based EPD). Our expectation is that the intercept from this regression is 0 (no bias) and the slope of the regression is 1 (no over or under dispersion). Our expectations come from the theory of BLUP (Garrick personnal communication) where u^ is an unbiased estimator of u