Why are outliers problematic in regression?

Prepare for the PHFO Quantitative Analysis For Business Exam. Study with flashcards, multiple choice questions, hints, and explanations to ensure confidence and success in your exam!

Multiple Choice

Why are outliers problematic in regression?

Explanation:
In regression, the fitting process usually minimizes squared errors, so extreme values have outsized influence. An outlier can tug the estimated line toward itself, changing both the slope and the intercept. If the outlier has a typical x but an extreme y, it pulls the line away from where most data lie, leading to biased coefficients and poorer predictions for the rest of the data. If the outlier has an extreme x (high leverage), it can pull the line toward itself even if its y isn’t extreme, again shifting the coefficients and affecting predictions. This distortion also tends to worsen fit metrics and can make standard errors unreliable, undermining inference. So the strongest reason is that outliers can distort the estimated coefficients and the resulting predictions because the common fitting method treats large residuals as particularly costly. The other statements don’t fit: there is a real effect on model fit, outliers don’t always improve accuracy, and they can influence more than just the intercept.

In regression, the fitting process usually minimizes squared errors, so extreme values have outsized influence. An outlier can tug the estimated line toward itself, changing both the slope and the intercept. If the outlier has a typical x but an extreme y, it pulls the line away from where most data lie, leading to biased coefficients and poorer predictions for the rest of the data. If the outlier has an extreme x (high leverage), it can pull the line toward itself even if its y isn’t extreme, again shifting the coefficients and affecting predictions. This distortion also tends to worsen fit metrics and can make standard errors unreliable, undermining inference.

So the strongest reason is that outliers can distort the estimated coefficients and the resulting predictions because the common fitting method treats large residuals as particularly costly. The other statements don’t fit: there is a real effect on model fit, outliers don’t always improve accuracy, and they can influence more than just the intercept.

Subscribe

Get the latest from Passetra

You can unsubscribe at any time. Read our privacy policy