Consider simple linear regression with pairs of numbers . Let be the least squares line where and . In terms of the summary statistics, derive a simple expression for the residual standard deviation , where

For a derivation question like this one, once you have an algebraic solution, check it against a few numerical regression examples with small data sets. If your formula matches the numerical results in several cases, your answer is likely correct. If it does not match, go back and review your derivation.
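A numerical check of this kind might look like the following minimal sketch. Several things here are assumptions rather than part of the problem: the five data points are made up, the function names are hypothetical, and the expression under test is the standard identity that the sum of squared residuals equals (1 - r^2) times the sum of squared deviations of y from its mean. The denominator used for the residual SD is left as a parameter, since conventions differ (n, n - 1, or n - 2); use whichever one your definition specifies.

```python
import math

def least_squares_fit(x, y):
    """Return (intercept, slope) of the least squares line for paired data."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return y_bar - slope * x_bar, slope

def residual_sum_of_squares(x, y):
    """Sum of squared residuals about the fitted least squares line."""
    intercept, slope = least_squares_fit(x, y)
    return sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))

def total_sum_of_squares(v):
    """Sum of squared deviations of v from its own mean."""
    v_bar = sum(v) / len(v)
    return sum((vi - v_bar) ** 2 for vi in v)

def correlation(x, y):
    """Sample correlation coefficient r."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    return sxy / math.sqrt(total_sum_of_squares(x) * total_sum_of_squares(y))

def residual_sd(x, y, denominator):
    """Residual SD computed directly from the residuals.

    The denominator is an assumption left to the caller (n, n - 1, or n - 2).
    """
    return math.sqrt(residual_sum_of_squares(x, y) / denominator)

# Hypothetical small data set, chosen only for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]
n = len(x)

rss = residual_sum_of_squares(x, y)   # computed directly from the residuals
r = correlation(x, y)
syy = total_sum_of_squares(y)
candidate_rss = (1 - r ** 2) * syy    # closed form being checked

print("direct RSS:   ", rss)
print("candidate RSS:", candidate_rss)
assert math.isclose(rss, candidate_rss)

# Compare the direct and closed-form residual SD under each denominator.
for d in (n, n - 1, n - 2):
    print(d, residual_sd(x, y, d), math.sqrt(candidate_rss / d))
```

If the assertion fails for your own candidate expression, that points to an algebra slip. Agreement on one data set is not a proof, so it is worth repeating the check with two or three different small data sets.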

Part a)
To validate whether you have the correct expression, suppose , , and . What is your value of the residual SD:

Hint: