Chapter 6: Q19E (page 331)
Exercises 19 and 20 involve a design matrix \(X\) with two or more columns and a least-squares solution \(\hat \beta \) of \({\bf{y}} = X\beta \). Consider the following numbers.
(i) \({\left\| {X\hat \beta } \right\|^2}\)鈥攖he sum of the squares of the 鈥渞egression term.鈥 Denote this number by .
(ii) \({\left\| {{\bf{y}} - X\hat \beta } \right\|^2}\)鈥攖he sum of the squares for error term. Denote this number by \(SS\left( E \right)\).
(iii) \({\left\| {\bf{y}} \right\|^2}\)鈥攖he 鈥渢otal鈥 sum of the squares of the \(y\)-values. Denote this number by \(SS\left( T \right)\).
Every statistics text that discusses regression and the linear model \(y = X\beta + \in \) introduces these numbers, though terminology and notation vary somewhat. To simplify matters, assume that the mean of the -values is zero. In this case, \(SS\left( T \right)\) is proportional to what is called the variance of the set of -values.
19. Justify the equation \(SS\left( T \right) = SS\left( R \right) + SS\left( E \right)\). (Hint: Use a theorem, and explain why the hypotheses of the theorem are satisfied.) This equation is extremely important in statistics, both in regression theory and in the analysis of variance.
Short Answer
The equation \(SS\left( T \right) = SS\left( R \right) + SS\left( E \right)\) is justified.