Summary
Whew! Beside reviewing some matrix algebra and seeing how it can be used to formulate multiple regression models, we've taken a look at five different flavors of research studies involving multiple linear regression analysis. We gleaned what we could from the analyses knowing what we know from our study of the simple linear regression model. Hopefully, you now appreciate many of the similarities in simple linear regression analyses and multiple linear regression analyses. In future lessons, we will focus on some of the differences. In particular, here's a list of the new things we'll learn on our way to becoming experts in multiple linear regression analyses:
- The above research scenarios and regression models, as well as a few more.
- The "general linear test" which helps to answer many research questions (Lesson 10)
- F-tests for more than one slope (Lesson 10)
- Interactions between two or more predictor variables (Lesson 11)
- Using "variance inflation factors," the detection of correlated predictors ("multicollinearity") and the limitations they cause (Lesson 12)
- Selection of variables from a large set of variables for inclusion in a model ("stepwise regression and "best subsets regression") (Lesson 13)
- Identifying influential data points (Lesson 14)
Comprehensive Exercises
Directions. Type up your answer the following question in a Word file named exercises08_yourPSUid.doc. Once you have completed the exercise, upload your file to the Lesson #8 Comprehensive Exercises dropbox. |
8.1 Are brain size and body size predictive of intelligence?The iqsize.txt data set contains data on the intelligence based on the performance IQ (y = PIQ) scores from the revised Wechsler Adult Intelligence Scale, brain size ( x1 = brain) based on the count from MRI scans (given as count/10000), and body size measured by height in inches ( x2 = height) and weight in pounds ( x3 = weight) on 38 college students.
yi = β0 + β1xi1 + β2xi2 + β3xi3 + εi (with independent, normally distributed error terms and equal variances). |
4 Effect of linearly dependent variables in the X matrixThis exercise is designed to illustrate problems that
can occur when the variables in the X matrix are linearly 4.1 Reaction times in sleep-deprived The data set deprived.txt contains data on the reaction times (y = time, in hundredths of a second) to the onset of light of subjects in four sleep-deprived groups (Kirk, 1995). The four groups were sleep-deprived for 12 hours (the control group, Grp12), for 18 hours (Grp18), for 24 hours (Grp24), and for 30 hours (Grp30). Each group contained 8 subjects.
|
Title of this problem set here...Some researchers were interested in studying the relationship between behavior types of individuals and cholesterol levels. In general, Type A behavior is characterized by urgency, aggression and ambition, while type B behavior is relaxed, non-competitive, and less hurried. The behavior.txt data set contains the researchers' data as they entered it for analysis in Minitab. When the researchers tried to fit the model with chol as the response and typeA and typeB as the predictors, they obtained the following error message:
|
