You will be using your Framingham dataset for the following questions.
1. Create a scatterplot of systolic blood pressure and total serum cholesterol. Do you think there is a linear relationship between these variables? Why or why not? If so, what kind?
2. What is the correlation between systolic blood pressure and total serum cholesterol? Interpret.
3. Calculate a simple linear regression where the outcome is total serum cholesterol and the independent variable is age. Interpret.
4. Calculate a simple linear regression where the outcome is total serum cholesterol and the independent variable is smoking status. Interpret.
5. Calculate a multivariable regression where the outcome is total serum cholesterol and the independent variables are BMI, age, sex and smoking status. Interpret.
6. Use the regression from question 5 to answer the following.
a. What is the predicted total serum cholesterol for a 50-year-old man who doesn’t smoke and whose BMI is 25?
b. What is the predicted total serum cholesterol for a 25-year-old woman who smokes and whose BMI is 32?
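
The mechanics behind questions 2, 5, and 6 can be sketched in Python as follows. This is a minimal illustration on synthetic data, not the Framingham dataset. The variable names, the 0/1 coding of sex and smoking status, and the coefficients used to generate the toy outcome are all assumptions made for the example, so the printed numbers say nothing about the real answers.

```python
import numpy as np

# Synthetic stand-in data (NOT the Framingham dataset); every value is made up.
rng = np.random.default_rng(0)
n = 200
bmi = rng.uniform(18, 40, n)
age = rng.uniform(30, 70, n)
male = rng.integers(0, 2, n)     # assumed coding: 1 = male, 0 = female
smoker = rng.integers(0, 2, n)   # assumed coding: 1 = smoker, 0 = non-smoker

# Hypothetical coefficients (intercept, BMI, age, male, smoker),
# used only to generate a toy cholesterol outcome.
true_beta = np.array([150.0, 1.5, 1.0, -5.0, 3.0])
X = np.column_stack([np.ones(n), bmi, age, male, smoker])
chol = X @ true_beta + rng.normal(0, 10, n)


def pearson_r(x, y):
    """Pearson correlation coefficient between two 1-D arrays."""
    return float(np.corrcoef(x, y)[0, 1])


def fit_ols(X, y):
    """Ordinary least squares coefficients for design matrix X (with intercept column)."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta


def predict(beta, bmi, age, male, smoker):
    """Predicted outcome: intercept plus each coefficient times its covariate value."""
    return float(beta @ np.array([1.0, bmi, age, male, smoker]))


beta = fit_ols(X, chol)
print("correlation(age, chol):", round(pearson_r(age, chol), 3))
print("fitted coefficients:", np.round(beta, 2))
print("50-year-old male non-smoker, BMI 25:", round(predict(beta, 25, 50, 1, 0), 1))
print("25-year-old female smoker, BMI 32:", round(predict(beta, 32, 25, 0, 1), 1))
```

With the real data, the same prediction step applies: plug each person's covariate values into the fitted equation, making sure the 0/1 values match how sex and smoking status are actually coded in your dataset.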