You will be using your Framingham dataset for the following questions.

1. Create a scatterplot of systolic blood pressure and total serum cholesterol. Do you think there is a liner relationship between these variables? Why or why not? If so, what kind?

2. What is the correlation between systolic blood pressure and total serum cholesterol? Interpret.

3. Calculate a simple linear regression where the outcome is total serum cholesterol and the independent variable is age. Interpret.

4. Calculate a simple linear regression where the outcome is total serum cholesterol and the independent variable is smoking status. Interpret.

5. Calculate a multivariable regression where the outcome is total serum cholesterol and the independent variables are BMI, age, sex and smoking status. Interpret.

6. Use the regression from question 7 to answer the following.

a. What is the predicted total serum cholesterol for a 50 year-old man who doesn’t smoke and whose BMI is 25?

b. What is the predicted total serum cholesterol for a 25 year-old woman who smokes and whose BMI is 32?