STAT 230
STAT 230 - Linear Models
Course overview
This course is an introduction to the formulation and use of linear models and generalizations including parameter estimation and inference for such models in a variety of settings. Emphasis will be split between understanding the theoretical foundations of the models and the ability to apply the models to answer scientific questions.
Statistical analyses will be conducted using R and the integrated development environment (IDE) RStudio.
Example: How does physical activity relate to body mass index (kg/m^2)?
Course schedule
Week 1: Statistical inference foundations (probability distributions, central limit theorem)
Week 2: Confidence intervals, hypothesis testing, ANOVA
Week 3: Simple and multiple linear regression (least squares, geometric interpretation, confidence intervals, hypothesis testing)
Week 4: Multiple linear regression: Categorical variables
Week 5: Saudi National Day. Multiple linear regression: Interactions
Week 6: Model assumptions and unusual observations. Residuals and leverages
Week 7: Generalized and weighted linear squares. Model selection
Week 8: Mid-semester break. Midterm exam
Week 9: Generalized linear models (GLMs), Poisson regression, logistic regression
Week 10: Generalized linear models (GLMs), exponential family, deviance, inference
Week 11: Classification, ROC curve
Week 12: Survival models: censorship, Kaplan-Meier, Cox proportional hazards model
Week 13: Statistics Workshop
Week 14: Local regression, splines. Generalized additive models (GAMs)
Week 15: Spatial models
Week 16: Final exam
Assignments
There will be homeworks that will be submitted to Blackboard by the due date. Late homeworks will not be accepted, unless prior arrangements have been made.
Exams
There will be one midterm exam to be scheduled inside of the regular class hours, and one comprehensive final exam to be scheduled by the registrar’s office.
No written material may be brought into the examination, except for one page (one side) of handwritten notes. A simple calculator may be used.
Midterm exam on Thursday 17th October.
Method of evaluation
50% - Final exam
25% - Midterm exam
25% - Homework
Required knowledge
Advanced and multivariate calculus, linear algebra, probability and statistics.
References
- Christensen (2011) Plane Answers to Complex Questions: the Theory of Linear Models, Springer
- Wood (2015) Core Statistics, Cambridge University Press
- Seber and Lee (2003) Linear Regression Analysis, Wiley
- Hocking (1996) Methods and Applications of Linear Models: Regression and the Analysis of Variance, Wiley
- McCullagh and Nelder (1989) Generalized Linear Models, Chapman & Hall/CRC
- Kariya and Kurata (2004) Generalized Least Squares, Wiley
- Hastie and Tibshirani (1990) Generalized Additive Models, Chapman & Hall/CRC
- Davison (2003) Statistical Models, Cambridge University Press
- Faraway (2005) Linear Models with R, Chapman & Hall/CRC
- Faraway (2006) Extending the Linear Model with R, Chapman & Hall/CRC
- Wood (2006) Generalized Additive Models: An Introduction with R (Chapman & Hall/CRC)
- Moore (2016) Applied Survival Analysis Using R. Springer
License
You may not copy or distribute the course materials.