The course introduces methods of statistical inference, useful in any area of science that collects and analyzes data. The course discusses the methodological foundations, as well as issues of practical implementation and use. Methods discussed in this class are applicable to a broad range of problems, from design and analysis of empirical studies of complex real-life phenomena, to design and analysis of evaluations of computer experiments or computer science research. The coursework includes a term project involving method development, implementation and/or work with real-life investigations.
The course discusses the following topics:
Basics of frequentist statistical inference for continuous data: measures of association, confidence and prediction intervals, hypothesis testing, benefits and limitations of p-values.
Experimental design: ways to include data to the study to maximize its statistical efficiency, and strategies for analysis of the designed experiments. Factorial and randomized block designs, linear mixed effects models, and response surface exploration.
Introduction to causal inference: graphical models, adjustments for confounders, interventions and counterfactual inference.
(Optional, if time allows) Introduction to Bayesian data analysis: model specification, estimation, checking and inference.
Instructor: Prof. Olga Vitek, Email: o.vitek@northeastern.edu Office hours: Tue and Fri after the class, or by appointment. 177 Huntington Ave, 9th floor.
Teaching assistants: Mr. Ethan Rogers, Email: rogers.eth@northeastern.edu Office hours: Thursdays 1-2:30pm, Hastings 102; or by appointment, 177 Huntington Ave, 9th floor.
Course policies and administration: Syllabus, Piazza, Canvas. Academic integrity policy is strictly enforced.
(KNNL) Applied Linear Statistical Models. Kutner, Neter, Nachtsheim, Li, McGraw-Hill, 5th Edition, 2004. Website. Additional texts will be posted dynamically on Piazza