Course Topics
Introduction to Linear Models
Topics:
 Simple Linear Regression Examples
 Assumptions for Linear Models
 Ordinary Least Squares (OLS) estimators
 R²
 Residuals
 Household Spending on Alcohol vs. Tobacco
Data and Story Library
 Amazon River Water Levels
Applied Linear Regression, Weisberg, p.31.
 Normal Body Temperature, Gender, and Heart Rate
Journal of Statistics Education Data Archive, normtemp.dat
 Olympic Records
Applied Linear Regression, Weisberg, p.29.
 Age and Systolic Blood Pressure
Statistics in Medicine, Colton, p.189, Table 6.1.
Transformations
 Bacteria Deaths Due to X-Ray Radiation
Regression Analysis By Example, Chatterjee and Price, p.36.
 World Billionaires 1992
Data and Story Library
 Brain and Body Weights of Mammals
Applied Linear Regression, Weisberg, p.144.
 Size of Romanesque Churches
Applied Linear Regression, Weisberg, p.149.
 Electricity Usage, Temperature and Occupancy
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.177.
 Estimates of Flock Size
Applied Linear Regression, Weisberg, p.102.
 Estimating Rates of Return of Investments
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.186.
 PCB Contamination of U.S. Bays and Estuaries
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.164.
 Televisions, Physicians, and Life Expectancy
Journal of Statistics Education Data Archive, televisions.dat
 Tree Diameter
Applied Linear Regression, 1st ed., Weisberg, p.139, 1980.
 Advertising Revenue
Regression Analysis By Example, 2nd ed., Chatterjee and Price, p.257, 1991.
Inference in Linear Regression
Topics:
 Inferences concerning intercept and slope
 Confidence intervals for intercept and slope
 Prediction intervals for E(Y)
 Regression through the Origin
 ANOVA Approach to regression
 F-distribution
 Voting Fraud in an Election
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.213.
 Plastic Hardness
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and
Wasserman, p.39, CH01PR22.DAT.
Many of the previous data sets can also be used here.
Regression Diagnostics
Topics:
 Outliers
 Influential points
 Graphical diagnostics
 Remedies
 Weighted Least Squares
 Household Spending on Alcohol vs. Tobacco
Data and Story Library
 Cheddar Cheese Taste
Data and Story Library
 Electricity Usage, Temperature and Occupancy
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.177.
 PCB Contamination of U.S. Bays and Estuaries
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.164.
 Rat Data
Applied Linear Regression, Weisberg, p.122.
 Repair Times For Computers
Regression Analysis By Example, Chatterjee and Price, p.16.
 Stopping Times and Distances of Automobiles
Applied Linear Regression, Weisberg, p.161.
 Television Rating Data
Regression Analysis By Example, Chatterjee and Price, p.25.
Regression in Matrix Notation
Multiple Regression
Topics:
 Why multiple regression?
 Examples
 Assumptions
 Visual representation
 Estimation
 Fundamental Equation of Regression Analysis
 ANOVA approach to Multiple regression
 Regression diagnostics
 Marginal effects of covariates (Extra sums of squares)
 Pooled tests of significance
 Uncorrelated Predictors
 Multicollinearity
 Confounding
 Cheddar Cheese Taste
Data and Story Library
 Cigarette Data
Journal of Statistics Education Data Archive, cigarettes.dat
 Work Crew Productivity
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and
Wasserman, p.286, CH07TA06.DAT.
 Nutritionally Deficient Children
Applied Regression Analysis and Other Multivariable Methods, Kleinbaum
& Kupper, p.132.
 Equal Educational Opportunity Data
Regression Analysis By Example, Chatterjee and Price, p.176.
 Gas Vapor Data
Applied Linear Regression, Weisberg, p.138.
 Influence of Previous Weather on Ozone Levels
Applied Linear Regression, Weisberg, p.63.
 PCB Contamination of U.S. Bays and Estuaries
A Casebook for a First Course in Statistics and Data Analysis, Chatterjee,
Handcock and Simonoff, p.164.
 Statistics of Poverty and Inequality
Journal of Statistics Education Data Archive, poverty.dat
 Brand Preference
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and Wasserman,
p.252, CH06PR05.DAT.
 Property Valuation
Regression Analysis By Example, Chatterjee and Price, p.257.
 Rat Data
Applied Linear Regression, Weisberg, p.122.
 Repair Times For Computers
Regression Analysis By Example, Chatterjee and Price, p.16.
 Systolic Blood Pressure
Applied Regression Analysis and Other Multivariable Methods, Kleinbaum
& Kupper, p.60.
 Land Valuation
Applied Linear Regression, Weisberg, p.193.
 A Study of Supervisor Performance
Regression Analysis By Example, Chatterjee and Price, p.69.
 Televisions, Physicians, and Life Expectancy
Journal of Statistics Education Data Archive, televisions.dat
 Television Rating Data
Regression Analysis By Example, Chatterjee and Price, p.25.
 Bicycling Exercise Tolerance
Applied Linear Statistical Models, 4th ed., Neter, Kutner, Nachtsheim and
Wasserman, 1996, CH23TA04.DAT.
Qualitative Predictor Variables
Topics:
 Categorical Variable (2 levels)
 Categorical Variable (3 levels)
 Mixture of Continuous and Categorical Variables
 Two Qualitative Predictors
 Two Qualitative and One Continuous Predictor
 Determinants of Plasma Carotene and Retinol
Model Building Strategies
Topics:
 Data Collection and Preparation
 Reduction of Covariates
 Model Refinement
 Model Validation
 Determinants of Wages from the Current Population Survey
This sample write-up is an example of the type of scientific writing that is expected.
Single Factor Analysis of Variance
Topics:
 Definitions
 Regression versus ANOVA
Analysis of Covariance
Two Factor Analysis of Variance
Topics:
 Why two factor ANOVA?
 Forced Expiratory Volumes
Applied Regression Analysis and Other Multivariable Methods, Kleinbaum
& Kupper, p.323.
 Stress Reduction
Applied Regression Analysis and Other Multivariable Methods, Kleinbaum
& Kupper, p.344.
 Average Patient Waiting Time
Applied Regression Analysis and Other Multivariable Methods, Kleinbaum
& Kupper, p.346.
Interactions
Topics:
 Concepts
 Parametrization
 Interaction between qualitative and quantitative covariates
 Interaction between two qualitative covariates
 Testing specific hypotheses
 Hay Fever Relief
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and Wasserman, p.841,
CH19PR14.DAT.
 Tool Life
Applied Linear Regression, Weisberg, p.191.
 Forced Expiratory Volumes
Applied Regression Analysis and Other Multivariable Methods, Kleinbaum
& Kupper, p.323.
 Salary Survey Data
Regression Analysis By Example, Chatterjee and Price, p.97.
Experimental Studies
Topics:
 Clinical Trials
 Randomization
 Sample size and Power
Complex Data Sets
These data sets are more complex than the ones used for weekly assignments. They
were used for the two projects given at midterm and for the final take-home exam.
Students were required to assimilate all the material covered in the course and
to make various decisions affecting the course of the analysis, such as which
covariates to drop, which to transform, which hypotheses to test. Decisions made
early in the analysis could lead to very different final models. The students
needed to remain aware of their aims and to determine whether the final model
achieved these aims and led to a meaningful interpretation.
 Cloud Seeding Data
Applied Linear Regression, Weisberg, p.170.
 Crash Test Dummies
Data and Story Library
 Fuel Consumption by State
Applied Linear Regression, Weisberg, p.35.
 Galápagos Island Species Data
Applied Linear Regression, Weisberg, p.224.
 Factors Influencing Motor Insurance Rates
DATA: A Collection of Problems from Many Fields for the Student and Research
Worker, Andrews and Hertzberg, p.415.
 Land Rent Data
Applied Linear Regression, Weisberg, p.162.
 Mathematics Proficiency and the Home Environment
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and Wasserman, p.440,
CH10TA11.DAT.
 Determinants of Plasma Carotene and Retinol
 SENIC Project
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and Wasserman, p.1365,
Appendix C.1.
 SMSAs in the U.S.
Applied Linear Statistical Models, Neter, Kutner, Nachtsheim and Wasserman, p.1367,
Appendix C.2.
 U.S. Crime Rate
Data and Story Library
 U.S. Temperatures
Data and Story Library
 Determinants of Wages from the Current Population Survey
 Air Pollution in U.S. Cities (USAIR.DAT)
A Handbook of Small Data Sets, Hand, Daly, Lunn, McConway and Ostrowski, p.20, 1994.
 Life Insurance Premiums (PREMIUM.DAT)
A Handbook of Small Data Sets, Hand, Daly, Lunn, McConway and Ostrowski, p.280, 1994.
 Elective Hernia Repair (HERNIOR.DAT)
A Handbook of Small Data Sets, Hand, Daly, Lunn, McConway and Ostrowski, p.390, 1994.