Statistics

Faculty of the Department of Statistics

Carl N. Morris, Professor of Statistics (Chair)
John Barnard, Assistant Professor of Statistics
Arthur P. Dempster, Professor of Theoretical Statistics
Mayumi Morimoto, Lecturer on Statistics
Donald B. Rubin, Professor of Statistics (Director of Graduate Studies) (on leave spring term)
Clyde H. Schoolfield, Jr., Lecturer on Statistics
David van Dyk, Assistant Professor of Statistics (Head Tutor)
Steve C. Wang, Lecturer on Statistics

Other Faculty Offering Instruction in the Department of Statistics

Frederick Mosteller, Professor of Mathematical Statistics, (Emeritus), (FAS), Roger Irving Lee Professor of Mathematical Statistics (Public Health) (Emeritus)
Bernard Rosner, Professor of Medicine (Medical School)
Alan Zaslavsky, Associate Professor of Statistics (Medical School)

In 1999–2000, the Statistics Department offers four courses at the introductory level (below Statistics 110). Statistics 100 and 101 are essentially equivalent in terms of their quantitative requirements, but differ in the amount of emphasis placed on different techniques and applications. Statistics 100 emphasizes regression, including multiple regression, which is essential in economics and related fields. Statistics 101 emphasizes analysis of variance, which is widely used in experimentally oriented subjects such as psychology and biology. Statistics 104 combines the content of Statistics 100 and 101, and moves somewhat faster than these courses, assuming a stronger quantitative orientation. Statistics 102 is comparable to Statistics 104 in its technical level, but is specifically oriented toward biomedical applications and techniques.

Generally, Statistics 104 and 101 will be accepted as fulfilling any requirement or prerequisite which is fulfilled by Statistics 100. Consult the Statistics Department or your tutorial office for more information about which courses satisfy your concentration requirements, and for guidance on selecting a course. More detailed information can be accessed through the network at the Statistics Department home page, http://fas-www.harvard.edu/~stats/.

Primarily for Undergraduates

*Statistics 91r. Supervised Reading and Research
Catalog Number: 6641
David van Dyk and members of the Department
Half course (fall term; repeated spring term). Hours to be arranged.
Note: Normally may not be taken more than twice; may be counted for concentration in Statistics if taken for graded credit; may be taken in either term; for further information consult with head tutor.

*Statistics 99hf. Tutorial — Senior Year
Catalog Number: 4381
David van Dyk and members of the Department
Half course (throughout the year). Hours to be arranged.
The systematic application of statistical ideas to a problem area.
Note: In exceptional circumstances, may be taken alternatively as a half course in the spring term only.

For Undergraduates and Graduates

Statistics 100. Introduction to Quantitative Methods
Catalog Number: 3808
Clyde H. Schoolfield, Jr.
Half course (fall term; repeated spring term). M., W., F., at 11, and a section meeting to be arranged. EXAM GROUP: 4
Introduces the key ideas underlying statistical and quantitative reasoning, including fundamentals of probability. Topics may include elements of sample surveys, experimental design and observational studies, descriptive and summary statistics for both measured and counted variables, and statistical inference including estimation and tests of hypotheses as applied to one- and two-sample problems, regression with one or more predictors, correlation, and analysis of variance. Emphasizes simple and multiple regression and applications in nonexperimental fields including, but not limited to, economics.
Note: Only one of the following courses may be taken for credit: Statistics 100, 101, 104.

Statistics 101. Introduction to Quantitative Methods
Catalog Number: 5128
Steve C. Wang
Half course (fall term). M., W., F., at 11, and a section meeting to be arranged. EXAM GROUP: 4
Covers the same topics as Statistics 100. Emphasizes the analysis of variance, applied in experimental fields such as psychology and other behavioral sciences.
Note: Only one of the following courses may be taken for credit: Statistics 100, 101, 104.

Statistics 102. Fundamentals of Biostatistics
Catalog Number: 0266
Bernard Rosner (Medical School)
Half course (spring term). M., W., F., at 11, and section meeting to be arranged. EXAM GROUP: 4
An introduction to statistical methods used in biological and medical research. Elementary probability theory, basic concepts of statistical inference, sampling theory, regression and correlation methods, analysis of variance, study design. Emphasis on applications to medical problems.
Note: Primarily for undergraduates with medical or biological interests.

Statistics 104. Introduction to Quantitative Methods
Catalog Number: 4582
Mayumi Morimoto
Half course (fall term). M., W., F., at 11, and a section time to be arranged. EXAM GROUP: 4
Covers the same topics as 100 and 101 combined, at a slightly higher level. Applications will be drawn from fields such as economics, behavioral and health sciences, policy analysis, and law.
Note: Only one of the following courses may be taken for credit: Statistics 100, 101, 104.

Statistics 110. Introduction to Probability
Catalog Number: 0147
Clyde H. Schoolfield, Jr.
Half course (fall term). M., W., F., at 12, and a section meeting to be arranged. EXAM GROUP: 5
A first course in probability pointed toward applications, for students with some calculus. Models include the normal, binomial, exponential, Poisson and gamma distributions. Topics include expectation, independence, conditioning, generating functions, joint distribution and density functions, and limit laws.
Prerequisite: Mathematical preparation at the level of intermediate calculus.

Statistics 111. Introduction to Theoretical Statistics
Catalog Number: 1836
Steve C. Wang
Half course (spring term). M., W., F., at 12, and a section meeting to be arranged. EXAM GROUP: 5
Basic concepts of statistical inference from frequentist and Bayesian perspectives. Topics include maximum likelihood methods, confidence and Bayesian interval estimation, hypothesis testing, least squares methods, and analysis of variance.
Prerequisite: Statistics 110 and basic linear algebra.

Statistics 139. Regression Analysis
Catalog Number: 1450
Steve C. Wang
Half course (fall term). Tu., Th., 10–11:30, and a section meeting to be arranged. EXAM GROUP: 12, 13
An introduction to data analysis using multiple regression. Topics may include model building and diagnostics, graphical checks of assumptions, transformations, multivariate graphics and visualization, exploratory data analysis, tests of significance and confidence intervals, and logistic regression. The course will emphasize analysis and investigation of real datasets using computer software.
Prerequisite: Statistics 100 or equivalent.

[Statistics 140. Design of Experiments and Quasi-Experiments]
Catalog Number: 6306
Donald B. Rubin
Half course (fall term). Hours to be arranged. EXAM GROUP: 16, 17
Statistical designs for the estimation of the effects of treatments in both controlled experiments and observational studies. Topics include randomization, blocking, fractional replication, covariance adjustment, subclassification, matched sampling, model-based adjustment.
Note: Expected to be given in 2000–01.
Prerequisite: Statistics 100 and 139, or equivalent.

Statistics 149. Generalized Linear Models
Catalog Number: 6617
John Barnard
Half course (spring term). Tu., Th., 10–11:30. EXAM GROUP: 12, 13
An introduction to the application and theory of generalized linear models. Emphasis is on understanding models and applying them to data. Topics include likelihood theory, exponential families, model specification, model checking and diagnostics, logistic and ordinal regression, log-linear models, and quasi-likelihood. Applications are drawn from a variety of fields, including medicine, biology, and the social sciences.
Prerequisite: Statistics 111 or equivalent and Statistics 139 or equivalent.

[Statistics 160. Survey Methods]
Catalog Number: 2993
Alan Zaslavsky (Medical School)
Half course (fall term). Hours to be arranged. EXAM GROUP: 8, 9
Methods for design and analysis of sample surveys. Techniques for sample design, with examples from some widely used current surveys. Estimation methods (including calculation and use of sampling weights) and variance estimation methods (including resampling methods). Several guest lectures on nonstatistical aspects of survey methodology such as questionnaire design and validation. Other topics may include variance estimation for complex surveys and estimators, nonresponse, and small-area estimation.
Note: Expected to be given in 2000–01.
Prerequisite: Statistics 111 or 139, or permission of instructor.

Statistics 171. Introduction to Stochastic Processes
Catalog Number: 4180
Mayumi Morimoto
Half course (spring term). M., W., 3:30–5. EXAM GROUP: 8, 9
An introductory course in stochastic processes. Topics include Markov chains, branching processes, Poisson processes, birth and death processes, renewal theory, queuing theory, Brownian motion, and Martingales.
Prerequisite: Statistics 110 or equivalent.

Statistics 185. Statistical Decision and Forecasting
Catalog Number: 6788
David van Dyk
Half course (spring term). M., W., F., at 1. EXAM GROUP: 6
The development of a Bayesian approach to the related problems of decision and forecasting. Decision topics will include utility, loss, decision rules, risk, admissibility of decision rules, and decision theoretic aspects of sequential analysis. Forecasting will be developed through the dynamic linear model and include topics such as sequential analysis and smoothing; models for polynomial trends, seasonal trends, and adjustment for covariates; and forecast intervention, monitoring, and error analysis. Theory and computational methods will be developed with a strong emphasis on applications to a variety of data sets.
Prerequisite: Statistics 110 or 139 or equivalent.

Primarily for Graduates

Statistics 210. Probability Theory and Statistical Inference I
Catalog Number: 2487
Carl N. Morris
Half course (fall term). Tu., Th., 11:30–1. EXAM GROUP: 13, 14
Random variables, their distributions and densities. Families and exponential families of distributions. Expectation. Independence, product spaces, and joint distributions. Types of convergence. Limit theorems (weak and strong laws, central limit problem). Conditional probability and expectation, multivariate Normal distribution, particular examples of conjugate, marginal, and conditional distributions. Inequalities, approximations, and stochastic simulation. Sampling distributions, likelihood function, sufficiency, and information.
Prerequisite: A course in probability and statistics at least at the level of Statistics 110, 111.

Statistics 211. Probability Theory and Statistical Inference II
Catalog Number: 1946
Carl N. Morris
Half course (spring term). Tu., Th., 11:30–1. EXAM GROUP: 13, 14
Introduction to statistical inference. Frequency, Bayesian, and decision-theoretic approaches. Likelihood, sufficiency, multivariate Normal distribution, and exponential families. Testing hypotheses and estimation. Maximum likelihood estimation, likelihood ratio tests, linear models, models for frequency data, large and moderate sample approximations, including the delta method.
Prerequisite: Advanced calculus, Statistics 210, or equivalent.

[Statistics 214. Causal Inference in Statistics and the Social and Biomedical Sciences ]
Catalog Number: 4042
Donald B. Rubin
Half course (spring term). Hours to be arranged. EXAM GROUP: 16, 17
Approaches to causal inference. Covers randomized experiments with and without noncompliance, observational studies with and without ignorable treatment assignment, instrumental variables and sensitivity analysis. A number of applications from economics, medicine, education, etc., are discussed.
Note: Expected to be given in 2000–01.

Statistics 217. Probabilistic Reasoning and Statistical Practice
Catalog Number: 6777
Arthur P. Dempster
Half course (spring term). Tu., Th., 2:30–4. EXAM GROUP: 16, 17
A discussion of the logic of the statistical inference from R. A. Fisher to belief functions.

Statistics 220 (formerly Statistics 220r). Bayesian Data Analysis
Catalog Number: 6270
David van Dyk
Half course (spring term). M., F., 2–3:30. EXAM GROUP: 7, 8
Begins with basic Bayesian models, whose answers often appear similar to classical answers, followed by more complicated hierarchical and mixture models with nonstandard solutions. Includes methods for monitoring adequacy of models and examining sensitivity of conclusions to change in models. Throughout, emphasis on drawing inferences via computer simulation rather than mathematical analysis.
Prerequisite: Statistics 110 and 111.

[Statistics 230. Multivariate Analysis]
Catalog Number: 4626
John Barnard
Half course (fall term). Hours to be arranged. EXAM GROUP: 7, 8
A survey of multivariate analysis. Normal distribution theory, estimation, and hypothesis testing. Multivariate techniques, including cluster analysis, multidimensional scaling, principal component analysis, discriminant analysis, and multiple regression. These techniques are applied to data sets.
Note: Expected to be given in 2000–01.

[Statistics 231. Bayesian Time Series]
Catalog Number: 1687
David van Dyk
Half course (spring term). Hours to be arranged. EXAM GROUP: 8, 9
A study of the dynamic linear model. Topics may include review of classical time series models, forecasting, smoothing, regression methods, polynomial trend models, seasonal models, and forecast monitoring and intervention. Theory and computational methods will be developed with an emphasis on applications to a variety of data sets.
Note: Expected to be given in 2000–01.

Statistics 232 (formerly Statistics 332). Incomplete Multivariate Data
Catalog Number: 4196
John Barnard
Half course (fall term). M., F., 2–3:30. EXAM GROUP: 7, 8
Methods for handling incomplete data sets with general patterns of missing data, emphasizing likelihood-based and Bayesian approaches. Focus is on the application and theory of iterative maximization methods, iterative simulation methods, and multiple imputation. Includes coverage of some multivariate tools and theory relevant to missing data problems. Real examples are drawn from a variety of fields, including health sciences, history of science, and government.
Note: Expected to be omitted in 2000–01.
Prerequisite: A course in probability (Statistics 110-level), a course in theoretical statistics (Statistics 111-level), and knowledge of regression and linear algebra (Statistics 139-level).

Statistics 239. Advanced Regression Analysis
Catalog Number: 7423
John Barnard
Half course (fall term). Tu., Th., 10–11:30, M., 7–9 p.m. EXAM GROUP: 1, 12, 13
Besides the applications done jointly with Statistics 139, students meet separately to develop the theory (multivariate normal, maximum likelihood, likelihood ratio tests, Gauss-Markov, etc.) of linear models. Students do some of the homework assignments from Statistics 139, but also other assignments that differ and are more advanced. Grading is separate from Statistics 139.
Prerequisite: Probability and statistics at the level of Statistics 110 and 111.

Statistics 271. Stochastic Processes in Continuous Time
Catalog Number: 9993
Arthur P. Dempster
Half course (fall term). Tu., 2–4. EXAM GROUP: 16, 17
Modeling and statistical analysis for Gaussian processes governed by stochastic differential equations with applications to control engineering and financial modeling.

[Statistics 290. Risk Analysis]
Catalog Number: 5300
---------------
Half course (spring term). M., W., 2–3:30. EXAM GROUP: 7, 8
Rational decision-making under uncertainty, decision trees, subjective expected utility. Risk aversion, decreasing risk aversion, multiple risks. Risk sharing, insurance. Principals and agents. Rare events. Risks to life and health. Statistical models and assessment. Participants give talks and write papers on topics of their choice.
Note: Expected to be given in 2000–01.
Prerequisite: Statistics 111 or equivalent.

Statistics 292hfr (formerly Statistics 292a). Topics in Statistics
Catalog Number: 0925
Donald B. Rubin
Half course (throughout the year). W., 2–3:30. EXAM GROUP: 7, 8
A range of currently active projects. All involve real applications and require mathematical statistical development. Applications include education, census, political science, biomedical research. Techniques include design of experiments, Bayesian modelling, multiple imputation.

Cross-listed Courses

Economics 2110a. Quantitative Methods I
Economics 2140a. Econometric Methods I
Economics 2140b. Econometric Methods II
[Economics 2140c. Econometric Methods III: Issues in Applied Econometrics]
Economics 2140d. Time Series Analysis
Mathematics 212a. Functions of a Real Variable
Mathematics 212b. Functions of a Real Variable

Graduate Courses of Reading and Research

*Statistics 301. Special Reading and Research
Catalog Number: 4474
John Barnard 1916, Arthur P. Dempster 2345, Mayumi Morimoto 2505, Carl N. Morris 2178, Frederick Mosteller 2235, Bernard Rosner (Medical School) 4018, Donald B. Rubin 7966 (on leave spring term), Clyde H. Schoolfield, Jr. 2440, Steve C. Wang 2581, Alan Zaslavsky (Medical School) 1927, and David van Dyk 2669

*Statistics 302. Direction of Doctoral Dissertations
Catalog Number: 3382
John Barnard 1916, Arthur P. Dempster 2345, Carl N. Morris 2178, Bernard Rosner (Medical School) 4018, Donald B. Rubin 7966 (on leave spring term), Alan Zaslavsky (Medical School) 1927, and David van Dyk 2669

Statistics 311. Recent Advances in Markov Chain Monte Carlo Technology
Catalog Number: 0826
David van Dyk 2669
Half course (fall term). Th., 2–4.
Starting with a review of such standard techniques as Data Augmentation, the Gibbs sampler, and Metropolis Hastings, the course will focus on recent research papers on such topics as adaptive rejection sampling, the method of auxiliary variables, simulated tempering, the collapsed Gibbs sampler, marginal and conditional data augmentation, the nested EM algorithm, slice sampling, exact sampling, simulated sintering, reversible jump MCMC, regeneration, and sequential MC methods.
Prerequisite: Statistics 220 or equivalent.

*Statistics 317. Research Seminar
Catalog Number: 1478
Arthur P. Dempster 2345
Half course (fall term). Hours to be arranged.
Current research on real time detection, classification and control of dynamic systems from a Bayesian perspective.

[*Statistics 324r. Parametric Statistical Inference and Models]
Catalog Number: 3366
Carl N. Morris 2178
Half course (fall term; repeated spring term). Hours to be arranged.
Considers problems in which it is reasonable to develop separate probability models for data, conditional on a parameter vector, and for the vector of unknown parameters, the latter distribution depending on a few unknown hyperparameters. This leads to hierarchical modeling, with relationships to growth curves, kriging, BLUP, longitudinal data, and empirical Bayes. Robustness, model checking, and likelihood inference are considered. A variety of applications is discussed, including to biostatistics, health services, education, sports, and other fields.
Note: Expected to be given in 2000–01.

*Statistics 349r. Analysis of Psychological Data: Issues and Examples
Catalog Number: 4528
Donald B. Rubin 7966 (on leave spring term)
Half course (fall term). Th., 12–2.
Consulting projects on statistical problems arising in psychological and related research areas. Participants expected to contribute actively to one or more projects.