|
MAY 5 -
6, 2003
Presented
at the University of Maryland
ANDREW
GELMAN
Department of Statistics, Columbia University
INSTRUCTOR
Andrew Gelman
is a professor in the Departments of Statistics and Political Science and
founding director of the Quantitative Methods in Social Sciences Program at
Columbia University. He is the
author of the books "Bayesian Data Analysis" and "Teaching
Statistics: A Bag of Tricks" and over 100 articles on statistical
methods, theory, and applications. His
statistical research interests include Bayesian inference, multilevel
modeling, and statistical graphics and model checking. His applied research interests include public opinion and
voting, decision analysis, and environmental health.
His research has received awards from the American Statistical
Association, the American Political Science Association, and other research
organizations.
PREQUISITE
COURSE
Basic statistics course including basic familiarity with linear regression.
TOPICS
AND APPROACH
Why
learn about Bayesian data analysis?
Bayesian
inference is an effective and flexible approach for analyzing large data sets
and combining information from different sources. Bayesian methods allow
inference without overfitting for models with large numbers of parameters.
Important examples for sample surveys include small-area estimation,
longitudinal data analysis, and multilevel regression.
Why Bugs and R?
Bugs is a freely-available computer program that can fit Bayesian statistical
models without the need for programming by the user. Bugs's modeling language
is open-ended, making it easy to generalize a model or expand it to account
for additional data.
R is an open-source language for statistical computing and graphics. We run
Bugs from R, which gives us flexibility in data manipulation before the
analysis and display of inferences after.
What topics are covered?
The course begins with Bayesian analyses of simple classical models such as
linear and logistic regression. These
examples introduce Bugs and R as well as the basic ideas of Bayesian data
analysis. We then discuss how to
fit and understand multilevel (hierarchical) models, in which parameters are
modeled in batches. We discuss
general issues in practical Bayesian data analysis, including how to check the
fit of a model to data and how to incorporate design information (e.g., strata
or clusters in sampling) into a Bayesian model.
We also give some background on the computational algorithms used by
Bugs as well as programming tips. We
illustrate with several applications from our own research using survey and
social science data.
Here
is an approximate syllabus for the course:
1.
R and Bugs for classical inference
2. Model checking, model
expansion, and debugging
3.
Hierarchical linear regression
4.
Hierarchical logistic and Poisson regression
5.
Understanding the Gibbs sampler and Metropolis algorithm
6.
Expressing a single model in several different ways
7.
Prior distributions
8.
Troubleshooting: what to
do when your simulations run too slowly, or get
stuck, or don't make sense
9. Accounting for design of
surveys and experiments
10. Concluding discussion
What are the concepts introduced in the course?
Bayesian data analysis has three steps: model building, inference, and
model checking. We integrate
these using R and Bugs. "Fitting"
a model is neither the beginning nor end of analysis.
Bayesian hierarchical modeling can be seen as a generalization of least
squares or maximum likelihood to allow partial pooling of inferences from
different data sources.
TENTATIVE
SCHEDULE
Please be on time. There is a 30
minute period at the beginning of class in which each participant will have to
download software before instruction begins.
|
MONDAY, MAY 5, 2003 |
|
8:00
- 8:30 |
Check
in |
|
8:30
- 9:00 |
Software
Download |
|
9:00
- 10:30 |
Session
1 |
|
10:30
-10:45 |
Break |
|
10:45
- 12:15 |
Session
2 |
|
12:15
- 1:30 |
Lunch |
|
1:30
- 3:00 |
Session
3 |
|
3:00
- 3:15 |
Break |
|
3:15
- 4:45 |
Session
4 |
|
|
|
|
TUESDAY, May 6, 2003 |
|
9:00
- 10:30 |
Session
5 |
|
10:30
- 10:45 |
Break |
|
10:45
- 12:15 |
Session
6 |
|
|
|
LOCATION
The course will be held at
The University of Maryland in the OACS Lab-0225 Lefrak Hall. The University of
Maryland is located in College Park, MD, with nearby access to the College
Park Metro Stop. Located on US Route 1 and approximately 2 miles from I-495.
Information, directions, and times will be sent with your confirmation
letter.
REGISTRATION
Online registration is required. Confirmation of registration and instructions
will be sent after the registration form has been processed. Registration is
not firm until you receive a confirmation letter. Payment by credit card is
required. Post registration payment may be done online using the student's
registration number. Please note registration number. Registration and Payment
deadline is April
18, 2003. Payment is due by April
18, 2003 or the registration will be cancelled.
Please notify JPSM as soon as possible if you need to cancel your
registration. Cancellation between April 21 –April 23, 2003 will
require a $100 administrative fee,
the remainder will be reimbursed. Cancellation on or after April 24, 2003 is
subject o the full fee amount.
INQUIRIES
For information on
this short courses contact JPSM at 301-314-7911; fax
301-314-7912; or e-mail shortcourse@survey.umd.edu
Statistics
Consortium Web Page: http://www.jpsm.umd.edu/stat
JPSM Home page:
http://www.jpsm.umd.edu/
|