2009-2010 Bulletin â PDF - SEAS Bulletin - Columbia University
2009-2010 Bulletin â PDF - SEAS Bulletin - Columbia University
2009-2010 Bulletin â PDF - SEAS Bulletin - Columbia University
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
203<br />
W3107 as prerequisites; like other<br />
advanced offerings in statistics, it covers<br />
both theory and practical aspects of<br />
modeling and data analysis. Advanced<br />
offerings in probability theory, stochastic<br />
processes, and mathematical finance<br />
generally take STAT W3105 as a prerequisite;<br />
advanced offerings in statistical<br />
theory and methods generally take STAT<br />
W4107 and, in several cases, W4315 as<br />
prerequisites; an exception is STAT<br />
W4220: Data mining, which has a<br />
course in computer programming as<br />
prerequisite and STAT W3107 as corequisite.<br />
STAT 4201 is a high-level survey<br />
of applied statistical methods.<br />
Please note that STAT W3000 has<br />
been renumbered as W3105 and STAT<br />
W3659 has been renumbered as<br />
W3107. For a description of the following<br />
course offered jointly by the<br />
Departments of Statistics and Industrial<br />
Engineering and Operations Research,<br />
see ‘‘Industrial Engineering and<br />
Operations Research’’:<br />
SIEO W4150x and y Introduction to probability<br />
and statistics<br />
3 pts. I. Hueter and L. Wright.<br />
Prerequisites: MATH V1101 and V1102 or the<br />
equivalent. A quick calculus-based tour of the<br />
fundamentals of probability theory and statistical<br />
inference. Probabilistic models, random variables,<br />
useful distributions, expectations, laws of large<br />
numbers, central limit theorem. Statistical inference:<br />
point and confidence interval estimation,<br />
hypothesis tests, linear regression. Students<br />
seeking a more thorough introduction to probability<br />
and statistics should consider STAT W3105 and<br />
W3107.<br />
STAT W3105x Introduction to probability<br />
3 pts. Instructor to be announced.<br />
Prerequisites: MATH V1101 and V1102 or the<br />
equivalent. A calculus-based introduction to probability<br />
theory.Topics covered include random variables,<br />
conditional probability, expectation, independence,<br />
Bayes’ rule, important distributions,<br />
joint distributions, moment-generating functions,<br />
central limit theorem, laws of large numbers, and<br />
Markov’s inequality.<br />
STAT W3107y Introduction to statistical inference<br />
3 pts. Instructor to be announced.<br />
Prerequisite: STAT W3105 or W4105, or the<br />
equivalent. Calculus-based introduction to the<br />
theory of statistics. Useful distributions, law of<br />
large numbers and central limit theorem, point<br />
estimation, hypothesis testing, confidence intervals<br />
maximum likelihood, likelihood ratio tests,<br />
nonparametric procedures, theory of least<br />
squares, and analysis of variance.<br />
STAT W4201x and y Advanced data analysis<br />
3 pts. D. Alemayehu and instructor to be<br />
announced.<br />
Prerequisite: A one-term introductory statistics<br />
course. This is a course on getting the most out<br />
of data. The emphasis will be on hands-on experience,<br />
involving case studies with real data and<br />
using common statistical packages. The course<br />
covers, at a very high level, exploratory data<br />
analysis, model formulation, goodness-of-fit testing,<br />
and other standard and nonstandard statistical<br />
procedures, including linear regression, analysis<br />
of variance, nonlinear regression, generalized<br />
linear models, survival analysis, time series<br />
analysis, and modern regression methods.<br />
Students will be expected to propose a data set<br />
of their choice for use as case study material.<br />
STAT W4240x Data mining<br />
3 pts. D. Madigan.<br />
Prerequisite: COMS W1003, W1004, W1005,<br />
W1007, or the equivalent. Corequisite: STAT<br />
W3107. Data mining is a dynamic and fast-growing<br />
field at the interface of statistics and computer<br />
science. The emergence of massive datasets<br />
containing millions or even billions of observations<br />
provides the primary impetus for the field.<br />
Such datasets arise, for instance, in large-scale<br />
retailing, telecommunications, astronomy, computational<br />
and statistical challenges.This course will<br />
provide an overview of current research in data<br />
mining and will be suitable for graduate students<br />
<strong>SEAS</strong> <strong>2009</strong>–<strong>2010</strong>