STATISTICS 593:
Machine Learning, Spring, 2000
Instructors: Alejandro Murua and Jon A. Wellner


Tentative Time: T-Th 9:30 - 11:00
First Meeting: Tuesday, March 28


This special topic course will deal with statistical issues arising from ``machine learning''. The course will begin with a discussion of the concept ``learning'' and its connection to some classical statistical problems and terminology. Although emphasis will be placed on classification error bounds, and model selection techniques, the course will cover both applied and theoretical aspects of machine learning. Examples of applications will be drawn from speech and object recognition, as well as from document classification, and other fields. This course is a credit/non-credit course. Student participation will be encouraged either through special reading assignments, short presentations, or short term projects.


Tentative Topics / Outline
I.
Introduction: (3 weeks)
A. Problems: Classification, and Discrimination problems: (1.5 weeks)
B. Methods: Vapnik-Chervonenkis classes and empirical process theory: (1.5 weeks)


II.
Classification and Decision Trees: (2 weeks)
A. Classification and decision trees: (1 week)
B. Exponential bounds, Part II: (1 week)


III.
Model Complexity: Penalization and Model Selection: (3.5 weeks)
A. Approaches to model selection: (1 week)
B. Empirical Penalization and Exponential bounds, Part III: (2.5 weeks)

Some Reference Books: