Instructor:Marina Meila
mmp at stat dot washington dot edu
Canvas course site
Optional Textbooks "Machine Learning: A Probabilistic Approach" by K. Murphy and "Elements of Statistical Learning" by Hastie, Tibshirani and Friedman. For each lecture, I will point out the chapters in these books that are relevant. For neural networks, I will be using the Deep learning book by Ian Goodfellow, Yoshua Bengio and Aaron Courville.
Lectures: Tuesdays, & Thursdays 12:30:1:50 on zoom
Office hours: Monday 2:30-3:30 TA: Zhaoqi Li
Course web page: http://www.stat.washington.edu/courses/stat535/fall20 (this page) will be used to post lecture notes prior to lecture, and homeworks. The resources page is useful for addional reading. All other materials will be posted on Canvas. Everything posted here that is directly relevant to the learning objectives will be linked to from Canvas, hence you won't need to visit the web site once you are registered.
Class mailing list: stat535a_au20 at UW will be used sparingly by me mainly for announcements.
Learning Goals
Syllabus
Unsupervised Learning
Grading:The grade is based on homework +
quizzes (65-70%), project (25-30%), and class
participation (5%). These percentages are approximative, and may change by up to 5%.
With the exception of generic libraries (like plotting, matrix functions) you must write your own code. In particular, you are not to use the matlab, R or python code that is made availabe with the textbook(s).
What will the course be about?
Who is this class for?
Prerequisites
Notice to Students - Zoom Recordings
This course is scheduled to run synchronously at your scheduled class time via Zoom. These Zoom class sessions will be recorded. The recording will capture the presenter’s audio, video and computer screen. Student audio and video and chat will be recorded if they share their computer audio and video during the recorded session. The recordings will only be accessible to students enrolled in the course to review materials. These recordings will not be shared with or accessible to the public.
The University and Zoom have FERPA-compliant agreements in place to protect the security and privacy of UW Zoom accounts.
Students who do not wish to be recorded should:
Religious accomodation Washington state law requires that UW develop a policy for accommodation of student absences or significant hardship due to reasons of faith or conscience, or fororganized religious activities. The UW’s policy, including more information about how to requestan accommodation, is available at Religious Accommodations Policy. Accommodations must berequested within the first two weeks of this course using the Religious Accommodations Request Form.
The UW food pantry A student should never have to make the choice between buying food or textbooks. The UW Food Pantry helps mitigate the social and academic effects of campus food insecurity. They aim to lessen the financial burden of purchasing food by providing students withaccess to food and hygiene products at no-cost. Students can expect to receive 4 to 5 days’ worth ofsupplemental food support when they visit the Pantry. For information including operating hours,location, and additional food support resources visit The UW Food Pantry. They can be found onthe North side of West Campus’ Poplar Hall at the corner of Brooklyn Ave NE and 41st.
Last modified: Fri Sep 21 11:20:19 PDT 2018
Supervised Learning (Prediction)
Format: The course will consist of two weekly lectures, a
series of homework assignments, a few quizzes, and a project. The TA
will offer an optional Tutorials where he will go over certain basic
topics in more depth.
The grading policies described here apply to students in good standing; students who engage in misconduct will be reported to the office of Community Standards & Student Conduct. Their grades will be handled differently.
Submit each homework as a single .pdf file through Canvas.
The assignments will consist of (1)
short programming assignments (typically, to implement a version or a
special case of an algorithm presented in the lecture) to be done in
the programming language of your choice and (2) problems or other
questions, including proofs. The programming assignments will be split
into two separate parts:
Late homeworks will be accepted in exceptional circumstances. Please let us know in advance if you think you will be late.
Teamwork: Each class participant
submits her/his homework individually. Unless explicitly allowed to do
so, you are required to write your own code. Discussing
homework questions is acceptable as long as hints or solutions are not asked for or given; for example, a discussion to clarify what a question requires, on the discussion board or elsewhere, is acceptable.
On the discussion board: Answer questions asked by others on course topics. Replies to homework related questions: you can answer to a question asking for clarification, but not to a "how to solve this" question. If your answer is correct then it is considered. If you find an error in the homework or project and are the first to point it out to us, congratulations! we consider that participation.
How much participation is enough? Once a week on average, either in class or on discussion board, is plenty.
The class will teach the basic principles of Machine Learning, and in particular will highlight the intimate connection between statistics and computation (meaning algorithms, data structures, and optimization) in modeling large or high-dimensional data. Solutions that are algorithmically elegant, often end up being also statistically sound, and sometimes when the model estimation program runs fast, we find that the model fits the data well.
These principles will be illustrated during the study of a variety of
models, problems and methods..
This class is a core class in the Machine Learning/Big Data PhD Track in Statistics. For any Statistics student who wants to learn Machine Learning/Big Data, this class is the fist in the triplet of graduate courses 535 --> 538/548 and serves as a prerequisite to
STAT 538 Advanced Machine Learning (taught in Winter), and
STAT 548 Machine Learning for Big Data
Capacity permitting, the class is open to other graduate students with
an interest in statistics, algorithms and computing who satisfy the
prerequisites.