Introduction to Machine Learning (NYU Paris, Summer School 2022)
Machine Learning is getting more and more important these days with applications ranging from autonomous driving to computer assisted medicine, including weather or financial forecasting. In this class we will study the mathematical foundations of the current machine learning algorithms.
We will cover the main models from both supervised learning including linear and non linear regression and classification (kernel methods, support vector machine, neural networks) and unsupervised learning (including clustering, gaussian mixtures, self organizing maps, principal and independent component analysis and non linear dimensionality reduction)
We will review basic concepts in probability and statistics. We will discuss Bayesian vs frequentist statistics and model/parameter inference, as well as sampling methods.
Finally, we will also discuss the important question of model assessment and selection.
The class will follow the structure
1. Lectures (introduction of the new material that will be needed during the lab sessions and for the assignements)
2. Programming (lab) sessions, (you have the opportunity to apply what you have learned during the lecture, and you can ask all the questions you want to make sure you understand everything before the assignement)
3. Assignments (You are given a new problem and you are evaluated on your ability to use the course material to solve this new problem)
Schedule and Classroom
Lecture: Tues/Wedn./Thursd., 10.00am12.30pm (Paris Time), Zoom
Recitations: Friday 10.00am12.30pm (Paris time). Zoom
Office hour: TBA (Paris time)
Assignments policy
Except if explicitely stated otherwise, assignments are due at the beginning of each class.
Current (temporary) version of the notes: Lecture notes as well as the list of sections for the Final
Practice (theory) Questions for each exam can be found by clicking on those exams below
Exam : 60% of the grade (30% Midterm (Material), 30% Final(Material)), Sample Midterm 1, Sample Final
, Exams:
Enter password:
Midterm I
Exams:
Enter password:
Midterm II
Exams:
Enter password:
Midterm III
Exams:
Enter password:
Final I
Exams:
Enter password:
Final II
Assignments : 30 % of the grade (Tentative schedule below)
Final Project : 10 % of the grade (Tentative schedule below, List of suggestions, Poster guidelines)
The Github repo for the class can be accessed at https://github.com/acosse/Intro2ML_Summer2021 and will be used for the lab and the assignments. You can also click on each “Lab” in the schedule below which will display a rendering of the notebooks through nbviewer. To access the file itself (and to be able to download it), you should go directly to github
Tentative schedule:
Legend: Lab sessions are in green, Homeworks are in red (right side of the table), dates related to the project are in orange.
Week #  date  Topic  Assignements 
Week 1 Tue/Wed
 05/24, 05/25  General Intro + reminders on proba and inference. Class demo 1 Part I, 
Readings 
Part I : supervised Learning  
Week 1 Th/Fr 
05/25, 05/27  Linear and logistic regression, regularization and Compressed sensing Linear Classification Part I, Part II, Note on the BiasVariance tradeoff Demos Weeks 1 and 2 Additional Note Ridge vs LASSO 
Readings Assignment 1 
Week 2 Tue/Wed 
05/31, 06/01 
Linear and logistic regression, Linear Classification (Notes 1, Notes 2,
Notes 3) Demo 2, Demo 3, Demo Logistic vs OLS: demo1, demo2 Demo BiasVariance (Part I) Additional Note on MVN/GDA 
Readings 
Week 2 Th/Fr 
06/01, 06/03  Non Linear classification, Kernel methods, SVM, Part I, Part II,
Part III
Lab 3, (Partial) Solutions 
Readings
Assign. 2, Assig. 1 due 
Week 3 Tue/Wed 
06/06, 06/07  Neural Networks, Optimization, Stochastic Optimization, Deep learning, Part I,
Part II
Lab 4 (Part I) / Solutions (Part I), Solutions (Part II) 
Readings 
Week 3 Th/Fr 
06/07, 06/10  Lab 2: Non Linear regression and classification, Neural Nets
Lab 5 Zoom Slides 
Project choice MidTerm Revisions 
Part II : Unsupervised Learning  
Week 4 Tue/Wed 
06/14, 06/15  Clustering, Linear Latent variable models Slides
Lab 6 
Readings 
Week 4 Th/Fr 
06/15, 06/17  Linear Latent variable models (Part II), PCA, ICA, GMM, EM algorithm, Non linear LVM, Part I Part II, Additional Note on MVN Demos FA/PCA , Handwritten notes 

Week 5 Tue/Wed 
06/21, 06/22  Non Linear LVM and Manifold Learning Parts 1&2
Lab 7 
Readings 
Week 5 Th/Fr 
06/22, 06/24  Lab 3: Unsupervised Learning  Readings 
Week 06 Tue/Wed 
06/28, 06/29  Advanced Topics: Reinforcement Learning  
Week 6 Th/Fr 
06/29, 07/01  Advanced topics: Probabilistic models, HMM, Bayesian Nets  
Lab Sessions and programming policy
The lab sessions will require you to do some programming. It is strongly recommended to use python as it is more flexible and will be useful to you when moving to pytorch later on for more advanced machine learning methods requiring GPU processing.
Downloading and getting started with Python.
Data sets can be downloaded on the following websites: