Slides 2024

Outline

The 2024 course consists of the following topics

Introduction.
Overview of Mathematics of Data
Empirical Risk Minimization
Statistical Learning with Maximum Likelihood Estimators

Generalized linear model
Linear regression
M-estimator examples

Linear algebra reminder
Convexity and Gradients
Convergence rates and convergence plots

Principles of iterative descent methods
Structures in optimization
Gradient descent methods

Optimality of convergence rates
Lower bounds
Accelerated gradient descent
Newton and Adaptive methods
Tensor methods

Stochastic gradient descent
Concise signal models
Compressive sensing
Sample complexity bounds for estimation and prediction
Challenges to optimization algorithms for non-smooth optimization
Subgradient method

Composite minimization
Proximal gradient methods
Introduction to Frank-Wolfe method

Variance reduction
Introduction to deep learning
Challenges in deep learning theory and applications

The classical trade-off between model complexity and risk
Generalization bounds via uniform convergence
Generalization in deep learning
Implicit regularization of optimization algorithms
Double descent
Scaling Laws

Adaptive gradient methods
Scalable non-convex optimization

Adversarial machine learning
Wasserstein generative adversarial networks
Difficulty of minimax optimization.

Convergence of minmax
Diffusion models
Robustness in deep learning

Primal-dual optimization-I: Fundamentals of minimax problems
Fenchel conjugates
Duality

Primal-dual optimization-II: Augmented Lagrangian grandient methods
Semi-definite programming
HCGM and CGAL algorithms

Language models: Basis of language models.
Self attention and Transformer
GTP family