Optimization for Machine Learning
COL870/COL8385 is a 3-credit Special Topics course in Machine Learning. The course will cover topics in optimization in both offline and online settings. The material will be motivated throughout by applications to modern machine learning problems, and will include both foundational ideas and advanced topics.
This course is intended for both (post)graduate and undergraduate students interested in the optimization foundations of machine learning. A basic level of mathematical maturity is expected; students with concerns about their background should consult the instructor.
A fundamental understanding of linear algebra, calculus, and probability theory is required for this course. Prior experience or familiarity with machine learning is also beneficial, though not mandatory.
Upon completing this course, students will be familiar with key concepts in optimization, enabling them to:
Understand fundamental concepts such as convex sets, convex functions, and optimality criteria for optimization problems
Understand first-order optimization algorithms and analyze their convergence properties
Gain familiarity with the foundational concepts in online learning
Key topics include convex sets and functions, conjugates, subdifferentials, primal and dual problem formulations, strong and weak duality, minimax characterizations, and optimality conditions including the Karush-Kuhn-Tucker (KKT) criteria. The course will also cover first-order optimization methods such as gradient descent, stochastic gradient descent (SGD), accelerated gradient techniques, subgradient methods, and Frank-Wolfe algorithms. Additionally, foundational concepts in online learning will be explored, including online gradient descent, online mirror descent, Follow-The-Regularized-Leader (FTRL), and parameter-free algorithms.
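As a small illustration of the first-order methods listed above, here is a minimal gradient descent sketch on a convex quadratic. The objective, step size, and iteration count are illustrative choices for this example, not material taken from the course:

```python
import numpy as np

def gradient_descent(grad, x0, step_size, n_steps):
    """Fixed-step gradient descent: x_{t+1} = x_t - eta * grad(x_t)."""
    x = x0
    for _ in range(n_steps):
        x = x - step_size * grad(x)
    return x

# Example objective: f(x) = 0.5 * x^T A x - b^T x with A positive definite,
# so f is convex and its unique minimizer solves A x = b.
A = np.array([[3.0, 0.0], [0.0, 1.0]])
b = np.array([3.0, 1.0])
grad_f = lambda x: A @ x - b  # gradient of f

# Step size 0.2 is below 2/L (L = 3, the largest eigenvalue of A),
# which guarantees convergence for this quadratic.
x_hat = gradient_descent(grad_f, np.zeros(2), step_size=0.2, n_steps=200)
print(x_hat)  # close to the minimizer x* = (1, 1)
```

The same loop structure reappears throughout the course: SGD replaces the exact gradient with a stochastic estimate, and online gradient descent applies the update to a sequence of changing loss functions.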
| Component | Weight |
| --- | --- |
| Midsemester Exam | 30% |
| Final Exam | 30% |
| Assignments | 25% |
| Scribe | 10% |
| In-class participation | 5% |
A minimum of 75% attendance and marks equivalent to a grade of B or above are required for an audit pass.
Each student will be assigned a lecture to scribe.
Scribing must be done in LaTeX, and I will provide the required style files and template.
Scribed notes will be due within one week of the lecture date.
The grade for in-class participation will be based on both attendance and active engagement in class discussions.
The first five minutes of each lecture will be devoted to reviewing the summary and unsolved exercises from the previous lecture, providing an opportunity for students to contribute to the discussion.
Collaboration and discussion among students are strongly encouraged. However, all deliverables will be graded individually. You may acknowledge your collaborators by including their names in the acknowledgement section.
Plagiarism in any form (including the use of AI tools) is strictly prohibited. Any violation of this policy will result in a score of 0 for the entire course.
Convex Optimization. S. Boyd and L. Vandenberghe. Cambridge University Press, Cambridge, 2004.
Introductory Lectures on Convex Optimization: A Basic Course. Y. Nesterov. Kluwer, 2004.
Numerical Optimization. J. Nocedal and S. J. Wright, Springer Series in Operations Research, Springer-Verlag, New York, 2006 (2nd edition).
A Modern Introduction to Online Learning. Francesco Orabona. arXiv preprint arXiv:1912.13213 (2019).