Download links for lecture notes, problem sets, solutions, and video summaries for each session will be added as we
progress with the course.
Overview and Symbol Codes
What is the connection between data compression, probabilistic models, and error correction?
We answer this question with some concrete examples of so-called symbol codes.
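For a concrete preview (with a made-up code and message, not an example from the lecture): a symbol code maps each source symbol to a fixed bit string, and as long as no codeword is a prefix of another, the concatenated bits can be decoded unambiguously. A minimal sketch in Python:

```python
# A toy prefix-free symbol code: more probable symbols get shorter codewords.
# (The code and the example message are made up for illustration.)
code = {'a': '0', 'b': '10', 'c': '110', 'd': '111'}

def encode(message, code):
    """Concatenate the codeword of each symbol."""
    return ''.join(code[symbol] for symbol in message)

def decode(bits, code):
    """Greedy decoding works because no codeword is a prefix of another."""
    inverse = {word: symbol for symbol, word in code.items()}
    message, buffer = [], ''
    for bit in bits:
        buffer += bit
        if buffer in inverse:
            message.append(inverse[buffer])
            buffer = ''
    return ''.join(message)

compressed = encode('abacabad', code)
print(len(compressed))                      # 14 bits instead of 16 with a fixed 2-bit code
assert decode(compressed, code) == 'abacabad'
```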
Theoretical Bounds of Lossless Compression (feat. Entropy)
We prove the Source Coding Theorem.
This cornerstone of information theory quantifies information content, and it establishes a fundamental lower
bound on the bit rate of any lossless compression method.
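As a small numerical preview (using a made-up source distribution and the toy code from the sketch above): the bound is the entropy of the source, and no uniquely decodable code can have a smaller expected codeword length.

```python
import math

# Made-up source distribution and the toy prefix code from the previous sketch.
probs = {'a': 0.5, 'b': 0.25, 'c': 0.125, 'd': 0.125}
code = {'a': '0', 'b': '10', 'c': '110', 'd': '111'}

# Entropy H(p) = sum_x p(x) * log2(1 / p(x)), in bits per symbol.
entropy = sum(p * math.log2(1 / p) for p in probs.values())

# Expected codeword length of the symbol code, in bits per symbol.
expected_length = sum(p * len(code[s]) for s, p in probs.items())

print(entropy, expected_length)  # both 1.75 here because all probabilities are powers of 1/2
```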
We prove that the famous Huffman Coding algorithm constructs an optimal symbol code.
Then we introduce an information-theoretic measure of model mismatch, the “KL-divergence”.
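For a rough preview of both topics (plain Python with made-up probabilities; the lecture gives proofs, not code), here is a standard heap-based Huffman construction together with the KL-divergence, read as the excess bit rate we pay when the code is designed for a model q while the data actually follows p:

```python
import heapq
import math

def huffman_code(probs):
    """Build an optimal prefix-free symbol code from symbol probabilities."""
    # Each heap entry: (total probability, tie-breaker, {symbol: codeword so far}).
    heap = [(p, i, {s: ''}) for i, (s, p) in enumerate(probs.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        p0, _, c0 = heapq.heappop(heap)   # merge the two least probable subtrees,
        p1, _, c1 = heapq.heappop(heap)   # prepending one more bit to each codeword
        merged = {s: '0' + w for s, w in c0.items()}
        merged.update({s: '1' + w for s, w in c1.items()})
        heapq.heappush(heap, (p0 + p1, tie, merged))
        tie += 1
    return heap[0][2]

def kl_divergence(p, q):
    """D(p || q) in bits: expected extra code length when modeling p-data with q."""
    return sum(p[s] * math.log2(p[s] / q[s]) for s in p)

# Made-up true source p and a mismatched model q.
p = {'a': 0.5, 'b': 0.25, 'c': 0.125, 'd': 0.125}
q = {'a': 0.25, 'b': 0.25, 'c': 0.25, 'd': 0.25}
print(huffman_code(p))       # an optimal code for p, e.g. {'a': '0', 'b': '10', 'c': '110', 'd': '111'}
print(kl_divergence(p, q))   # 0.25 bits per symbol of overhead for using the wrong model
```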
Mutual Information and Taxonomy of Probabilistic Models
How can we build powerful probabilistic models without sacrificing
efficiency?
We'll discuss various designs after introducing important concepts from probability and information theory.
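One of those concepts is mutual information, which measures how many bits two random variables share. As a minimal sketch (with a made-up joint distribution), it can be computed directly from a joint probability table:

```python
import numpy as np

# Made-up joint distribution p(x, y) as a table; rows index x, columns index y.
p_xy = np.array([[0.4, 0.1],
                 [0.1, 0.4]])

p_x = p_xy.sum(axis=1, keepdims=True)   # marginal p(x)
p_y = p_xy.sum(axis=0, keepdims=True)   # marginal p(y)

# I(X; Y) = sum_{x,y} p(x, y) * log2( p(x, y) / (p(x) * p(y)) )
mutual_information = np.sum(p_xy * np.log2(p_xy / (p_x * p_y)))
print(mutual_information)               # about 0.278 bits
```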
Stream Codes I: Arithmetic Coding And Range Coding
We've proved in Lecture 3 that Huffman Codes are optimal symbol codes.
But it turns out that we can do better than symbol codes—by thinking in fractional bits.
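To see what thinking in fractional bits can buy, here is a deliberately simplified sketch of arithmetic coding (exact rational arithmetic and a made-up i.i.d. model; a real arithmetic or range coder uses fixed-precision integers and can update its model adaptively). The encoder maps the whole message to a single subinterval of [0, 1), so each symbol costs roughly its information content, which need not be a whole number of bits:

```python
from fractions import Fraction
from math import ceil

# Made-up i.i.d. model with non-dyadic probabilities: symbol 'a' carries only
# -log2(2/3) ≈ 0.585 bits of information, which no symbol code can exploit.
probs = {'a': Fraction(2, 3), 'b': Fraction(1, 3)}

def intervals(probs):
    """Partition [0, 1) into one subinterval per symbol, sized by its probability."""
    table, low = {}, Fraction(0)
    for symbol, p in probs.items():
        table[symbol] = (low, low + p)
        low += p
    return table

def encode(message, probs):
    """Narrow [low, high) once per symbol, then emit bits identifying a point inside."""
    table = intervals(probs)
    low, high = Fraction(0), Fraction(1)
    for symbol in message:
        l, h = table[symbol]
        low, high = low + (high - low) * l, low + (high - low) * h
    # The dyadic interval of width 2**-n starting at ceil(low * 2**n) / 2**n
    # fits inside [low, high) once 2**-n <= (high - low) / 2.
    n = 1
    while Fraction(1, 2 ** n) > (high - low) / 2:
        n += 1
    return format(ceil(low * 2 ** n), f'0{n}b')

def decode(bits, probs, num_symbols):
    """Invert the interval narrowing; needs the message length to know when to stop."""
    table = intervals(probs)
    x = Fraction(int(bits, 2), 2 ** len(bits))
    low, high = Fraction(0), Fraction(1)
    message = []
    for _ in range(num_symbols):
        for symbol, (l, h) in table.items():
            if low + (high - low) * l <= x < low + (high - low) * h:
                message.append(symbol)
                low, high = low + (high - low) * l, low + (high - low) * h
                break
    return ''.join(message)

message = 'aab' * 20
bits = encode(message, probs)
print(len(bits))                             # 57 bits for 60 symbols ...
assert decode(bits, probs, len(message)) == message
# ... whereas any symbol code must spend at least 1 bit per symbol, i.e., at least 60 bits.
```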
Stream Codes II: Asymmetric Numeral Systems (ANS)
This recently invented stream code is as performant as range coding while being much easier to implement.
But it has a caveat—or is it a feature?
(Yes, that's a clickbait teaser; sue me.)
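As a rough preview (a non-streaming variant whose coder state is a single big Python integer, with made-up integer symbol frequencies; real implementations keep the state bounded and stream out fixed-size words), the sketch below shows how little code the encoder and decoder need, and it hints at the property behind the teaser: the coder behaves like a stack, so we encode the message back to front in order to decode it front to back.

```python
# A deliberately simplified, non-streaming ANS coder: the whole compressed
# message lives in one big Python integer. Frequencies are made up and sum to M.
freq = {'a': 2, 'b': 1, 'c': 1}
M = sum(freq.values())

# Cumulative frequencies: symbol s owns the slot range [cum[s], cum[s] + freq[s]).
cum, total = {}, 0
for s, f in freq.items():
    cum[s], total = total, total + f

def encode(message, state=1):
    # ANS behaves like a stack (last in, first out), so we push the symbols
    # in reverse order to pop them in the original order when decoding.
    for s in reversed(message):
        f, c = freq[s], cum[s]
        state = (state // f) * M + c + (state % f)
    return state

def decode(state, num_symbols):
    message = []
    for _ in range(num_symbols):
        slot = state % M
        s = next(sym for sym in freq if cum[sym] <= slot < cum[sym] + freq[sym])
        state = freq[s] * (state // M) + slot - cum[s]
        message.append(s)
    return ''.join(message)

state = encode('abacabaa')
print(state.bit_length())          # number of bits in the compressed state
assert decode(state, 8) == 'abacabaa'
```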
We generalize the so-called “bits-back trick” from the ANS algorithm to arbitrary latent variable
models.
This allows us to use latent variables for data compression without paying for them.
Think of it as “short selling” bits.
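In generic notation (a latent variable model p(x, z) = p(z) p(x | z) with an approximate posterior q(z | x); the lecture makes this precise), the accounting behind the trick looks as follows, assuming ideal entropy coders for each step:

```latex
% Bits-back coding:
%   1. decode z from already-compressed bits using q(z | x)   <-- "getting bits back"
%   2. encode x using p(x | z)
%   3. encode z using p(z)
% Expected net number of bits spent on x:
R(x) = \mathbb{E}_{q(z \mid x)}\!\left[
         \log_2 \frac{1}{p(x \mid z)} + \log_2 \frac{1}{p(z)} - \log_2 \frac{1}{q(z \mid x)}
       \right]
     = -\mathrm{ELBO}(x) \;\geq\; \log_2 \frac{1}{p(x)} .
```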
This method for approximate Bayesian inference is a mainstay of modern probabilistic machine learning.
And—curiously—its most natural derivation actually builds on the bits-back coding algorithm.
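In standard notation, the identity that connects the two reads: for any distribution q(z | x) over the latent variable,

```latex
\log_2 p(x)
  \;=\; \underbrace{\mathbb{E}_{q(z \mid x)}\!\left[\log_2 \frac{p(x, z)}{q(z \mid x)}\right]}_{\mathrm{ELBO}(x)}
  \;+\; \underbrace{D_{\mathrm{KL}}\!\bigl(q(z \mid x) \,\big\|\, p(z \mid x)\bigr)}_{\geq\, 0} .
```

Maximizing the ELBO over q therefore simultaneously tightens the bound on log2 p(x), minimizes the bits-back rate from above, and drives q(z | x) towards the true posterior p(z | x).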
We extend variational inference so that we learn both the generative model from training data and how to do
fast inference in the learned model.
This results in a popular class of models for neural image and video compression.
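For readers who want to peek ahead, here is a minimal sketch of such a model, a variational autoencoder, in PyTorch. The architecture, dimensions, and Bernoulli likelihood are arbitrary illustrative choices rather than the models used in the lecture; the training loss is just the negative ELBO from above (in nats rather than bits):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyVAE(nn.Module):
    """Minimal VAE: amortized Gaussian q(z | x), Bernoulli likelihood p(x | z)."""

    def __init__(self, data_dim=784, latent_dim=16, hidden_dim=256):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(data_dim, hidden_dim), nn.ReLU())
        self.enc_mean = nn.Linear(hidden_dim, latent_dim)
        self.enc_log_var = nn.Linear(hidden_dim, latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, data_dim))

    def negative_elbo(self, x):
        # Amortized inference: one network pass yields q(z | x) for any x.
        h = self.encoder(x)
        mean, log_var = self.enc_mean(h), self.enc_log_var(h)
        # Reparameterization trick: sample z while keeping gradients.
        z = mean + torch.exp(0.5 * log_var) * torch.randn_like(mean)
        logits = self.decoder(z)
        # Single-sample estimate of E_q[-log p(x | z)].
        reconstruction = F.binary_cross_entropy_with_logits(logits, x, reduction='sum')
        # KL(q(z | x) || N(0, I)) in closed form.
        kl = -0.5 * torch.sum(1 + log_var - mean.pow(2) - log_var.exp())
        return (reconstruction + kl) / x.shape[0]

# Usage sketch: one gradient step on a made-up batch of binary data.
model = TinyVAE()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.rand(32, 784).round()      # stand-in for a real training batch
optimizer.zero_grad()
loss = model.negative_elbo(x)
loss.backward()
optimizer.step()
```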
Lossy Compression: From VAEs to Rate/Distortion Theory
We implement our first lossy compression method and observe a lower bit rate than with lossless
compression.
We then derive a theoretical bound for the bit rate of lossy compression.
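In standard notation (with a distortion measure d that has to be chosen per application), this bound is the rate/distortion function:

```latex
R(D) \;=\; \min_{\,p(\hat{x} \mid x) \,:\, \mathbb{E}[d(X, \hat{X})] \,\le\, D\,} I(X; \hat{X}) .
```

The converse part of rate/distortion theory states that no lossy code can achieve expected distortion at most D at a rate below R(D) bits per symbol.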
Channel Coding Theorem and Source-Channel Separation
We take a step back from compression and consider the wider problem of efficient
communication.
Our discussion also reveals how meaningful the theoretical bound for lossy compression is.
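The central quantity here is the channel capacity; in standard notation, for a discrete memoryless channel with input X and output Y,

```latex
C \;=\; \max_{p(x)} I(X; Y) .
```

Roughly, the Channel Coding Theorem states that reliable communication over a noisy channel is possible at any rate below C and impossible at any rate above it, and source-channel separation lets us split the overall problem into compression followed by channel coding.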
I am excited that Lucas Theis agreed to give a guest lecture.
He is a pioneer of ML-based compression, and at the forefront of novel methods
like diffusion models & universal quantization.