A reading list for machine learning systems

compiled by Jeongseob Ahn
Last modified: Winter 2023

Generative & LLMs

Serving Systems (& inference acceleration)

Parallelism & Distributed Systems

GPU Cluster Management

Memory Management for Machine Learning

Scheduling & Resource Management

Deep Learning Compiler

Deep Learning Recommendation Models

Hardware Support for ML

ML at Mobile & Embedded Systems

ML Techniques for Systems

Frameworks