A reading list for machine learning systems

compiled by Jeongseob Ahn
Last modified: Spring 2022

Frameworks

Parallelism & Distributed Systems

GPU Cluster Management

Memory Management for Machine Learning

Scheduling & Resource Management

Serving Systems (& inference acceleration)

Very Large Models

Deep Learning Recommendation Models

Hardware Support for ML

ML at Mobile & Embedded Systems

ML Techniques for Systems