A reading list for AI systems

compiled by Jeongseob Ahn
Last modified: Fall 2024

Generative AI & LLMs

Serving Systems (& inference acceleration)

Parallelism & Distributed Systems

GPU Cluster Management

Memory Management for Machine Learning

Scheduling & Resource Management

Deep Learning Compilers

Deep Learning Recommendation Models

Hardware Support for ML

ML on Mobile & Embedded Systems

ML Techniques for Systems

Frameworks