Joshi, Gauri.

Optimization Algorithms for Distributed Machine Learning [electronic resource] / by Gauri Joshi. - 1st ed. 2023. - XIII, 127 p. 40 illus., 38 illus. in color. online resource. - Synthesis Lectures on Learning, Networks, and Algorithms, 2690-4314 . - Synthesis Lectures on Learning, Networks, and Algorithms, .

Distributed Optimization in Machine Learning -- Calculus, Probability and Order Statistics Review -- Convergence of SGD and Variance-Reduced Variants -- Synchronous SGD and Straggler-Resilient Variants -- Asynchronous SGD and Staleness-Reduced Variants -- Local-update and Overlap SGD -- Quantized and Sparsified Distributed SGD -- Decentralized SGD and its Variants.

This book discusses state-of-the-art stochastic optimization algorithms for distributed machine learning and analyzes their convergence speed. The book first introduces stochastic gradient descent (SGD) and its distributed version, synchronous SGD, where the task of computing gradients is divided across several worker nodes. The author discusses several algorithms that improve the scalability and communication efficiency of synchronous SGD, such as asynchronous SGD, local-update SGD, quantized and sparsified SGD, and decentralized SGD. For each of these algorithms, the book analyzes its error versus iterations convergence, and the runtime spent per iteration. The author shows that each of these strategies to reduce communication or synchronization delays encounters a fundamental trade-off between error and runtime.

9783031190674

10.1007/978-3-031-19067-4 doi


Algorithms.
Machine learning.
Artificial intelligence.
Distribution (Probability theory).
Computer science.
Algorithms.
Machine Learning.
Design and Analysis of Algorithms.
Artificial Intelligence.
Distribution Theory.
Computer Science.

QA76.9.A43

518.1