IntSGD: Floatless Compression of Stochastic Gradients

Posted on 17 Feb, 2021 by admin

February, 2021

Abstract

We propose a family of lossy integer compressions for Stochastic Gradient Descent (SGD) that do not communicate a single float. This is achieved by multiplying floating-point vectors by a number known to every device and then rounding the result to integers. Our theory shows that, when the vectors are scaled properly, the iteration complexity of SGD does not change up to constant factors. Moreover, this holds for both convex and non-convex functions, with and without overparameterization. In contrast to other compression-based algorithms, ours preserves the convergence rate of SGD even on non-smooth problems. Finally, we show that when the data is significantly heterogeneous, it may become increasingly hard to keep the integers bounded, and we propose an alternative algorithm, IntDIANA, to solve this type of problem.
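
The scale-then-round idea described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the function names `int_compress` and `int_aggregate` are hypothetical, the scaling factor `alpha` is a fixed constant here (the abstract only says the vectors must be "scaled properly" and does not give the rule), and unbiased randomized rounding is an assumption made for the illustration.

```python
import math
import random

def int_compress(vec, alpha, rng):
    """Scale by the shared factor alpha, then round each entry to an integer.

    Assumption: randomized rounding. Rounding up with probability equal to
    the fractional part keeps the expected value equal to alpha * x.
    """
    out = []
    for x in vec:
        scaled = alpha * x
        low = math.floor(scaled)
        # Round up with probability (scaled - low), down otherwise.
        out.append(low + (1 if rng.random() < scaled - low else 0))
    return out

def int_aggregate(worker_vecs, alpha, rng):
    """Average the workers' vectors while exchanging only integers."""
    n = len(worker_vecs)
    compressed = [int_compress(v, alpha, rng) for v in worker_vecs]
    # Coordinate-wise integer sum, standing in for an integer all-reduce.
    int_sum = [sum(col) for col in zip(*compressed)]
    # alpha and n are known to every device, so decoding is done locally.
    return [s / (alpha * n) for s in int_sum]

rng = random.Random(0)
grads = [[0.12, -0.5, 0.031], [0.10, -0.4, 0.029]]  # made-up gradients
alpha = 1000.0  # hypothetical shared scaling factor
avg = int_aggregate(grads, alpha, rng)
```

In this sketch each decoded coordinate differs from the true average by less than `1 / alpha`, so a larger `alpha` gives a more accurate result at the cost of larger integers. That trade-off is the reason the choice of scaling matters.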

Attachment:

IntSGD Floatless Compression of Stochastic Gradients.pdf

Resource Type:

Academic Paper

Tags:

Machine Learning

Stochastic Gradient Descent

IntSGD

Floatless Compression

Stochastic Gradients
