Convergence Of Variance-Reduced Learning Under Random Reshuffling

Ying, BichengYuan, KunSayed, Ali H.2018-12-132018-12-132018-12-132018-01-0110.1109/ICASSP.2018.8461739https://infoscience.epfl.ch/handle/20.500.14299/152297WOS:000446384602093Several useful variance-reduced stochastic gradient algorithms, such as SVRG, SAGA, Finito, and SAG, have been proposed to minimize empirical risks with linear convergence properties to the exact minimizers. The existing convergence results assume uniform data sampling with replacement. However, it has been observed that random reshuffling can deliver superior performance and, yet, no formal proofs or guarantees of exact convergence exist for variance-reduced algorithms under random reshuffling. This paper makes two contributions. First, it resolves this open issue and provides the first theoretical guarantee of linear convergence under random reshuffling for SAGA; the argument is also adaptable to other variance-reduced algorithms. Second, under random reshuffling, the paper proposes a new amortized variance-reduced gradient (AVRG) algorithm with constant storage requirements compared to SAGA and with balanced gradient computations compared to SVRG. AVRG is also shown analytically to converge linearly.AcousticsEngineering, Electrical & ElectronicEngineeringrandom reshufflingvariance-reductionstochastic gradient descentlinear convergenceConvergence Of Variance-Reduced Learning Under Random Reshufflingtext::conference output::conference proceedings::conference paper