Kavli Affiliate: Jia Liu | First 5 Authors: Menglu Yu, Ye Tian, Bo Ji, Chuan Wu, Hridesh Rajan | Summary: Fueled by advances in distributed deep learning (DDL), recent years have witnessed a rapidly growing demand for resource-intensive distributed/parallel computing to process DDL computing jobs. To resolve network communication bottleneck and load balancing issues in […]
Continue.. GADGET: Online Resource Optimization for Scheduling Ring-All-Reduce Learning Jobs