Abstract: When run in a Parameter Server (PS) architecture, Distributed Stochastic Gradient Descent (D-SGD) incurs significant communication delays and high communication overhead due to the model ...