[Paper Study 8] TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning

Category: Other | Posted: 2020-03-23 12:34:11

Talk abstract (same as the paper abstract)

Author's video presentation: https://www.youtube.com/watch?v=WWWQXTb_69c&feature=youtu.be&t=20s

Abstract

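The bottleneck in distributed training is the high network communication cost of synchronizing gradients and parameters. In this paper, we propose TernGrad, which uses ternary gradients to accelerate distributed deep learning. Each gradient is quantized to one of three levels, {-1, 0, 1}, which sharply reduces communication time. We mathematically prove the convergence of TernGrad under the assumption that the gradients are bounded. Guided by this bound, we propose layer-wise ternarizing and gradient clipping to improve convergence. Experiments show that applying TernGrad to AlexNet incurs no accuracy loss and can even improve accuracy.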
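The abstract does not spell out the quantization rule, so here is a minimal NumPy sketch of how stochastic ternarization is usually described for TernGrad. It assumes, based on my reading of the paper and not on anything stated in this note, that the scaler is `s = max(|g|)`, that each element is kept with probability `|g_k| / s` (so the quantized gradient is unbiased), and that clipping bounds each element to `c * sigma` with `c = 2.5`. The function names `clip_gradient`, `ternarize`, and `ternarize_layerwise` are hypothetical and are not taken from the authors' repository.

```python
import numpy as np


def clip_gradient(grad, c=2.5):
    """Clip each element to [-c*sigma, c*sigma], where sigma is the
    standard deviation of this gradient tensor (c=2.5 is an assumption)."""
    sigma = grad.std()
    return np.clip(grad, -c * sigma, c * sigma)


def ternarize(grad, rng):
    """Stochastically quantize grad to s * {-1, 0, 1}.

    Returns (ternary, s) where ternary is an int8 array and s is a float
    scaler. The expectation of s * ternary equals grad.
    """
    s = np.abs(grad).max()
    if s == 0.0:
        # All-zero gradient: nothing to quantize.
        return np.zeros_like(grad, dtype=np.int8), 0.0
    prob = np.abs(grad) / s                # keep probability per element
    keep = rng.random(grad.shape) < prob   # Bernoulli(prob) mask
    ternary = (np.sign(grad) * keep).astype(np.int8)
    return ternary, float(s)


def ternarize_layerwise(grads, rng, c=2.5):
    """Clip and ternarize each layer separately, so every layer gets its
    own scaler instead of sharing one across the whole model."""
    return [ternarize(clip_gradient(g, c), rng) for g in grads]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    layers = [rng.normal(size=(4, 3)), rng.normal(scale=0.01, size=8)]
    for ternary, s in ternarize_layerwise(layers, rng):
        print(s, ternary.ravel())
```

Under these assumptions a worker would only need to send the ternary values (fewer than 2 bits per element if packed) plus one floating-point scaler per layer, instead of a 32-bit float per element. Using a separate scaler per layer keeps a single layer with large gradients from pushing most elements of the other layers to zero.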

Author's homepage: http://www.pittnuts.com/

Slides: https://github.com/wenwei202/terngrad/blob/master/NIPS17-TernGrad-slides-v3.pdf

Reposted from www.cnblogs.com/20189223cjt/p/12551296.html
