直播预告 | 大模型专场二活动开启

点击蓝字

c55c068ba70973ba4cb0a09585c7c274.jpeg

关注我们

AI TIME欢迎每一位AI爱好者的加入!

897feb90812c00c056964dadefb273c1.jpeg

哔哩哔哩直播通道

扫码关注AI TIME哔哩哔哩官方账号预约直播

c8931398ef9ecde4ae9017df15cba399.gif

✦+

+

活动时间

扫描二维码关注公众号,回复: 14650371 查看本文章

4月7日  10:00-11:00

+

邀请嘉宾

a839a16532f3bd6e7cc6f83572ff9f63.jpeg

讲者简介

Greg Yang

Greg Yang is a researcher at Microsoft Research in Redmond, Washington. He joined MSR after he obtained Bachelor's in Mathematics and Master's degrees in Computer Science from Harvard University, respectively advised by ST Yau and Alexander Rush. He won the Hoopes prize at Harvard for best undergraduate thesis as well as Honorable Mention for the AMS-MAA-SIAM Morgan Prize, the highest honor in the world for an undergraduate in mathematics. He gave an invited talk at the International Congress of Chinese Mathematicians 2019.

报告题目

The unreasonable effectiveness of mathematics in large scale deep learning

报告简介

Recently, the theory of infinite-width neural networks led to the first technology, muTransfer, for tuning enormous neural networks that are too expensive to train more than once. For example, this allowed us to tune the 6.7 billion parameter version of GPT-3 using only 7% of its pretraining compute budget, and with some asterisks, we get a performance comparable to the original GPT-3 model with twice the parameter count. In this talk, I will explain the core insight behind this theory. In fact, this is an instance of what I call the *Optimal Scaling Thesis*, which connects infinite-size limits for general notions of “size” to the optimal design of large models in practice, illustrating a way for theory to reliably guide the future of AI. I'll end with several concrete key mathematical research questions whose resolutions will have incredible impact on how practitioners scale up their NNs.

请添加“AI TIME小助手(微信号:AITIME_HY)”,回复“大模型”,将拉您进群!

c9882a6a47b5517c9028ee77c749414f.gif

AI TIME微信小助手

b9f117cfbdcd78e0866c4d8b1ec18e7f.jpeg

往期精彩文章推荐

b03bc6b1227f0508f263c3357377286d.jpeg

记得关注我们呀!每天都有新知识!

 关于AI TIME 

AI TIME源起于2019年,旨在发扬科学思辨精神,邀请各界人士对人工智能理论、算法和场景应用的本质问题进行探索,加强思想碰撞,链接全球AI学者、行业专家和爱好者,希望以辩论的形式,探讨人工智能和人类未来之间的矛盾,探索人工智能领域的未来。

迄今为止,AI TIME已经邀请了1000多位海内外讲者,举办了逾500场活动,超500万人次观看。

0fdea2faf0c7a29810d4d2a30296ffa7.png

我知道你

在看

~

4fea84cb59e0dfc7259bdd1645bcacd7.gif

点击 阅读原文 预约直播!

猜你喜欢

转载自blog.csdn.net/AITIME_HY/article/details/129965080