Microsoft open-sources deep learning optimization library DeepSpeed, enabling training of models with over 100 billion parameters

The latest trend in artificial intelligence is that larger natural language models deliver better accuracy, but large models are hard to train because of cost, time, and code-integration barriers. Microsoft has open sourced DeepSpeed, a deep learning optimization library that improves scale, speed, and usability while lowering cost, so that deep learning models with more than 100 billion parameters can be trained on current-generation GPU clusters, greatly advancing the training of large models. At the same time, compared with the latest technology, it improves system performance by more than 5 times.

According to Microsoft's description, the DeepSpeed library includes a component named ZeRO (Zero Redundancy Optimizer), a new parallel optimizer that greatly reduces the resources needed for model and data parallelism while substantially increasing the number of parameters that can be trained. Using these breakthroughs, the researchers created the Turing Natural Language Generation model (Turing-NLG), the largest publicly known language model, with 17 billion parameters.

As part of DeepSpeed, ZeRO is a new large-scale memory optimization technique for distributed deep learning. It can train deep learning models with 100 billion parameters on current GPU clusters, with three to five times the throughput of the current best systems. It also provides a clear path toward training models with trillions of parameters.

ZeRO optimization has three main stages, corresponding to the partitioning of the optimizer state, the gradients, and the parameters.

ZeRO overcomes the limitations of data parallelism and model parallelism while achieving the advantages of both. It eliminates the memory redundancy across data-parallel processes by partitioning the model state (parameters, gradients, and optimizer state) across those processes instead of replicating it. During training it uses a dynamic communication schedule to share the necessary state among the distributed devices, while retaining the computational granularity and communication volume of data parallelism.
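To see why this partitioning matters, the sketch below gives a back-of-the-envelope estimate of per-GPU memory for the model state under mixed-precision Adam training, using the accounting from the ZeRO paper (2 bytes per parameter each for fp16 weights and gradients, roughly 12 bytes per parameter of fp32 optimizer state). The helper function and the example numbers are illustrative assumptions, not part of DeepSpeed.

```python
# Illustrative memory accounting for ZeRO-style partitioning of the model state.
# Assumes mixed-precision Adam: 2 bytes/param (fp16 weights) + 2 bytes/param (fp16 grads)
# + 12 bytes/param (fp32 master weights, momentum, variance). These constants follow the
# ZeRO paper's accounting; the function itself is a sketch, not a DeepSpeed API.

def model_state_gb(params, num_gpus, stage):
    """Approximate per-GPU model-state memory (GB) for a given ZeRO stage."""
    p, g, opt = 2 * params, 2 * params, 12 * params   # bytes held per data-parallel replica
    if stage >= 1:                                     # stage 1: partition optimizer state
        opt /= num_gpus
    if stage >= 2:                                     # stage 2: also partition gradients
        g /= num_gpus
    if stage >= 3:                                     # stage 3: also partition parameters
        p /= num_gpus
    return (p + g + opt) / 1e9

params = 1.5e9  # e.g. a GPT-2-sized model
for stage in range(4):
    print(f"ZeRO stage {stage}: {model_state_gb(params, num_gpus=64, stage=stage):.1f} GB per GPU")
```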

The current implementation covers the first stage of ZeRO, optimizer state partitioning (ZeRO-OS), which is powerful enough to support 100-billion-parameter models; this stage is released together with DeepSpeed.
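For orientation, enabling this first stage is done through DeepSpeed's JSON configuration. The sketch below builds such a configuration as a Python dictionary and writes it out; the specific field values (batch size, optimizer settings) are illustrative assumptions, and the exact schema should be checked against the DeepSpeed documentation.

```python
# Minimal sketch of a DeepSpeed configuration enabling ZeRO stage 1 (optimizer state
# partitioning) together with mixed precision. Field values are illustrative only.
import json

ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},                       # mixed-precision training
    "zero_optimization": {"stage": 1},               # ZeRO-OS: partition optimizer state
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

# DeepSpeed normally reads this from a JSON file passed to the training script.
with open("ds_config.json", "w") as f:
    json.dump(ds_config, f, indent=2)
```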

DeepSpeed is compatible with PyTorch; the DeepSpeed API is a lightweight wrapper on top of PyTorch, which means developers can use everything in PyTorch without having to learn a new platform. In addition, DeepSpeed manages all the boilerplate of state-of-the-art training techniques, such as distributed training, mixed precision, gradient accumulation, and checkpointing, so developers can focus on model development. At the same time, developers only need to change a few lines of code in a PyTorch model to take advantage of DeepSpeed's unique efficiency and effectiveness gains in speed and scale.
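A minimal sketch of what those few lines of change look like, assuming the configuration file from the previous sketch; the toy model and random data are placeholders rather than code from Microsoft's release, and argument names can differ slightly between DeepSpeed versions.

```python
# Sketch of wrapping an existing PyTorch model with DeepSpeed. deepspeed.initialize
# returns an "engine" that handles distributed training, mixed precision, gradient
# accumulation, and ZeRO behind the usual forward/backward/step pattern.
import torch
import deepspeed

model = torch.nn.Linear(1024, 1024)                  # stand-in for a real PyTorch model

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config="ds_config.json",                         # the configuration sketched above
)

dtype = next(model_engine.parameters()).dtype        # fp16 when mixed precision is on
for step in range(10):                               # stand-in for a real data loader
    inputs = torch.randn(8, 1024, device=model_engine.device, dtype=dtype)
    targets = torch.randn(8, 1024, device=model_engine.device, dtype=dtype)

    loss = torch.nn.functional.mse_loss(model_engine(inputs), targets)

    model_engine.backward(loss)                      # replaces loss.backward()
    model_engine.step()                              # replaces optimizer.step()
```

Such a script is launched with DeepSpeed's command-line launcher rather than plain python, which sets up the distributed environment across the available GPUs.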

DeepSpeed performs well in the following four areas:

  • Scale: state-of-the-art large models such as OpenAI GPT-2, NVIDIA Megatron-LM, and Google T5 have 1.5 billion, 8.3 billion, and 11 billion parameters respectively, while the first stage of ZeRO in DeepSpeed provides the system support to run models with up to 100 billion parameters, 10 times larger than the current most advanced models. Support for the second and third stages of ZeRO is planned, which will provide the ability to train models with up to 200 billion or even trillions of parameters.
  • Speed: on various hardware, the observed throughput is up to 5 times higher than the current state of the art. For example, to train large models on GPT-family workloads, DeepSpeed combines ZeRO-powered data parallelism with NVIDIA Megatron-LM model parallelism; on NVIDIA GPU clusters with low-bandwidth interconnects (without NVIDIA NVLink or InfiniBand), DeepSpeed improves throughput by 3.75 times over using Megatron-LM alone for a standard GPT-2 model with 1.5 billion parameters. On NVIDIA DGX-2 clusters with high-bandwidth interconnects, it is 3 to 5 times faster for models with 20 to 80 billion parameters. These throughput gains come from DeepSpeed's higher memory efficiency and its ability to fit these models using a lower degree of model parallelism and larger batch sizes.
  • Cost: higher throughput means significantly lower training costs; for example, to train a model with 20 billion parameters, DeepSpeed needs only three quarters of the resources originally required.
  • Ease of use: only a few lines of code need to change for a PyTorch model to use DeepSpeed and ZeRO. Compared with current model parallelism libraries, DeepSpeed does not require redesigning the code or refactoring the model, and it places no restrictions on model dimensions, batch size, or any other training parameters. For models with up to 6 billion parameters, the data parallelism provided by ZeRO can be used conveniently on its own, without model parallelism, whereas standard data parallelism runs out of memory for models with more than 1.3 billion parameters. The second and third stages of ZeRO will further increase the model size that can be trained with data parallelism alone. In addition, DeepSpeed supports flexible combination of ZeRO-powered data parallelism with model parallelism.

For a more detailed introduction, see Microsoft's blog:

https://www.microsoft.com/en-us/research/blog/zero-deepspeed-new-system-optimizations-enable-training-models-with-over-100-billion-parameters

Source: www.oschina.net/news/113328/microsoft-opensource-deepspeed