ChatGPT in practice and private deployment of large models

Current state of large models

Choosing a base model

Data composition

  • Domain data
  • Book data
  • Website data
  • News content
  • Instruction fine-tuning data

Training on mixed data (public data + domain data, ratio 1:5) avoids the knowledge forgetting that would otherwise degrade the model's general capabilities.
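
The mixing step can be sketched as follows. This is a minimal illustration, not the author's pipeline: it assumes the 1:5 ratio means one public sample for every five domain samples, and the function name `mix_datasets` and the toy records are hypothetical.

```python
import random


def mix_datasets(public, domain, public_parts=1, domain_parts=5, seed=0):
    """Return a shuffled mix whose public:domain sample counts follow the ratio.

    Assumption: the ratio is interpreted as public:domain = 1:5.
    """
    rng = random.Random(seed)
    # Largest number of whole "ratio units" that both sources can supply.
    units = min(len(public) // public_parts, len(domain) // domain_parts)
    mixed = rng.sample(public, units * public_parts)
    mixed += rng.sample(domain, units * domain_parts)
    rng.shuffle(mixed)  # interleave the two sources
    return mixed


# Toy records tagged with their source so the ratio can be checked.
public = [("public", i) for i in range(100)]
domain = [("domain", i) for i in range(100)]

mixed = mix_datasets(public, domain)
n_public = sum(1 for source, _ in mixed if source == "public")
n_domain = len(mixed) - n_public
print(n_public, n_domain)
```

Sampling by whole ratio units keeps the proportion exact; in practice the ratio itself is a hyperparameter worth tuning against a general-capability benchmark.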

Transfer method

  • When resources are limited, continue training from the Chat model.

  • When resources are sufficient, train the Base model on tens of millions of samples. Do not train the Chat model on the full dataset.

Evaluation

Reflections


Training techniques for domain-specific large models

Tokenizer

Distributed Deep Learning

Data parallelism

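As a rough illustration of data parallelism: every worker holds a full copy of the model, computes gradients on its own shard of the batch, and the gradients are averaged before the update. The sketch below simulates this in plain Python for a one-parameter model; the helper names are hypothetical, the plain average stands in for an all-reduce, and it assumes the batch divides evenly among workers.

```python
def grad_mse(w, xs, ys):
    """Gradient of the mean squared error for the model y = w * x."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def data_parallel_step(w, xs, ys, num_workers, lr):
    """One SGD step where each simulated worker handles one batch shard."""
    shard = len(xs) // num_workers  # assumes an even split
    grads = []
    for k in range(num_workers):
        lo, hi = k * shard, (k + 1) * shard
        # Each worker uses the same weights w but different data.
        grads.append(grad_mse(w, xs[lo:hi], ys[lo:hi]))
    avg_grad = sum(grads) / num_workers  # stands in for an all-reduce
    return w - lr * avg_grad


xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

w_parallel = data_parallel_step(0.0, xs, ys, num_workers=2, lr=0.01)
w_single = 0.0 - 0.01 * grad_mse(0.0, xs, ys)
print(w_parallel, w_single)
```

With equal-sized shards the averaged gradient equals the full-batch gradient, so the parallel step matches the single-device step.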

Pipeline parallelism

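A minimal sketch of the idea, assuming a forward-only, GPipe-style schedule: the model's layers are split into stages (one per device), the batch is split into micro-batches, and at clock tick t stage s works on micro-batch t - s. The function name and the toy stages below are hypothetical, and real frameworks also schedule the backward pass.

```python
def pipeline_forward(stages, microbatches):
    """Simulate a forward-only pipeline and record which stage ran what."""
    num_stages, num_micro = len(stages), len(microbatches)
    acts = list(microbatches)  # current activation of each micro-batch
    schedule = []
    for tick in range(num_micro + num_stages - 1):
        active = []
        for s in range(num_stages):
            m = tick - s  # micro-batch that reaches stage s at this tick
            if 0 <= m < num_micro:
                acts[m] = stages[s](acts[m])
                active.append((s, m))
        schedule.append(active)
    return acts, schedule


# Three toy "layers", each imagined to live on its own device.
stages = [lambda x: x + 1, lambda x: x * 2, lambda x: x - 3]
outputs, schedule = pipeline_forward(stages, [0, 1, 2, 3])

for tick, active in enumerate(schedule):
    print(tick, active)
```

The schedule takes `num_micro + num_stages - 1` ticks; the ticks at the start and end where some stages sit idle are the pipeline "bubble", which shrinks in relative terms as the number of micro-batches grows.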

Tensor parallelism

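Assuming this section refers to tensor (intra-layer) parallelism, the core idea can be sketched with a column-wise split: the weight matrix of a layer is cut into column blocks, each device multiplies the same input by its own block, and the partial outputs are concatenated. The helper names below are hypothetical, and the pure-Python matrices stand in for device tensors.

```python
def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def split_columns(w, parts):
    """Split w into `parts` column blocks (assumes an even split)."""
    width = len(w[0]) // parts
    return [[row[p * width:(p + 1) * width] for row in w] for p in range(parts)]


def tensor_parallel_matmul(x, w, parts):
    """Each simulated device computes x @ w_block; results are concatenated."""
    partials = [matmul(x, block) for block in split_columns(w, parts)]
    # Row-wise concatenation stands in for an all-gather across devices.
    return [sum((p[i] for p in partials), []) for i in range(len(x))]


x = [[1, 2], [3, 4]]
w = [[1, 0, 2, 1], [0, 1, 1, 3]]

y_parallel = tensor_parallel_matmul(x, w, parts=2)
y_single = matmul(x, w)
print(y_parallel)
```

Because each column of the output depends on only one column of the weights, the split result is identical to the single-device product while each device stores only a fraction of the weights.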

Distributed framework: Megatron-LM

Distributed deep learning framework: Colossal-AI

Distributed deep learning framework: DeepSpeed

Fine-tuning with P-tuning

LF

Model inference acceleration

Model inference acceleration method: FastLLM

Model inference acceleration method: vLLM


Product forms and deployment scenarios for domain-specific large models

ChatGPT from the user's perspective

ChatGPT from the enterprise's perspective

Building blocks of an enterprise-grade ChatGPT

Choosing high-quality application scenarios

ChatGPT scenario design: industrial manufacturing

ChatGPT scenario design: copywriting

Source: blog.csdn.net/uncle_ll/article/details/132781105