[Large model] Partial open source large model research and outline key information collation

 Keywords : large model, LLM, OpenAI, GPT, LLaMA...

Recently, I investigated the mainstream open source large models, and built an experimental environment, tried to finetuning on the domain data, sorted out part of the large model information in an outline style, and gradually sorted out the details of the paper and the experimental process later.

Research on generative large models (partial)

  • OpenAI
    • chatGPT
      • Main API: API name -- basic model
        • text-davince-003 -- GPT3(175B)
        • gpt-3.5-turbo-0301 -- GPT3.5, corresponding to ChatGPT
        • code-davinci-002 -- CodeX(12B)
    • GPT4 (Plus needs to be activated)
      • API: gpt-4/gpt-4-0314/gpt-4-32k/gpt-4-32k-0314
      • Base model: GPT4
  • open source model
  • GPT model in the medical field

Attached curtain mind map link  password: yhr6

Guess you like

Origin blog.csdn.net/iling5/article/details/130587148