Shusheng·Puyu (InternLM): a full-pipeline open-source system for large models

Large models have become a hot topic

Large models have become an important path toward artificial general intelligence


The open-source history of the Shusheng·Puyu large models


Performance of the Shusheng·Puyu 20B open-source model

  • Leads open-source models of similar scale across the board (including Llama-33B, Llama2-13B, and the mainstream domestic 7B and 13B open-source models)
  • Reaches Llama2-70B-level performance with less than one-third of the parameters

From model to application


The Shusheng·Puyu full-pipeline open-source system


Data

Total data volume: 2 TB, including:

  • Text data: 5 billion documents
  • Image-text data: 22 million files
  • Video data: more than 1,000 files

Pre-training

Fine-tuning

  • Incremental training
    • Vertical-domain knowledge: articles, books, code, etc.
  • Supervised fine-tuning
    • High-quality dialogue and question-answer data
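To make the difference between the two kinds of training data concrete, here is a minimal toy sketch. It assumes a common convention that the notes themselves do not spell out: incremental training learns from every token of raw text, while supervised fine-tuning usually learns only from the answer part of a dialogue. The helper names, the `<user>` / `<assistant>` / `<eos>` markers, and the whitespace "tokenizer" are all hypothetical and are not taken from any InternLM tool.

```python
# Toy illustration only: real pipelines use token IDs and a real tokenizer.
IGNORE_INDEX = -100  # assumed "do not compute loss here" marker


def build_pretrain_example(text):
    """Incremental training: raw text, every token is a training target."""
    tokens = text.split()
    return {"tokens": tokens, "labels": list(tokens)}


def build_sft_example(question, answer):
    """Supervised fine-tuning: only the answer tokens are training targets."""
    prompt = f"<user> {question} <assistant>".split()
    reply = f"{answer} <eos>".split()
    tokens = prompt + reply
    # Mask the prompt so the loss covers the reply only.
    labels = [IGNORE_INDEX] * len(prompt) + reply
    return {"tokens": tokens, "labels": labels}


if __name__ == "__main__":
    print(build_pretrain_example("Transformers use attention"))
    print(build_sft_example("What is 2+2?", "It is 4."))
```

In this sketch, the only thing that changes between the two modes is which positions carry a label; the model and the optimizer could stay the same.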

Evaluation

The OpenCompass evaluation system was proposed, covering 6 major dimensions with 80+ evaluation sets and 400,000+ evaluation questions.
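As a rough picture of how an evaluation organized by dimensions and evaluation sets might roll up into a report, the sketch below scores each set by exact-match accuracy and averages the sets inside each dimension. This is not the OpenCompass API: the function names, the dimension labels, and the toy datasets are all made up for illustration, and the real system may use different metrics and weighting.

```python
from collections import defaultdict


def accuracy(predictions, references):
    """Fraction of predictions that exactly match the reference answers."""
    correct = sum(p == r for p, r in zip(predictions, references))
    return correct / len(references)


def summarize(results):
    """Average per-dataset accuracy within each (hypothetical) dimension."""
    per_dimension = defaultdict(list)
    for item in results:
        score = accuracy(item["predictions"], item["references"])
        per_dimension[item["dimension"]].append(score)
    return {dim: sum(s) / len(s) for dim, s in per_dimension.items()}


# Made-up evaluation sets; names and answers are placeholders.
results = [
    {"dimension": "reasoning", "dataset": "toy_math",
     "predictions": ["4", "9", "7"], "references": ["4", "9", "8"]},
    {"dimension": "reasoning", "dataset": "toy_logic",
     "predictions": ["yes", "no"], "references": ["yes", "no"]},
    {"dimension": "knowledge", "dataset": "toy_trivia",
     "predictions": ["a", "b", "c", "d"], "references": ["a", "b", "c", "a"]},
]

report = summarize(results)

if __name__ == "__main__":
    for dimension, score in report.items():
        print(f"{dimension}: {score:.3f}")
```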

Deployment

Agent

Limitations of large language models on their own:

  • Access to the latest information and knowledge
  • Reliability of responses
  • Mathematical calculation
  • Tool use and interaction
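The mathematical-calculation and tool-use limitations hint at why an agent layer is useful: instead of guessing at arithmetic, the model can hand the expression to a tool. The sketch below is a deliberately tiny stand-in, not the API of any agent framework. `fake_model`, `TOOLS`, and `run_agent` are hypothetical names, and the "model" is a hard-coded rule rather than a real LLM.

```python
import ast
import operator

# Arithmetic operators the toy calculator tool is allowed to apply.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculator(expression):
    """Safely evaluate +, -, *, / on numbers by walking the parsed AST."""
    def _eval(node):
        if (isinstance(node, ast.Constant)
                and isinstance(node.value, (int, float))
                and not isinstance(node.value, bool)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -_eval(node.operand)
        raise ValueError("unsupported expression")
    return _eval(ast.parse(expression, mode="eval").body)


TOOLS = {"calculator": calculator}


def fake_model(question):
    """Stand-in for an LLM choosing an action (hard-coded rule, not a model)."""
    if " is " in question and any(ch.isdigit() for ch in question):
        expr = question.rstrip("?").split(" is ", 1)[-1].strip()
        return {"tool": "calculator", "input": expr}
    return {"tool": None, "input": question}


def run_agent(question):
    """Route the question to a tool when the 'model' asks for one."""
    action = fake_model(question)
    if action["tool"] is None:
        return "No tool selected."
    result = TOOLS[action["tool"]](action["input"])
    return f"{action['input']} = {result}"


if __name__ == "__main__":
    print(run_agent("What is 12 * (3 + 4)?"))
```

The design point is the separation of roles: the model decides which tool to call and with what input, and the tool produces the exact result.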


Origin blog.csdn.net/shengweiit/article/details/135372800