
Introduction:

Llama is a large language model (LLM) developed and open-sourced by Meta's (formerly Facebook's) artificial intelligence research team. It is available for commercial use and has had a profound impact on the field of artificial intelligence. Following the earlier Llama 2 models, which support a 4,096-token context window, Meta has released the better-performing Meta Llama 3 series, which includes an 8B (8 billion parameter) model and a 70B (70 billion parameter) model. Llama 3 70B performs on par with Gemini 1.5 Pro and outperforms Claude 3 Sonnet across the board, while the 400B+ model is expected to compete with Claude 3 Opus and the new GPT-4 Turbo.

On various benchmarks, the Llama 3 models have shown strong performance. In usefulness and safety evaluations they are comparable to popular closed-source models on the market, and in some respects surpass them. The release of the Meta Llama 3 series not only consolidates Meta's competitive position in large language models, but also gives researchers, developers, and enterprises powerful tools for advancing language understanding and generation.

Project address:

https://github.com/meta-llama/llama3

Differences between Llama 2 and Llama 3

Differences between Llama 3 and GPT-4

Metric	Llama 3	GPT-4
Model size	8B, 70B, 400B+	Not publicly disclosed
Architecture	Transformer	Transformer
Training objective	Next-token prediction	Next-token prediction
Training data	Publicly available online data	Publicly available and licensed data
Performance	State of the art (question answering, text summarization, machine translation, etc.)	State of the art (question answering, text summarization, machine translation, etc.)
Open source	Yes	No

Highlights of Llama 3

Open to everyone: By open-sourcing the lightweight versions of Llama 3, Meta makes cutting-edge AI technology widely accessible. Whether you are a developer, a researcher, or simply curious about AI, you can freely explore, build, and experiment. Llama 3 also provides an easy-to-use API for researchers and developers.
Large model scale: The largest Llama 3 model, 400B+, has more than 400 billion parameters.
Coming soon to various applications: Llama 3 already powers Meta AI, which you can try at https://www.meta.ai/

Running the Llama 3 model with Ollama on Windows

Visit https://ollama.com/download/windows and download the OllamaSetup.exe installer.

After installation, choose a model size that fits your computer's configuration: at least 8 GB of RAM is needed to run 7B models, and at least 16 GB to run 13B models.
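The memory guideline above can be turned into a small lookup. This is only a sketch: the two thresholds come from the guideline itself, while the helper name `largest_model_class` is hypothetical, and real requirements also depend on quantization and whatever else is running on the machine.

```python
# Thresholds taken from the guideline above: (minimum RAM in GB, model size class).
# Listed from largest to smallest so the first match is the biggest class that fits.
RAM_GUIDELINE_GB = [(16, "13B"), (8, "7B")]


def largest_model_class(ram_gb):
    """Return the largest model size class the guideline allows, or None."""
    for min_ram_gb, size_class in RAM_GUIDELINE_GB:
        if ram_gb >= min_ram_gb:
            return size_class
    return None  # below 8 GB the guideline does not cover any model
```

For example, a machine with 12 GB of RAM falls in the 7B class under this rule, which is the same class as the 8B Llama 3 model used below.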

Here I am running llama3:8b. It still has some problems with Chinese.

Model	Parameters	Size	Download
Llama 3	8B	4.7GB	`ollama run llama3`
Llama 3	70B	40GB	`ollama run llama3:70b`
Mistral	7B	4.1GB	`ollama run mistral`
Dolphin Phi	2.7B	1.6GB	`ollama run dolphin-phi`
Phi-2	2.7B	1.7GB	`ollama run phi`
Neural Chat	7B	4.1GB	`ollama run neural-chat`
Starling	7B	4.1GB	`ollama run starling-lm`
Code Llama	7B	3.8GB	`ollama run codellama`
Llama 2 Uncensored	7B	3.8GB	`ollama run llama2-uncensored`
Llama 2 13B	13B	7.3GB	`ollama run llama2:13b`
Llama 2 70B	70B	39GB	`ollama run llama2:70b`
Orca Mini	3B	1.9GB	`ollama run orca-mini`
LLaVA	7B	4.5GB	`ollama run llava`
Gemma	2B	1.4GB	`ollama run gemma:2b`
Gemma	7B	4.8GB	`ollama run gemma:7b`
Solar	10.7B	6.1GB	`ollama run solar`
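
Once a model from the table has been pulled, it can also be queried from a script instead of the interactive prompt. The sketch below assumes the Ollama server is running locally on its default port (11434) and uses its `/api/generate` endpoint with streaming turned off. The function names `build_request` and `ask` are hypothetical helpers, not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama's local server is listening on its default address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model="llama3"):
    """Build a non-streaming generate request; nothing is sent yet."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(prompt, model="llama3", timeout=300):
    """Send the prompt to the local server and return the generated text."""
    request = build_request(prompt, model)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        reply = json.loads(resp.read().decode("utf-8"))
    return reply["response"]  # the full completion when "stream" is False
```

With the server running, `ask("Why is the sky blue?")` should return the model's answer as a single string. Passing `model="llama3:70b"` selects the larger tag from the table.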

Using Hugging Face

Visit https://huggingface.co/chat/ and switch models.

Using Replicate

8B model: https://replicate.com/meta/meta-llama-3-8b

70B model: https://replicate.com/meta/meta-llama-3-70b
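
Besides the web pages above, Replicate models can be called over HTTP. The following is a rough sketch, not a verified recipe: it assumes the predictions endpoint path mirrors the 8B model URL listed above, that the API token is stored in an environment variable named `REPLICATE_API_TOKEN`, and that the model accepts a `prompt` input. The helper names are hypothetical, so check Replicate's API reference before relying on any of it.

```python
import json
import os
import urllib.request

# Assumption: the endpoint path mirrors the replicate.com model URL listed above.
API_URL = "https://api.replicate.com/v1/models/meta/meta-llama-3-8b/predictions"


def build_prediction_request(prompt, token):
    """Build the POST request that starts a prediction; nothing is sent here."""
    body = json.dumps({"input": {"prompt": prompt}}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def start_prediction(prompt):
    """Send the request and return the decoded JSON reply."""
    token = os.environ["REPLICATE_API_TOKEN"]  # assumed environment variable name
    request = build_prediction_request(prompt, token)
    with urllib.request.urlopen(request) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

The reply describes the prediction that was created. The generated text may not be available immediately, in which case the prediction has to be fetched again later.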

This article is reprinted from Heng Xiaopai, and the copyright belongs to the original author. Readers are encouraged to visit the original. To reprint this article, please contact the original author.

Running Llama 3 large-scale models in a local environment: a feasibility and practical guide