MoDa community launches Mistral AI’s first open source MoE model Mixtral8x7B - Code World

MoDa community launches Mistral AI’s first open source MoE model Mixtral8x7B

News 2024-01-09 13:54:58 views: null

Mistral AI recently released the first open source MoE model Mixtral8x7B and announced its launch in the MoDa community.

Mixtral-8x7B is a mixed expert model (Mixtrue of Experts), consisting of 8 expert networks with 7 billion parameters. In terms of capabilities, Mixtral-8x7B supports 32k token context length and supports English, French, Italian, German and Spanish, has excellent code generation capabilities and can be fine-tuned to an instruction following model.

The model achieved a score of 8.3 on the MT-Bench evaluation, which is equivalent to GPT3.5.

WeChat screenshot_20231214092122.png

Mixtral-8x7B-v0.1 model:

https://www.modelscope.cn/models/AI-ModelScope/Mixtral-8x7B-v0.1/summary

Mixtral-8x7B-Instruct-v0.1 model:

https://www.modelscope.cn/models/AI-ModelScope/Mixtral-8x7B-Instruct-v0.1/summary

Mistral-7B-Instruct-v0.2 new model:

https://www.modelscope.cn/models/AI-ModelScope/Mistral-7B-Instruct-v0.2/summary

Guess you like

Origin blog.csdn.net/English0523/article/details/134993631

MoDa community launches Mistral AI’s first open source MoE model Mixtral8x7B

Comprehensive analysis of the first open source MoE large model Mixtral 8x7B: from principle analysis to code interpretation

The second wave of Alibaba Cloud Tongyi Qianwen open source! The large-scale visual language model Qwen-VL is launched on Moda Community

[ScienceAI Weekly] DeepMind’s latest research is published in Nature; my country’s first self-developed earth system model is open source; Google launches a health care model

Spring open source community's first domestic project successfully graduated

Microsoft launches small model Phi-2 with better performance than Llama 2/Mistral 7B

Comprehensive analysis of the first open source MoE large model Mixtral 8x7B: from principle analysis to code interpretation

8 major bottlenecks currently faced by the large model of the open source community

Mistral AI releases Mistral 7B, a model with 7.3 billion parameters

The world's first commercially available biomedical large model BioMedGPT-10B open source

Meta dropped another bomb on the open source community! Publish AI code generation SOTA large model Code Llama

Kaiyuan Daily | Open source front-end animation engine for middle school students; the world’s first Llama3 8B Chinese version open source model; Lenovo Computer may be out of business; Linus satirizes AI hype

Open the world's first open source application model OAM |. Cloud native ecology Weekly Vol 23

Miscellaneous - open source community: open source community

Tencent launches Crane, the first cloud-native cost optimization open source project in China

ONAP open source community

Mistral AI releases 7.3 billion parameter model, "crushing" Llama 2 13B

Commercially available! OpenBuddy, the world's first large Chinese language model based on the Falcon architecture, is open source!

The first open-source community open day of argot invites you to gather sparks for the era of digital intelligence

AI essential medical floor! Tencent excellent view of the industry's first open source 3D medical image data of a large pre-training model

ModaHub community open source AI Agent development framework and evaluation

Baidu "AI Contagion": the first open source pneumonia CT image analysis AI model, so diagnosis from minutes to seconds

China's first open source foundation is here!

AI Daily: Apple launches open source tools for artificial intelligence developers using Mac

Open Source: from community to commercial

Community sharing｜JumpServer open source bastion host has always been my first choice

The largest Llama open source community in China releases the first pre-trained Chinese version Llama2

[Dromara's new open source project] Mendmix joins the Dromara open source community

Imagination launches first edge AI course

[New open source project] PDF construction framework x-easypdf joined the Dromara open source community

Recommended

Ranking

SpringBoot-integrate redis

[Sword pointing to offer] Interview question 03: Repeated numbers in an array

Arrangement "Offer Penalty for prove safety" string

Browser prevent the automatic generation of fill and Echo have been saved account solutions

Work hard and never slacken——2022 Yinmai Information Year-End Summary

Install jdk7 on Linux system

App common dependency management tools

EduCoder-Web程序设计基础-html5— 给表单组件添加说明-第1关：label标签相关概念

Machine learning - clustering - density clustering algorithm notes

Ant's large model is exposed, AI+ finance enters the "big model" era

Daily

More

2024-04-30(36)

2024-04-29(5)

2024-04-28(12)

2024-04-27(29)

2024-04-26(22)

2024-04-25(32)

2024-04-24(30)

2024-04-23(30)

2024-04-22(5)

2024-04-21(0)