GPT4 multimodal open source replacement project

Since the release of ChatGPT and Stable Diffusion, various related open source projects have blossomed, which is really overwhelming.

Today, I will focus on selecting a few high-quality open source projects, which will be of great help to our daily work, study and life.

Today I organize and share with you, I hope it will be helpful to you.

一、Visual ChatGPT

This is an open source project of Microsoft. In just over a week, it has gained 23.6k+ stars.

To briefly summarize it, it is a multimodal question answering system.

Supports AI drawing , language question and answer , and picture question and answer , and integrates the three recent hot spots of the AI ​​session.

Show results:

The system implementation framework is as follows:

System Realization Framework of Visual ChatGPT

This is an open source project that "works hard to make miracles", which integrates various research results: BLIP, CLIP, ChatGPT, pix2pix, inpainting, vqa, etc.

To put it bluntly, it is to teach you how to use these projects to build a multi-modal question answering system. This system architecture is of great reference value .

project address:

https://github.com/microsoft/visual-

Guess you like

Origin blog.csdn.net/qq_41771998/article/details/130240103