[AIGC] 7, BLIP | Unifying understanding and generation to produce higher-quality text descriptions for images


Paper: BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Code: https://github.com/salesforce/BLIP

Online experience: https://huggingface.co/spaces/Salesforce/BLIP

Source: ICML 2022 | Salesforce Research

Time: 2022.02

Contributions:

  • A multimodal mixture of encoder-decoder model (MED) that jointly trains understanding and generation tasks is proposed; it can operate as a unimodal encoder, an image-grounded text encoder, or an image-grounded text decoder
  • The CapFilt method is proposed, which generates higher-quality text descriptions for images collected from the web, improving the reliability and descriptive richness of the dataset and bringing performance gains on downstream tasks


1. Background

Vision-Language Pre-training (VLP) has greatly improved performance on many vision-language tasks

But existing models have two main problems:

  • Model level: many existing methods use encoder-only or encoder-decoder models, but encoder-based models cannot be directly used for text generation tasks, and encoder-decoder models do not transfer well to image-text retrieval tasks.
  • Data level: existing methods such as CLIP, ALBEF, and SimVLM are trained on large numbers of image-text pairs collected from the web (scaling up the dataset improves performance, but the paper's study shows that noisy web text is suboptimal for vision-language learning)

BLIP, proposed in this paper, is a new VLP framework that provides a unified Language-Image Pre-training framework for both vision-language understanding and generation tasks. The main contributions are:

  • Multimodal mixture of Encoder-Decoder (MED): a model architecture that supports efficient multi-task pre-training and flexible transfer to downstream tasks

    MED can be used as a unimodal encoder, an image-grounded text encoder, or an image-grounded text decoder

    MED is pre-trained with three vision-language objectives: image-text contrastive learning, image-text matching, and language modeling

  • Captioning and Filtering (CapFilt): a dataset bootstrapping method for learning from noisy image-text pairs

    The pre-trained MED is fine-tuned into two modules: a captioner that generates captions for images and a filter that removes noisy captions

BLIP performs well on the following tasks:

  • Image-text retrieval
  • Image captioning
  • Visual question answering (VQA)

2. Method


2.1 Model structure

The image encoder uses a Transformer (ViT) structure: the image is divided into patches, which are linearly projected, combined with position embeddings and a [CLS] token, and then fed into the Transformer layers (a minimal sketch of this input pipeline is shown below).
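
Below is a minimal PyTorch sketch of that patchify-project-prepend pipeline. The dimensions and module names are illustrative (roughly ViT-B/16-like), not the exact BLIP implementation.

```python
import torch
import torch.nn as nn

class PatchEmbed(nn.Module):
    """Patchify an image, linearly project the patches, prepend [CLS], add position embeddings."""
    def __init__(self, img_size=224, patch_size=16, in_chans=3, embed_dim=768):
        super().__init__()
        self.num_patches = (img_size // patch_size) ** 2
        # A strided convolution patchifies and linearly projects in one step.
        self.proj = nn.Conv2d(in_chans, embed_dim, kernel_size=patch_size, stride=patch_size)
        self.cls_token = nn.Parameter(torch.zeros(1, 1, embed_dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, self.num_patches + 1, embed_dim))

    def forward(self, x):                                   # x: (B, 3, H, W)
        x = self.proj(x).flatten(2).transpose(1, 2)         # (B, N, D) patch embeddings
        cls = self.cls_token.expand(x.size(0), -1, -1)      # one [CLS] token per image
        return torch.cat([cls, x], dim=1) + self.pos_embed  # ready for the Transformer layers

tokens = PatchEmbed()(torch.randn(2, 3, 224, 224))          # -> (2, 197, 768)
```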

To train a unified model that handles both understanding and generation, MED is proposed: a multi-task model that can operate in any of the following three modes (see the sketch after this list):

  • Unimodal encoder: encodes images or text separately; the text encoder is similar to BERT
  • Image-grounded text encoder: visual information is injected by inserting a cross-attention layer between each self-attention layer and the FFN; an additional [Encode] token is appended to the text, and its output embedding represents the image-text pair
  • Image-grounded text decoder: the bidirectional self-attention layers of the image-grounded text encoder are replaced with causal self-attention layers; a [Decode] token marks the beginning of a sequence and an end-of-sequence token marks its end
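
The sketch below (illustrative, not the official implementation) shows how a single text Transformer block can serve all three roles: only the self-attention mask and the presence of the cross-attention branch change between modes. Layer norms are omitted for brevity, and `MEDBlock` is a hypothetical name.

```python
import torch
import torch.nn as nn

class MEDBlock(nn.Module):
    def __init__(self, dim=768, heads=12):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.ffn = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, text, image_feats=None, mode="unimodal"):
        L = text.size(1)
        # Decoder mode -> causal mask; both encoder modes -> bidirectional self-attention.
        causal = torch.triu(torch.ones(L, L, dtype=torch.bool), 1) if mode == "decoder" else None
        h, _ = self.self_attn(text, text, text, attn_mask=causal)
        text = text + h
        if mode in ("encoder", "decoder") and image_feats is not None:
            # Image-grounded modes inject visual information via cross-attention.
            h, _ = self.cross_attn(text, image_feats, image_feats)
            text = text + h
        return text + self.ffn(text)
```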


2.2 Pre-training Objectives

In the pre-training phase, the authors jointly optimize three objective functions: two understanding-based objectives and one generation-based objective.


1. Image-Text Contrastive Loss (ITC)

Used to train the unimodal encoders; the goal is for correctly matched image-text pairs to obtain more similar representations than mismatched pairs

The authors follow the ITC loss of ALBEF, which introduces a momentum encoder to generate features and uses soft labels produced by the momentum encoder as training targets, to account for potential positives hidden among the negative pairs (a minimal contrastive-loss sketch follows)
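
The following is a minimal in-batch image-text contrastive loss, written as a sketch: the real BLIP/ALBEF ITC additionally maintains a momentum encoder and mixes its softened similarities into the targets, while here plain one-hot targets are used for brevity.

```python
import torch
import torch.nn.functional as F

def itc_loss(image_feats, text_feats, temperature=0.07):
    # image_feats, text_feats: (B, D) projected [CLS] features of the unimodal encoders
    image_feats = F.normalize(image_feats, dim=-1)
    text_feats = F.normalize(text_feats, dim=-1)
    logits = image_feats @ text_feats.t() / temperature            # (B, B) similarity matrix
    targets = torch.arange(logits.size(0), device=logits.device)   # matched pairs on the diagonal
    loss_i2t = F.cross_entropy(logits, targets)                    # image -> text direction
    loss_t2i = F.cross_entropy(logits.t(), targets)                # text -> image direction
    return (loss_i2t + loss_t2i) / 2
```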

2. Image-Text Matching Loss (ITM)

Used to train the image-grounded text encoder; it learns a multimodal image-text representation that captures fine-grained alignment between vision and language

ITM is a binary classification task: the model uses an ITM head to predict whether an image-text pair is matched (positive) or unmatched (negative)

In addition, to obtain more informative negatives, the authors use hard negative mining: negative pairs with higher contrastive similarity within a batch are more likely to be selected for computing the loss (see the sketch below)
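
A simplified sketch of that hard-negative mining step is shown below: for each image, a non-matching text is sampled with probability proportional to its ITC similarity, so the binary ITM head sees pairs that are hard to tell apart. Names and details are illustrative.

```python
import torch

def sample_hard_negative_texts(sim_i2t):
    # sim_i2t: (B, B) image-to-text similarities from the ITC branch
    weights = sim_i2t.softmax(dim=1).clone()
    weights.fill_diagonal_(0)                        # exclude the true (positive) caption
    return torch.multinomial(weights, 1).squeeze(1)  # one hard negative text index per image

# The ITM head then classifies (image, positive text) as "match" and
# (image, hard negative text) as "no match" on top of the image-grounded
# text encoder's multimodal embedding.
```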

3. Language Modeling Loss (LM)

Used to train an image-grounded text decoder for generating a text description of a given image

The model is trained with a cross-entropy loss, maximizing the likelihood of the text in an autoregressive manner

In addition, label smoothing of 0.1 is applied when computing the loss

Compared with the MLM loss commonly used in VLP, the LM loss gives the model better generalization ability for converting visual information into coherent captions (a loss sketch follows)
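
A minimal sketch of this objective, assuming the decoder already produced next-token logits (names here are illustrative), is:

```python
import torch.nn.functional as F

def lm_loss(logits, target_ids, pad_id=0):
    # logits: (B, L, V) decoder outputs; target_ids: (B, L) next-token targets
    return F.cross_entropy(
        logits.view(-1, logits.size(-1)),
        target_ids.view(-1),
        ignore_index=pad_id,      # ignore padding positions
        label_smoothing=0.1,      # label smoothing value stated above
    )
```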

4. Others

To make pre-training more efficient, the text encoder and text decoder share all parameters except the self-attention (SA) layers. The two differ as described below, while their intermediate layers (CA and FFN) play similar roles, so sharing those parameters improves training efficiency (see the sketch after this list):

  • The encoder uses bi-directional self-attention to build representations of the current input tokens
  • The decoder uses causal self-attention to predict the next token
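
One simple way to express this sharing scheme in PyTorch (a sketch under the assumption that each block exposes `self_attn`, `cross_attn`, and `ffn` submodules, as in the hypothetical MEDBlock above) is to build fresh decoder blocks and tie everything except the self-attention:

```python
def build_decoder_sharing_all_but_sa(encoder_blocks, make_block):
    """Create decoder blocks whose CA and FFN weights are shared with the encoder."""
    decoder_blocks = []
    for enc in encoder_blocks:
        dec = make_block()               # fresh block: its own (causal) self-attention weights
        dec.cross_attn = enc.cross_attn  # tie cross-attention parameters with the encoder
        dec.ffn = enc.ffn                # tie feed-forward parameters with the encoder
        decoder_blocks.append(dec)
    return decoder_blocks

# e.g. decoder_blocks = build_decoder_sharing_all_but_sa(encoder_blocks, MEDBlock)
```

Assigning the same module object ties the parameters, so gradients from both the encoding and decoding objectives update the shared CA/FFN weights while each task keeps its own SA weights.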

2.3 CapFilt


Because high-quality human annotation is expensive, existing human-labeled image-text datasets are small

Many current VLP methods therefore use image and alt-text pairs collected directly from the Internet. Alt-text follows a fixed format and often does not describe the image well, so it guides model learning poorly and introduces noise.

So the authors propose Captioning and Filtering (CapFilt), a method for improving the quality of the text corpus. It consists of two modules, both initialized from the pre-trained MED and fine-tuned on COCO:

  • captioner: an image-grounded text decoder fine-tuned with the LM objective; it decodes one text for each image, i.e. generates a synthetic caption for the given image
  • filter: an image-grounded text encoder fine-tuned with the ITC and ITM objectives; it learns whether an image and a text match and removes noisy texts from both the web texts and the generated texts: if the ITM head predicts that a text does not match its image, the text is considered noisy and removed. In effect, the filter keeps, from the web text and the synthetic text, the text that matches the image
  • Used jointly, the captioner first generates a caption for each image, and the filter then selects the better descriptions to update the text paired with each image, improving the caption quality of the web dataset
  • Finally, the filtered image-text pairs are combined with the human-annotated pairs to form a new dataset for pre-training (a high-level sketch of this bootstrapping loop follows this list)
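
A high-level, pseudocode-style sketch of this bootstrapping loop is given below; `captioner`, `filter_model`, and the dataset structure are illustrative stand-ins rather than the actual BLIP code.

```python
def capfilt(web_pairs, human_pairs, captioner, filter_model):
    """Bootstrap a cleaner pre-training corpus from noisy web image-text pairs."""
    bootstrapped = list(human_pairs)                    # human annotations are kept as-is
    for image, web_text in web_pairs:
        synthetic_text = captioner.generate(image)      # captioner: image-grounded text decoder
        for text in (web_text, synthetic_text):
            if filter_model.is_match(image, text):      # filter: image-grounded encoder (ITM head)
                bootstrapped.append((image, text))      # keep only texts predicted to match
    return bootstrapped                                 # used to pre-train a new model
```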

If this is hard to follow, look at Figure 4 below:

  • $T_w$: web text taken directly from the Internet for the corresponding image; web texts vary in quality
  • $T_s$: synthetic text generated by the captioner for the image; synthetic texts also vary in quality
  • An image keeps only text that matches it, so the filter's job is to decide which of $T_w$ and $T_s$ to keep; green marks text retained by the filter, red marks text filtered out
  • That is, the filter keeps, from the web text and the synthetic text, whichever matches the image


3. Results

3.1 Training Details

Pre-training is done on two 16-GPU nodes

  • The image transformer is a ViT pre-trained on ImageNet; two variants are explored, ViT-B/16 and ViT-L/16 (unless otherwise stated, experiments use ViT-B)
  • The text transformer is BERT
  • Batch sizes of 2880 (ViT-B) and 2400 (ViT-L) are used for pre-training, for 20 epochs
  • AdamW optimizer, weight decay = 0.05
  • The image resolution is 224x224 during pre-training and 384x384 during fine-tuning

Datasets:

  • Pre-training uses a total of 14 million images
    • Includes two human-annotated datasets (COCO and Visual Genome)
    • Three network datasets (Conceptual Captions, Conceptual 12M, SBU captions)
  • There is also a noisier web dataset, LAION, containing 115 million images in total, which is used in some experiments

3.2 Effect of CapFilt


Table 1 compares the effect of applying CapFilt on downstream tasks such as image-text retrieval and image captioning, for models pre-trained on different datasets.

  • Using only the captioner or only the filter on the 14M dataset already brings some performance improvement
  • Using the captioner and filter together works even better

The role of CapFilt:

  • Performance can be further improved with larger datasets and larger backbone networks, which verifies that CapFilt scales along both the model and data dimensions
  • In addition, using the larger ViT-L for the captioner and filter also improves the performance of the base model

Figure 4 shows examples of web texts and synthetic texts (green indicates text accepted by the filter, red indicates text rejected), illustrating the captioner's ability to synthesize captions and the filter's ability to remove noisy text.


3.3 Diversity is Key for Synthetic Captions

In CapFilt, the authors use nucleus sampling to generate synthetic text descriptions

  • Nucleus sampling is a stochastic decoding method: each token is sampled from the smallest set of tokens whose cumulative probability exceeds a threshold p (e.g. 0.9). This introduces greater sample diversity and hence more information that benefits learning, so it works better (see the sketch after this list)
  • Beam search is a deterministic decoding method: the highest-scoring text is selected, which reduces diversity
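
The sketch below contrasts one nucleus (top-p) sampling step with a greedy argmax step standing in for deterministic decoding; it is illustrative, not the exact BLIP decoding code.

```python
import torch

def nucleus_sample_step(logits, p=0.9):
    """Sample the next token from the smallest set of tokens whose probability mass reaches p."""
    probs = torch.softmax(logits, dim=-1)
    sorted_probs, sorted_idx = probs.sort(descending=True)
    cumulative = sorted_probs.cumsum(dim=-1)
    keep = cumulative - sorted_probs < p               # nucleus: tokens needed to reach mass p
    kept = sorted_probs * keep                         # zero out the low-probability tail
    choice = torch.multinomial(kept / kept.sum(), 1)   # random draw -> more diverse captions
    return sorted_idx[choice]

def greedy_step(logits):
    return logits.argmax(dim=-1)                       # deterministic -> safer but less diverse
```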


3.4 Parameter sharing and decoupling

During pre-training, the text encoder and decoder share all parameters except the self-attention layers. Table 3 compares several different parameter-sharing strategies, with pre-training on the 14M dataset and the LAION web dataset.

The results show:

  • Sharing all parameters except the SA layers performs better than not sharing, and it also reduces the number of model parameters and speeds up training
  • If the SA layers are also shared, performance drops because of the conflict between the encoding and decoding tasks


In CapFilt, the captioner and filter are fine-tuned separately on COCO

As shown in Table 4, the authors also tested sharing parameters between the captioner and filter, which degrades performance on downstream tasks

This is likely because, with shared parameters, noisy captions generated by the captioner are less likely to be filtered out by the filter: the ratio of text flagged as noisy drops from 25% to 8%.


3.5 Comparison with SOTA

1. Image-Text Retrieval

The authors evaluate image-to-text retrieval (TR) and text-to-image retrieval (IR) on the COCO and Flickr30K datasets

The pre-trained model is fine-tuned with the ITC and ITM losses

To improve efficiency, k candidates are first selected based on the image-text feature similarity, and these candidates are then re-ranked by their ITM scores (a sketch of this two-stage scheme follows)
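
A sketch of this coarse-to-fine ranking (function names and the `itm_score` callback are illustrative) looks like:

```python
import torch

def retrieve_texts(image_feat, text_feats, itm_score, k=128):
    # image_feat: (D,) unimodal image feature; text_feats: (N, D) unimodal text features
    sims = text_feats @ image_feat                             # stage 1: cheap ITC-style similarity
    topk = sims.topk(k).indices                                # keep k candidate captions
    scores = torch.tensor([itm_score(int(i)) for i in topk])   # stage 2: ITM matching score
    return topk[scores.argsort(descending=True)]               # final ranking by ITM score
```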

As shown in Tables 5 and 6, BLIP achieves good results



2. Image Captioning

NoCaps and COCO are used as evaluation datasets

The authors add a prompt, "a picture of", at the beginning of each caption (see the example below)
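
As an illustration, prompted caption generation can be run through the Hugging Face transformers BLIP integration (the checkpoint name and API below are assumed from the public model hub; the paper's own code is at the GitHub link above):

```python
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")

image = Image.open("example.jpg").convert("RGB")                 # any local image
inputs = processor(image, "a picture of", return_tensors="pt")   # prompt used as the caption prefix
out = model.generate(**inputs, max_new_tokens=20)
print(processor.decode(out[0], skip_special_tokens=True))
```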

As shown in Table 7, BLIP trained with 14M data outperforms other methods

The effect of BLIP trained with 129M data is similar to that of LEMON trained with 200M data


3. Visual Question Answering (VQA)

VQA requires the model to answer a question about an input image


4. Natural Language Visual Reasoning (NLVR2)

5. Visual Dialog (VisDial)



Origin blog.csdn.net/jiaoyangwm/article/details/130036782