Explode! OpenAI actually merged AI painting and ChatGPT! Netizen: MidJourney subscription, cancellation

 Datawhale dry information 

Latest: DALL·E 3 , Editor: Qubits

It’s amazing, OpenAI actually merged AI painting and ChatGPT !

No, the newly released DALL·E 3 directly brought two major shocks to AIGC——

  • The prompt word threshold is greatly reduced

  • Detailed descriptions that understand the nuances of semantic meaning are incredibly accurate.

d05772e1a2533a81cfc4cb8de6b47d3b.png

That's right, the new version of DALL·E 3 not only eliminates the need for prompt word engineering, but also improves language understanding capabilities to a new level!

Just imagine it. For words , ChatGPT helps you expand; for drawings , DALL·E 3 gives you precise details.

AI can be like filming a movie, ensuring that every detail from the background to the characters is reproduced verbatim:

Under the full moon , the streets are bustling with pedestrians enjoying the bustling nightlife.

At a corner stall, a young woman with fiery red hair and a signature velvet cloak was haggling with a grumpy old vendor.

The grumpy hawker, tall and sophisticated, wearing a neat suit and a striking mustache , was chatting animatedly on his steampunk phone .

92720905bc44235f459d9e8c91d03734.png

In addition to basic details, DALL·E 3 can even interpret vague adjectives such as prosperous, bargaining, and grumpy to life, which is no longer on the same level as CLIP.

0deadd6e33ad864de536e65114311f9b.png

At the same time, compared to the previous generation of old models, DALL·E 3’s own painting skills have also taken a big step forward:

6039837673e1e5ac54d39cce4f18bb19.png

Such an operation immediately stunned netizens.

Some netizens have decided to cancel their Midjourney subscription. "If Midjourney can't understand the text accurately, it's not even a competition."

9e8085e0fb6dee844423c7630516166a.png
7de8314ccdd9534b315e27d5c576364c.png

Some netizens ridiculed that this is simply putting pressure on the upcoming Google Gemini.

f5c38d5387ac1de355043b7bad9dd5c3.png

For more details, let’s take a look at the effects displayed by DALL·E 3 one by one.

Use it directly in ChatGPT

Compared with the previous two generations, the biggest advantage of DALL·E 3 is that it is natively built on ChatGPT .

Not only does it mean a huge leap in language understanding, but even prompt words can be written by ChatGPT itself .

More details are hidden in the promotional video where Ultraman can’t help but boast about his cuteness.

e8f4aa26097820d270921fc684d7a728.png

This is the story of a parent who turned a five-year-old's fantasy into reality .

First, parents asked ChatGPT, "My 5-year-old baby has been talking about a 'super sunflower hedgehog'. What should it look like?"

You can see that ChatGPT has written four prompt words in different styles at the same time and given the corresponding images.

776065cc4143d8c1cd9eec07ef0ce7bb.gif

After parents choose one of the illustrations with a fairy tale style, the image of the protagonist of the story, the little hedgehog, seems to be fixed , and they can continue to ask ChatGPT to draw more.

By the way, give the little hedgehog a name Larry, so that you don't have to say "Super Sunflower Hedgehog" every time in subsequent conversations.

9932073a4922369b47241c63ad9a2c40.gif

The protagonist has been decided, and then more elements will be added to make the entire fairy tale richer, such as drawing a house for Larry.

This not only demonstrates the ability of DALL·E 3 to create a consistent image , but also shows that LARRY's name is correctly written on the mailbox, which solves the problem of the previous version of DALL·E not being able to write .

5f069fd69a8001a24f89cd257e65e65b.gif

Since we already use ChatGPT, why not improve the storyline?

3ed4aadab61ebc849f113f165a85678b.gif

The plot you just compiled will have matching illustrations available immediately.

52f46e18ce64155b6e7c3c3eb1708aa9.gif

Keep the character image and move to a completely different sticker style without any problem, you can print it out directly.

257c03692f7eba72a210efa6ba7ecf30.gif

Pay attention, here comes the most amazing part, just let ChatGPT summarize all the content in the previous conversation and write it into a complete bedtime story.

0b70bed162cf36740eae93364078e015.gif

Although the demonstration ends here, it is completely conceivable that with the ChatGPT plug-in function, an e-book can be directly generated.

9c4d2bff500a7b0ed200c00e9642f40f.gif

It reminds me of the previous suggestion by netizens that the best way to ensure the safety of AI is to have employees of OpenAI, Anthropic and other companies have children.

This way they have an incentive to make sure the world is safe when AGI arrives. (Manual dog head)

a982b1af9656f6244d2d54aab60c8398.png

Although DALL·E 3 cannot be played immediately, you can still take a look at the large number of samples released at once.

d69664e51286b2fed28d95d0d227638b.png

When you click on each card, you can also see prompt words, which are described directly in human words without adding complicated spells.

b3e266f33970230bdcf7944f00dd54dd.png

The combination of complex scenes and non-existent concepts has a stunning effect.

d8e329de976daa47ff33027da52e34f9.png

When making interior design concept drawings, the relationship between light and shadow cannot be faulted at first glance.

79d6bf0de4d01cc4e847d6e3027179d7.png

Coupled with the ability to write correctly, it is also very productive to post a poster directly (there are still some problems with the small fonts where the text is not specified).

367ba4bcda524abbebf6e257fc3782e7.png

Noam Brown, the father of poker AI who just joined OpenAI, also posted pictures of his trial robot playing cards.

5f499bacf0c070aadb713e55f2e79190.png

CEO Altman’s favorite picture is “Avocado Seeing a Doctor.”

71432bc78cbc2d3da82234ae03e8952d.png

37cce653cc953f236c028547a3c4e43d.png

Some netizens tried the effect of using the same prompt words on DALL·E 2. They could only say that the words were wrong, the hole in the middle of the avocado had no words, and the treatment was even worse...

ae5acce7ce2653f6cd17fc5dbe5c0aa2.png

Do you still remember when DALL·E 1 was first released in January 2021? The place where the dream begins is a set of "avocado sofas".

No wonder netizens lamented: Look how far it has come!

9fb0a297425fcd29d4a9bf8bfe44fde9.png

"If there is any infringement, please delete the picture."

Of course, in addition to the above features, OpenAI also previewed some magical new features.

For example, DALL·E 3 will soon be equipped with an image discriminator .

This classifier can help identify whether an image was generated by DALL·E 3, not only to avoid accidental injuries (manual dog head), but also to quickly claim it as your own when DALL·E creates a good work.

c331670223afc946ba8360c8a30b0e98.png

As for generating images, OpenAI said it has done a lot of work to prevent it from generating violent, pornographic or other harmful images, or images with the names of public figures (stars, celebrities, etc.).

Regarding privacy, the New York Times previously broke the news that OpenAI is using certain technologies to blur faces in images uploaded to ChatGPT.

This is also to prevent ChatGPT from becoming a complete "face recognition tool", especially for celebrities whose photos have been circulated on the Internet.

Now this technology may also be used in DALL·E 3 to prevent the generation of infringing images .

At the same time, OpenAI is also working with the security red team to improve its image risk assessment capabilities.

In addition, in terms of training data, OpenAI also learned to protect itself this time with Midjourney’s “lessons learned from the past”.

Rather than going to court directly with the artist or waiting to be sued, OpenAI released a training data “disclaimer” on its official website :

You can disable our web crawler GPTBot from accessing your website by filling out the form. Alternatively, you can send an image that you want to keep private and we will remove it from the training data.

fd8ba3c911b303b082181c7ec16ad42a.png

However, some netizens are not satisfied with the DALL·E 3 demonstration effect, thinking that it is not as good as the pictures produced by senior MidJourney players. OpenAI's funds are many times abundant.

1450d1f682aef4c0cb4573a78565793e.png

Some netizens turned on Leeuwenhoek mode and began to pick out the details of the prompt words missed in the demonstration picture one by one.

For example, this cup is missing miniature lightning bolts.

472f118a7299cfdcce448aef3b0f1047.png

There are only cannon wreckage scattered on the seabed here, but no treasure.

0e8a5b556b67c823bc94f762e6da426d.png

Whether these problems can be improved by adjusting the prompt words will not be known until you actually play the game.

So when will DALL·E 3 be launched? Highlights:

  • ChatGPT Pro membership ($20/month) and Enterprise edition available in October .

  • A standalone version will be available later this fall (currently $15 for 115 plays).

Reference links:
[1]https://openai.com/dall-e-3
[2]https://www.nytimes.com/2023/07/18/technology/openai-chatgpt-facial-recognition.html
[ 3]https://twitter.com/sama/status/1704547625482203560

741007d070fb2870c73fadb02aa2ea53.png

Good stuff to learn, like three times in a row

Guess you like

Origin blog.csdn.net/Datawhale/article/details/133152987