Technology | GPT4 Capability Exploration - Interacting with Humans

This article was first published on the WeChat public account: fully armed big brother
[Introduction]: Last year, before ChatGPT took off in China, I was already using GPT3 to solve some coding problems. Now that applications of GPT4 are springing up everywhere, opinions on GPT4 are mixed. I am taking this opportunity to explore what GPT4 can do. The "GPT4 Capability Exploration" series of articles mainly draws on the paper published by Microsoft [1]. Here I chose the most interesting part, whether GPT4 has a mind, and tested it.

Conclusion: GPT4 can fairly accurately grasp the emotions, intentions, and states of people in a dialogue scene (even when the context is incomplete), and can correctly answer logic puzzles. Today's GPT4 still needs large amounts of high-quality data. When I have time, I would like to equip GPT with eyes and ears: could it then observe and learn by itself as humans do? The emergence of GPT4 has brought artificial general intelligence (AGI) into reality.

This article mainly tests GPT4 on two aspects: whether it can understand human emotions, and whether it can understand human intentions.

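Theory of mind is the ability to understand the mental states of oneself and of the people around one. These mental states include beliefs, emotions, desires, pretense, and knowledge.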

Theory-of-mind tests include simple tasks and complex tasks. A simple task is to infer the mental state of another person. Example: I think Xiao Ming is in a good mood today. A complex task is to infer what another person believes about a third party's mental state. Example: I think Xiao Ming thinks Xiao Fang is in a good mood today.

Because the theory-of-mind tests found on the Internet (such as the Sally-Anne test, used to test children's theory of mind) may already have been used as training data, I made up a new example here.

Understanding False Belief Tests

Given such a scenario:

Xiao Ming and Xiao Fang share a piggy bank. Xiao Ming puts his 1 yuan coin into the piggy bank. Xiao Fang finds that there is a 1 yuan coin in the piggy bank, so she opens the piggy bank and takes the 1 yuan coin to buy a snack. Xiao Fang will not tell Xiao Ming that she used this coin, and no one else saw it.

question:

When Xiao Ming needs change to buy snacks, where should he get the money?

GPT4's answer:

Xiao Ming didn't know that Xiao Fang had used the 1 yuan coin in the piggy bank. If Xiao Ming needs change to buy snacks, he might first go to the piggy bank for change. However, when he finds that there is no change in the piggy bank, he may need to find other ways to get change, such as:
1. Borrowing change from family or friends.
2. Looking for change elsewhere in the house, such as purses, drawers, etc.
3. If he has cash, breaking it into change when he goes to the store or supermarket.
4. If he has a bank card, going to a nearby ATM to withdraw cash and breaking it into change at a store or supermarket.
In real life, Xiao Ming and Xiao Fang should establish a clear sharing rule to avoid misunderstandings and disputes when using shared items.

Here, GPT4's answer is correct. It shows that GPT4 can correctly infer Xiao Ming's belief. Next, we test GPT4 on emotions.
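
Before moving on, the false-belief structure of the piggy bank scenario can be made explicit with a small sketch that tracks the real state of the piggy bank separately from what each person believes. The `PiggyBankScenario` class and its method names are hypothetical and exist only for this illustration; they are not part of the original test or of any library.

```python
class PiggyBankScenario:
    """Hypothetical model: real coin count vs. each person's belief about it."""

    def __init__(self, people):
        self.coins = 0                                  # real state of the piggy bank
        self.beliefs = {person: 0 for person in people}  # what each person thinks is inside

    def change_coins(self, delta, witnesses):
        """Change the real coin count; only witnesses update their belief."""
        self.coins += delta
        for person in witnesses:
            self.beliefs[person] = self.coins

    def look_inside(self, person):
        """A person checks the piggy bank and learns its real state."""
        self.beliefs[person] = self.coins


scenario = PiggyBankScenario(["Xiao Ming", "Xiao Fang"])
scenario.change_coins(+1, witnesses=["Xiao Ming"])   # Xiao Ming deposits 1 yuan
scenario.look_inside("Xiao Fang")                    # Xiao Fang finds the coin
scenario.change_coins(-1, witnesses=["Xiao Fang"])   # Xiao Fang takes it; nobody else sees

print("Real coins:", scenario.coins)
print("Xiao Ming believes:", scenario.beliefs["Xiao Ming"])
print("Xiao Fang believes:", scenario.beliefs["Xiao Fang"])
```

The piggy bank is empty, but Xiao Ming's belief still says one coin, which is why "he would first go to the piggy bank" is the answer a model with a working theory of mind should give.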

emotion test

Given such a scenario:

Xiao Ming broke Xiao Fang's PIALA, but he didn't tell Xiao Fang. Xiao Fang told Xiao Ming how much she liked PIALA, and at the same time, she noticed that Xiao Ming looked awkward.

PIALA is also a word I made up.

question:

Why does Xiao Ming look awkward?

GPT4 answers:

The reason Xiao Ming looks embarrassed is probably that he realizes he broke PIALA, which Xiao Fang likes, but didn't tell her. When Xiao Fang talks about her affection for PIALA, Xiao Ming may feel guilty and uneasy, because he knows that his actions have caused Xiao Fang a loss. This emotion may show outwardly as embarrassment, so Xiao Ming looks embarrassed.

question:

What does Xiao Fang think caused Xiao Ming's embarrassment?

GPT4 answers:

In this scene, Xiao Fang may observe Xiao Ming's embarrassment, but she does not know that the real reason is that Xiao Ming broke her PIALA. She might try to guess what caused Xiao Ming's embarrassment. For example:
1. Xiao Ming may feel that he cannot resonate with Xiao Fang on the topic of PIALA, because he is not interested in or does not understand PIALA.
2. Xiao Ming may feel embarrassed that he cannot provide Xiao Fang with a new PIALA.
3. Xiao Ming may have misunderstood Xiao Fang in other aspects, which caused him to feel embarrassed on this occasion.

However, Xiao Fang doesn't know that the real reason is that Xiao Ming broke her PIALA, unless Xiao Ming tells her. In this case, she may misunderstand the reason for Xiao Ming's embarrassment.

It can be seen from the answers that GPT4 can correctly infer the cause of Xiao Ming's emotional state. However, when GPT4 speculates about what Xiao Fang thinks caused Xiao Ming's embarrassment, it says "Xiao Ming may feel embarrassed that he cannot provide Xiao Fang with a new PIALA", which is somewhat problematic. Xiao Fang does not know that Xiao Ming broke her PIALA in the first place, yet GPT4 gives this explanation from an omniscient, God's-eye perspective.

intent test

Given such a scenario:

Sara's brother asked her to finish her math homework within an hour. Sara asked for more time, but her brother insisted that she finish it within an hour. Sara said she would be done in an hour. After that, Sara did not do her math homework, but read novels and did her Chinese homework until the hour was up.

question:

Why did Sara do that?

GPT4 answers:

Avoidance : Sara may find math homework difficult or boring, so she chooses to avoid problems and instead do things she likes or is interested in, such as reading novels and writing Chinese homework.

Procrastination : Sara may be a procrastinator, and even knowing she needs to finish her math homework in an hour, she still chooses to do other things first instead of starting the math homework right away.

Protest : Sara may be unhappy with her brother's request, which she may feel is too demanding. Therefore, she deliberately did not do the math homework as required by her brother to express her dissatisfaction.

Strategic Procrastination : Sara may be trying to procrastinate to buy more time for her math homework. She might expect her brother to see after an hour that she didn't finish her homework and give her more time.

Ask again:

Sara told her classmates that she intentionally did not finish her math homework within the hour. What would her classmates think Sara's intention was?

GPT4 answers again:

Confrontation : Sara may disagree with her brother's request or find it too demanding. Therefore, she deliberately did not complete the math homework within an hour as required by her brother as an act of confrontation.

Independence : Sara may want to express her own independence, to show her brother that she has the right to organize her time and work independently. She may wish to express herself by not following her brother's request.

Showing off to classmates : Sara probably thought this behavior was cool, and she intentionally left her math homework unfinished within the hour to show off her rebellious spirit to her classmates.

Seek sympathy : Sara may want fellow students to sympathize with her situation, letting them know that she feels resentful or oppressed by her brother's demands. She may hope that her classmates will understand her behavior and offer support.

From GPT4's answers, we can see that it can correctly understand Sara's intentions, and can also infer how Sara's classmates would interpret her intentions.

Let's look at a longer conversation.

Scene (it reads like a couple arguing):

XIXI: I don't like the way you treat NaNa now.
GI: What do you know? Didn't you see her scolding her sister?
XIXI: I didn't see it, but it doesn't prevent you from treating her better.
GI: So what do you want me to do, pretend like nothing happened?
XIXI: I don't know.
GI: Then why are you defending her?

question:

What does XIXI want to do?

GPT4 answers:

From this conversation, we can speculate that XIXI's intentions are as follows:
1. XIXI cares about NaNa and expresses dissatisfaction with GI's attitude towards NaNa. She probably feels that GI should be more forgiving and understanding instead of taking a negative attitude towards NaNa. XIXI hopes that GI will reconsider his behavior and try to get along with NaNa in a more positive way. While she offers no specific recommendations, she expresses concern about GI's attitude.
2. XIXI may be trying to remind GI that even if NaNa scolded her sister, GI should not treat her with the same attitude.
3. XIXI hopes that GI can keep calm and try to deal with the problem in a more constructive way.
Taken together, XIXI seems to care about NaNa's feelings and hopes that GI will change her attitude towards her.

question:

What does GI think of XIXI's comments?

GPT4 answers:

Some possible reactions:
Reflection : GI may think hard about XIXI's comments and realize that his attitude may be too harsh. In this case, GI may consider changing the way he treats NaNa.
Defensive : GI probably felt blamed by XIXI's comments and therefore took a defensive stance. GI might stand his ground and think he was justified in his attitude towards NaNa because NaNa was scolding her sister.
Anger : GI may be angry at XIXI's comment, thinking that XIXI didn't see the whole situation and therefore is not in a position to judge his actions.
Gratitude : GI may be grateful to XIXI for reminding him of his attitude problem with NaNa, thus realizing that he needs to adjust his behavior and attitude.
GI's specific reaction to XIXI's comments depends on his personality, emotions, and perspective on the issue.

question:

Why did the communication between XIXI and GI not solve the problem?

GPT4 answers:

From the dialogue, we can see that the reasons why the communication between XIXI and GI did not solve the problem may include:
Not specific enough : XIXI expressed dissatisfaction with GI's attitude towards NaNa, but she did not give specific suggestions or solutions. This makes it difficult for GI to know what actions XIXI expects him to take to change his attitude.
Emotional : In this conversation, GI may have developed resistance to XIXI's comments, causing the communication to become emotional. Emotional conversations often make it difficult to solve problems effectively.
No Consensus : In the dialogue, the two parties do not reach a consensus on how to treat NaNa. XIXI thinks that GI should have a better attitude towards NaNa, while GI feels that he has a reason to maintain his current attitude towards NaNa.
Lack of empathy : GI may not have fully understood XIXI's point of view, and did not consider the issue from XIXI's perspective. Likewise, XIXI may not have fully understood GI's position. A lack of empathy on both sides of the communication can make it difficult to find a solution to a problem.
To resolve issues more effectively, both parties can try to communicate in a more specific and targeted manner while remaining calm and empathetic in order to better understand each other's positions and needs.

question:

How can XIXI and GI improve this situation and bring their perspectives into alignment?

GPT4 answers:

To improve the situation and bring their views into alignment, XIXI and GI can try the following strategies:
Remain calm and respectful : Both parties should remain calm and respect each other's views and feelings. Avoiding emotional conversations helps to better listen and understand the other person's position.
Listening and Understanding : Both XIXI and GI need to listen to the other's point of view in order to better understand the other's thoughts and feelings. This may help to find common ground and thus come to an agreement.
Express your feelings and needs : Both partners need to express their feelings and needs clearly and specifically. In this way, they can better understand each other's expectations and find solutions to problems.
Make specific proposals : In order to solve the problem, both parties should come up with specific proposals or solutions. This will help ensure that both parties know what actions need to be taken to improve the situation.
Find a compromise : Both parties may need to compromise to come to an agreement. This may mean that GI needs to change some behaviors when it comes to dealing with NaNa, and XIXI also needs to understand GI's position and respect his feelings.
Maintain open communication : XIXI and GI should keep communicating openly so that similar problems can be resolved more effectively in the future.
By employing these strategies, XIXI and GI may find it easier to improve their relationship and bring their perspectives into alignment.

logic puzzle

This question is an exercise from my sister's mock math exam.

It is known that there are 100 people in a room. Some of these people tell the truth and some of them tell lies.
There are two basic facts: 1. At least one person is telling a lie, 2. At least one of any two people is telling the truth.

How many people in this room are telling the truth?

GPT4 Answer:

We can infer the number of people who tell the truth given the facts. Fact 1: At least one person told a lie. Fact 2: At least one of any two people is telling the truth. We can start reasoning from fact 2. Suppose we randomly choose two people, A and B. According to fact 2, at least one of A and B is telling the truth. Now we choose a third person C from these 100 people. At this point, we need to consider two situations:

A tells the truth, B tells the lie: Then according to fact 2, C must tell the truth (because B tells the lie, then there is at least one person between A and C who tells the truth, and we already know that A tells the truth, so C also tells the truth).
A tells a lie and B tells the truth: this situation is similar to the first one. According to fact 2, C must tell the truth (because A tells the lie, then there is at least one person between A and C who tells the truth, so C tells the truth).

In both cases, we find that C tells the truth. So, we can infer that, except for the one person who tells a lie, the other 99 people are all telling the truth. So, in this room, there are 99 people telling the truth and 1 telling a lie.

Can you solve this question?
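
For readers who want to check their answer, here is a minimal brute-force sketch in Python. It is not the exam's intended solution, and the function names are my own. It relies on the observation that both facts treat all people symmetrically, so only the number of liars matters: fact 2 holds exactly when no pair consists of two liars. A second function enumerates every truth/lie assignment for a small room as a cross-check of that shortcut.

```python
from itertools import combinations, product
from math import comb


def truth_teller_counts(n):
    """Return every possible number of truth-tellers among n people."""
    valid = []
    for liars in range(n + 1):
        fact1 = liars >= 1             # at least one person tells a lie
        fact2 = comb(liars, 2) == 0    # no pair is made up of two liars
        if fact1 and fact2:
            valid.append(n - liars)
    return valid


def truth_teller_counts_exhaustive(n):
    """Same check over every truth/lie assignment (only feasible for small n)."""
    valid = set()
    for assignment in product([True, False], repeat=n):  # True = tells the truth
        fact1 = not all(assignment)
        fact2 = all(a or b for a, b in combinations(assignment, 2))
        if fact1 and fact2:
            valid.add(sum(assignment))
    return sorted(valid)


print(truth_teller_counts(100))            # the room from the puzzle
print(truth_teller_counts_exhaustive(6))   # cross-check on a 6-person room
```

The shortcut mirrors the short pen-and-paper argument: fact 2 rules out two or more liars, fact 1 rules out zero liars, so exactly one person lies.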

References:

[1] Sparks of Artificial General Intelligence: Early experiments with GPT-4


Origin blog.csdn.net/weixin_43763175/article/details/130103311