Artificial Intelligence (2): Classification of Machine Learning Algorithms

According to the composition of the data set, machine learning algorithms can be divided into:

  • supervised learning
  • unsupervised learning
  • semi-supervised learning
  • reinforcement learning

1 Supervised learning

Definition: The input data is composed of input feature values ​​and target values.

  • The output of the function can be a continuous value (called regression),
  • Or the output is a finite number of discrete values ​​(called classification).

1.1 Regression problem

For example: predict housing prices, and fit a continuous curve based on the sample set.

 

1.2 Classification problem

For example: judging benign or malignant based on tumor characteristics, the result is "benign" or "malignant", which is discrete.

 

2 Unsupervised Learning

Definition: The input data is composed of input feature values, and there is no target value

  • The input data is not labeled and there are no definitive outcomes. The sample data category is unknown;
  • It is necessary to classify the sample set according to the similarity between samples.

Example:

Supervised and unsupervised algorithm comparison:

 

 

3 Semi-supervised learning

Definition: The training set contains both labeled sample data and unlabeled sample data.

Example:

Supervised learning training method:

 

Semi-supervised learning training method

 

4 Reinforcement Learning

definition:

The essence is the problem of make decisions, that is, making decisions automatically and making continuous decisions.

Example:

The child wants to walk, but before that, he needs to stand up first, and after standing up, he needs to maintain his balance. Then he has to take a leg first, whether it is the left leg or the right leg, and after taking a step, he needs to take the next step. step.

The child is the agent, who tries to manipulate the environment (the surface on which he walks) by taking actions (i.e. walking), and transitions from one state to another (i.e. each step he takes), when he completes the subtasks of the task (i.e. walking a few steps), the child was rewarded (given the chocolate to eat), and when he could not walk, the chocolate was not given.

It mainly contains five elements: agent, action, reward, environment, observation;

The goal of reinforcement learning is to obtain the most cumulative rewards. 

Comparing Supervised Learning and Reinforcement Learning

supervised learning reinforcement learning

feedback mapping

The output is the relationship between them, which can tell the algorithm what kind of input corresponds to what kind of output.

The output is the feedback reward function to the machine, which is used to judge whether the behavior is good or bad.

the feedback

time

Making a bad choice will immediately feed back to the algorithm.

There is a delay in the result feedback, and sometimes it may take many steps to know whether the previous choice of a certain step is good or bad.

enter

feature

The inputs are independent and identically distributed.

The input faced is always changing, and every time the algorithm makes an action, it affects the input of the next decision.

The concept of independent and identical distribution will be explained in the next chapter.

5 Summary

In

Out Purpose the case

supervised learning

tagged have feedback forecast result Cat and Dog Classification House Price Forecast

Unsupervised learning

no label no feedback discover underlying structures "Birds of a feather flock together"

Semi-Supervised Learning

some labeled, some unlabeled have feedback Reduce the difficulty of data labeling

reinforcement learning

Decision-making process and incentive system series of actions long-term profit maximization learn to play chess

Guess you like

Origin blog.csdn.net/u013938578/article/details/131356217