Data Mining Regression Analysis Chapter 5 Learning Exercises

Article directory

1. Multiple choice questions

1. A supermarket studied sales record data and found that people who buy beer are very likely to also buy diapers. What type of data mining problem does this belong to ( )

A. Association rule discovery;
B. Clustering;
C. Classification;
D. Nature Language processing;

Answer: Association rule discovery;

2. Confidence is an indicator of which measure of interest ( )

A. Simplicity;
B. Certainty;
C. Practicality;
D. Novelty Sex;

Answer: B

3. Among the following indicators, the one that can measure the strength of a rule and the independence between two sets at the same time is ().

A. Lift;
B. Leverage;
C. IS measurement;
D. Conviction degree;

Answer: D

4. Among the following data mining functions, the one that is widely used in shopping basket analysis is ().

A. Correlation analysis;
B. Classification and prediction;
C. Cluster analysis;
D . Evolution Analysis;

Answer: A

5. Suppose X={1,2,3} is a frequent itemset, how many association rules can be generated from X ( ).

A. 4;
B. 5;
C. 6;
D. 7;

Answer: C

2. Multiple choice questions

6. The following table is a shopping basket. Assuming that the support threshold is 40%, which of them are frequent closed itemsets ( ).

Insert image description here
A. abc;
B. ad;
C. cd;
D. de;

Answer: AD

7. The following statement about non-frequent patterns is correct ( ).

A. Its support is less than the threshold;
B. They are all uninteresting;
C. Its support is greater than the threshold;
D. Sensitive to abnormal data items;

Answer: AD

3. Fill in the blanks

1.Aprior algorithm includes two basic steps: ____ and ____.

Answer: (1) [ "Connection" ]
(2) [ " Pruning" ]

2.If L2={ {a,b},{a,c},{a,d},{b,c},{b,d}}, then the C3=____ generated by the connection.

答案:{ {a,b,c},{a,b,d},{a,c,d},{b,c,d}}

3. Rules that satisfy both ____ and ____ are called strong association rules.

Answer: (1) Minimum support threshold
(2) Minimum confidence threshold

4. In the closed pattern mining algorithm, directly searching for closed frequent itemsets and pruning the results is the most common method. The pruning strategies include ____ and ____.

Answer: (1) [ "item merging " ]
(2) [ "Sub-item set pruning" ]

5. Patterns that appear frequently in a data set are called:____.

Answer: (1) Frequent pattern

6. Association rule mining in large databases includes two processes: finding all ____ and generating ____ from ____.

Answer: (1) Frequent itemsets
(2) Frequent itemsets
(3) Strong association rules

7. Calculate the support of {Bread (A) => Beer (E)}: ____ (retain one decimal place)

Insert image description here

Answer: (1) [ “0.2”, “0.20” ]

4. True or False Questions

1. The association rule mining process is to discover the rules represented by all item sets that satisfy the minimum support.

A. 对
B. 错

Answer: B. Wrong

2. The use of a priori principles can help reduce the number of candidates that need to be explored when generating frequent itemsets.

Answer: A. Right

3. The a priori principle can be expressed as: if an itemset is frequent, then all non-empty subsets containing it are also frequent.

Answer: A is correct

4. The greater the total confidence of two item sets, the closer the relationship between the two item sets, and vice versa.

Answer: A is correct

Guess you like

Origin blog.csdn.net/qq_51688022/article/details/134701765