Use Excel for data analysis

(1) Sometimes, Excel can be used for simple data analysis, and it will also be used when writing data reports. For cases, use some of the functions of Excel to achieve the following goals.

(2) Now that you have a meal data, I want to know the following questions through excel:
  1. Which restaurant has the highest number of reviews in the country?
  2. Which city restaurant has the best taste per capita?
  3. Which type of restaurant evaluation Best?
  4. Among the restaurants with Sichuan cuisine, how many have the word "spicy" and how many have the word "ma"?
  5. How many restaurants are rated above 8.0 in terms of taste, environment, and service ? Which city do they account for the most
?
  6. In the Shanghai area, what are the top five hotels of various types?
  7. How many hotels are not rated?
  8. Divide the per capita price into 0~50, 50~100 , 100~ 150, 150~ 200, 200+, how many are there in each city? What is the proportion of it?
  9. Process a comprehensive evaluation coefficient from the evaluation, per capita, taste, environment, and service indicators, and calculate which ten stores are the best (open question).
  10. Do a descriptive analysis of Japanese cuisine in Shanghai (open questions)

(3) Data:
           

Link: Data download
Extraction code: 2ns3


(1) Question one

Problem: National Hotel is the highest number Comments Which?
Answer: use INDEX()and MATCH() data analysis

Step:
(1) find the maximum value of the number of reviews var
(2) through INDEX()to the coordinate positioning var in the "comments" column functions, through the MATCH() coordinates of its name lookup

Implementation process

(1) Find the value var with the highest number of reviews

=MAX(D:D)

(2) Find out which hotel has the highest number of reviews?

=INDEX(C:D,MATCH(21180,D:D,0),1)

In the end, the result is:

(2) Question two

Question:? Which city restaurants per capita tastes best
answer: use 数据透视表 for data analysis

Steps and implementation process:
(1) Select all tables, click "Insert", and click "Pivot Table".
           
(2) Create a pivot table
           
(3) Select row: city; select column: taste; select "value field settings": average
           

Finally, after sorting the data, the city with the best taste per capita is Shanghai, the highest is about: 7.95

(3) Question three

Question: Which type of catering has the best evaluation?
Answer: Using 数据透视表 data analysis

For question three, follow the same steps as question two.
Finally, sort the data, and the best type of food evaluation is hot pot, the highest is about: 80589
                 

(4) Question four

Question: Sichuan type of store, with the number of "hot" word, and how many with "Ma" word?
Answer: the use of COUNTIF()data analysis

Steps:
(1) the use of COUNTIF()statistics "Ma" the number of words in each name to appear in
(2) the use of COUNTIF()"hot" word count statistics for each store name appears in
(3) control the first line, click on the "Data", click on "Filter"; Select "Category" as "Sichuan cuisine"; finally, count the number of "Ma" in the "Spicy" column with the word "Spicy" and how many with the word "Ma"

Implementation process

(1) add one, using COUNTIF()statistics

=COUNTIF(C2,"*麻*")
=COUNTIF(C2,"*辣*")

(3) Counting the number of
           
results: Counting the number of
  "ma": 28
  Counting the number of "spicy": 44
  You can roughly understand the name of the restaurant whose type is Sichuan cuisine, and choose between "ma" and "spicy" at the same time, generally Like the word "spicy".

(5) Question 5

Question: taste, environment, services, three evaluated in more than 8.0 there are a few restaurants in which most of their share of the city?
Multi?

Answer: the use of “筛选”data analysis

Steps and realization process:
(1) Control the first line, click "Data", click "Filter"; select "Taste", "Environment", and "Service" to be above 8.0;
           
(2) Count the number of restaurants and copy again "City" uses 数据透视表 statistics and describes the information of the "city".
                 

Result:
  Statistics on the number of hotels: 457
  Beijing accounted for the most.

(6) Question 6

Question: in Shanghai, the top five of each type of hotel service?
Answer: use IF()and “筛选” data analysis

Steps and implementation process:
(1) Control the first line, click "Data", click "Filter"; select "City" as "Shanghai"; select "City", "Type", "Store Name", and "Service" indicators After that, assign values ​​to the new Sheet.
           

(2) First, the "service" is in descending order; then the "type" is in descending order (both are extended to the entire area).
           
You will find stores of the same type, and their services are sorted in descending order.

(3) Describe the "type" in increments of numbers, and use: IF()function.
           

(4) Next, filter the top five "<=5" in column "E".
           
           
Finally, I got the top five in various types of hotel services.

(7) Question 7

Question: How many restaurants have not been evaluated?
Answer: Directly operate on the data

Steps and implementation process:
Directly filter "reviews" as blanks, and then count the total number of types to be the total number of blanks. There are 1355 restaurants that have no reviews.

(8) Question 8

Question: Divide the per capita price into 0~50, 50~100, 100~150, 150~200, 200+. How many are there in each city? What is the proportion of it?
Solution: use “除”and “取整” and "数据透视表" data analysis

Steps and implementation process:
(1) the use “除”and “取整” the average price segment.
           

=E2/50
=INT(I2)+1

(2) Unify all the price_levels of ">=5" into 5, and use the "Pivot Table" to calculate the relationship between cities and prices.
           
Replace
           
                
(3) Assign the data to a new Sheet, and then calculate the proportion.
           

(9) Question 9

Question: Process a comprehensive evaluation coefficient from several indicators such as reviews, per capita, taste, environment, and service, and calculate which ten stores are the best (open question).
Solution: use log()and “线性加权” analyze

Step:
(1) null value and excluding outliers
(2) “口味” , “环境” , “服务” different weights after weight linear weighted, forming a new index.
(3) Using log()the function, will “点评” be reduced to the range of 1 to 5 (in base may be selected arbitrarily).
(4) the use of log()function, “人均” reduced to within the range of 1 to 5 (in base may be selected arbitrarily), average negative value. Because the lower the price, the more consumers will like it.
(5) Finally, all indicators are linearly weighted to form 综合评价系数 .

Realization process:
(1) Control the first row, click "Data", click "Filter"; remove empty values ​​and outliers; re-assign values ​​to a new Sheet.
(2) linear weighted
           
(3) (4) using log()the function
           

=LOG(D2,10)
=LOG(E2,5)

(5) Linear weighting and 综合评价系数 ranking.
           

Note: The weight of each indicator depends on demand.

(10) Question ten

Question: Japanese cuisine for the whole country, do a descriptive analysis (open question)
Solution: use “数据分析”for processing

Steps and realization process:
(1) Control the first row, click "Data", click "Filter"; select "Type" 日本料理; re-assign the value to the new Sheet.
           

(2) Click "Data", click "Analyze", click “数据分析”.
           
           
         
The above is a descriptive analysis of Japanese cuisine across the country.

Guess you like

Origin blog.csdn.net/qq_41709378/article/details/107119521