2023 Shenzhen Cup East Three Provinces Question A Full Nanny Tutorial Analysis on the Health of Urban Residents

Analysis of Factors Affecting the Health of Urban Residents in Question A 

The analysis of the health of urban residents in question A is mainly a questionnaire analysis topic. The overall analysis is not difficult. Basically, it can be done according to the video ideas of station B. The main difficulties are data sorting and cleaning and feature construction. Friends who need it can save it. one time.

background:

Chronic non-communicable diseases (hereinafter referred to as chronic diseases) represented by cardiovascular and cerebrovascular diseases, diabetes, malignant tumors and chronic obstructive pulmonary disease have become important issues affecting the health of Chinese residents. As people's lifestyles change, the prevalence of chronic diseases continues to rise. As we all know, health status is closely related to age, eating habits, physical activity, occupation and so on. How to achieve the purpose of promoting good health by reasonably arranging meals, moderate physical exercise, and practicing a healthy lifestyle is a common concern of the whole society. Attachment A1 is a survey questionnaire on "Chronic Non-communicable Diseases and Their Related Influencing Factors Epidemiology" conducted by a city's health research department for some residents. Attachment A2 is the corresponding survey data results. The eight guidelines proposed for a balanced resident diet in the revised Dietary Guidelines for Chinese Residents.
Ask your team to work on the following questions:

Question 1 Referring to Appendix A3, analyze the rationality of residents' eating habits in Appendix A2, and explain the main problems.

Idea:
build an indicator system to measure the rationality of residents’ eating habits, sort out the indicator system according to Annex A3, and then draw charts for each indicator for descriptive analysis, explaining that residents’ eating habits are consistent with Annex 3 "Chinese Residents Gaps in the Dietary Guidelines.
The difficulty here is to organize the data, and the analysis is not difficult

 


Question 2: Analyze whether the living habits and eating habits of residents are related to factors such as age, gender, marital status, education level, and occupation.

train of thought

Solution 1: Correlation analysis, first sort out the relevant variables of living habits indicators and eating habits indicators, and then conduct correlation analysis on age, gender, marital status, education level, occupation and other factors one by one, and then analyze the results of the previous correlation analysis Integrate to obtain the mean value of its correlation coefficient, and then determine whether there is a correlation between the overall and the above factors, and individually, which variables have low correlation or do not show correlation.
Solution 2: Logistic regression, first of all, you can sort out the variables related to the indicators of living habits and eating habits, and these variables are used as X, and then the age, gender, marital status, education level, occupation and other demographic factors are used as Y, such as Take gender as an example of Y, first analyze whether its F test is significant, if there is significance, then it shows that there is an impact relationship as a whole, then check the standardized regression coefficient of each item, and check the significant relationship on the individual;
solution 3: Machine learning + model interpretation (shap model), same as method 2, first check the indicators, then use machine learning to model a classification or regression model, and input the model into the shap model, so that each indicator can be determined from a non-linear perspective. Impact of Demographic Factors (Y)



Question 3 Based on the data in Appendix A2, deeply analyze the relationship and degree of common chronic diseases (such as hypertension, diabetes, etc.) with smoking, drinking, eating habits, living habits, nature of work, exercise and other factors.

train of thought

This question is the same as question 2, the only difference is to change Y, where Y is (0: no disease, 1: high blood pressure or diabetes), and then sort out these variables, it is recommended to ask question 2 You can use solution 3, and then apply the same solution as problem 2, so that the difficulty of solving problem 3 is reduced. If you want to show off your skills, you can use different machine learning for comparison.



Question 4 According to the specific conditions of the residents in Appendix A2, reasonably classify the residents, and put forward reasonable suggestions on healthy diet and exercise for various groups of people.

train of thought

The key core of this question is the direction of classification. From the point of view of the question, there are many types of classification, such as whether there is disease (high blood pressure or diabetes), or classification according to demographic characteristics, such as juvenile, youth, middle-aged, old , or obese groups, or eating habits, etc., so in fact, there are many ways to do this question, but they are inseparable. After the classification, we propose healthy diet, exercise and other aspects for various groups of people. It is a reasonable suggestion that this approach is the same analysis steps, this analysis can directly copy the analysis of the first question, but this time it is divided according to the population

Complete problem-solving ideas video and code acquisition can be seen at station B:

2023 2023 Northeast Three Provinces Shenzhen Cup Question A Nanny Solution Ideas and Code Analysis on the Health of Urban Residents_哔哩哔哩_bilibili

Guess you like

Origin blog.csdn.net/weixin_44099072/article/details/131928976