Commercial use SPSS data analysis

SPSS is a very powerful data processing software, then how to analyze data using SPSS it?

 

1. What is the SPSS

SPSS is referred to as the social science statistics software package, its official full name IBM SPSS Statistics. By the SPSS Inc. SPSS package was originally launched in 1968 and in 2009 was acquired by IBM, used primarily for management and statistical analysis of data in all areas. As the world standard social science data analysis, SPSS operation user interface is extremely friendly, and the resulting output interface is also very beautiful, but also equipped with a very detailed user manual.

 

1.1   SPSS core functionality

 

 

1.2   Data editing function

Can, additions and deletions to the data processing such as data editing function by the SPSS, the data may be split, weighted, sorting, and other processing necessary polymerization.

 

1.3   visualization

SPSS has a powerful graphics functions, the model can be automatically output descriptive analysis chart reflects the intrinsic relationship between the different variables; also can customize the basic properties chart by the user, so that the data analysis more beautiful. Wherein the base comprises FIG bar, pie charts, pie charts, bar charts, box plots, a histogram, PP FIG, QQ, etc. FIG. And its interaction diagram more attractive, including different styles of interaction FIG. 2D bar, strip interaction diagram, FIG interaction box, scatter diagrams and 3D interactive FIG.

 

1.4   Table editing functions

Users can use the form SPSS draw different styles, and you can edit the table in the viewer, can also be edited in a special editing window.

 

1.5   connection to other software

SPSS can open multiple types of data files, including Excel, Access, DaBase, text editor, Lotus 1-2-3, etc., while users can also save the image into different image formats.

 

1.6   statistical functions

CDA data analysts believe SPSS statistical functions for data analysis should focus on master module, this feature most of the mathematical statistical model analysis can be completed, including: regression analysis, contingency table analysis, cluster analysis, factor analysis, correlation correspondence analysis, time series analysis, discriminant analysis and the like.

 

2. How to analyze the data with SPSS

First, we must understand what the general flow of data analysis?

 

CDA Data Analyst will complete a data analysis project is divided into the following five processes:

 

 

 

2.1   data acquisition

There are three main external data acquisition mode, one is acquiring some domestic data on public sites such as the National Bureau of Statistics; one is to obtain data on the site by reptiles and other tools. Another is through internal corporate databases, SPSS has a rich database interface, you can easily read the data from the database.

 

2.2    Data storage

For the amount of data items, you can use excel to process data, but for the amount of data over a million items, use databases to store and management will be more efficient and convenient.

SPSS also has a data format, sav file their use as data storage. Users can save data after treatment for sav SPSS formats, but also can be very easy to convert sav files to other data formats.

 

2.3   Data Preprocessing

Data preprocessing, also known as data cleaning. In most cases, we get our hands on the format of the data is inconsistent, there are outliers, missing values ​​and other issues, and different project data preprocessing steps in different ways. CDA data analysts believe the data analysis, 80% of the work in processing the data, we can see the importance of data preprocessing in data analysis.

 

2.4   Modeling and Analysis

This phase must first know the structure of the data, combined with the project needs to select models.

 

Common data mining models are:

 

 

2.5   visual analysis

The final step is to write data analysis data analysis report, including general data visualization analysis.

 

After the Second, grasp the general flow of data analysis, SPSS as a tool to have to make the following segments according to the following procedure to complete a project and master:

 

 

 

END

Bi-mao wonderful classroom courses recommended:

1.Cloudera data analysis courses;

2.Spark和Hadoop开发员培训;

3.大数据机器学习之推荐系统;

4.Python数据分析与机器学习实战;

详情请关注我们公众号:碧茂大数据-课程产品-碧茂课堂

现在注册互动得海量学币,大量精品课程免费送!

Guess you like

Origin blog.csdn.net/ShuYunBIGDATA/article/details/91559289