Must-have tools for learning big data analysis (worth bookmarking)

Put simply, big data analysis tools can be divided along two dimensions:

The first dimension: data storage layer - reporting layer - data analysis layer - presentation layer

The second dimension: personal (user) level - department level - enterprise level - BI level

1, the data storage layer

Data storage involves databases and database languages. You do not need to go very deep here, but you should at least understand how data is stored, its basic structure, and its data types. The SQL query language is essential, and the more proficient you are the better. A good starting point is the basic syntax of the common statements: select (query), update (modify), delete, and insert.
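The four basic statements mentioned above can be tried out without installing a database server. The following is a minimal sketch using Python's built-in sqlite3 module with an in-memory database; the table name `sales`, its columns, and the sample rows are made up purely for illustration.

```python
import sqlite3


def run_crud_demo():
    """Run insert, update, delete, and select against a throwaway database."""
    conn = sqlite3.connect(":memory:")  # in-memory database, nothing is written to disk
    cur = conn.cursor()

    # Hypothetical table: basic structure and data types of the stored data
    cur.execute(
        "CREATE TABLE sales ("
        "id INTEGER PRIMARY KEY, "
        "region TEXT NOT NULL, "
        "amount REAL NOT NULL)"
    )

    # INSERT: add rows
    cur.executemany(
        "INSERT INTO sales (region, amount) VALUES (?, ?)",
        [("north", 120.0), ("south", 80.5), ("east", 42.0)],
    )

    # UPDATE: modify an existing row
    cur.execute("UPDATE sales SET amount = ? WHERE region = ?", (100.0, "south"))

    # DELETE: remove a row
    cur.execute("DELETE FROM sales WHERE region = ?", ("east",))
    conn.commit()

    # SELECT: query what is left
    cur.execute("SELECT region, amount FROM sales ORDER BY region")
    rows = cur.fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for region, amount in run_crud_demo():
        print(region, amount)
```

The same statements carry over to MySQL, SQL Server, Oracle, and the other databases listed below with only minor dialect differences, so practicing on a small local database is a reasonable way to start.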

Access 2003, Access 2007, and so on: the most basic personal databases, often used by individuals or for simple data storage;

MySQL: a database for department-level or Internet applications. At this level you need to understand the database's table and key structure and be able to query it with SQL;

SQL Server 2005 or later: small and medium-sized enterprises, and some large ones, can adopt SQL Server. Beyond data storage, it also ships with data reporting, data analysis, and even data mining tools;

DB2 and Oracle: large, enterprise-class databases, essential for large enterprises with massive data storage needs. The large database vendors generally also provide very good data integration and application platforms;

BI level: strictly speaking this is not a database but an enterprise data warehouse (DW) built on top of the databases above. Data storage at the DW level is essentially a business intelligence platform that integrates data analysis, reporting, and presentation. Combining the data warehouse with BI products has been the trend in recent years.

2, the reporting layer

Data stored by an enterprise needs to be read and displayed, and reporting tools are the most commonly used tools for this, especially in China. Traditional reports solve the presentation problem. Among current domestic products, FanRuan's FineReport is considered one of the best in the industry. It approaches reports with a data analysis mindset, and thanks to its open interfaces, reporting features, and form-filling capabilities, it can handle both data input and output, covering early business intelligence functionality.

Tableau and FineBI can be placed either in the reporting layer or in the data presentation layer. Both are excellent products of recent years. I have used FineBI as visual data analysis software to run visual analyses and build reports from a database. By comparison, Tableau's visualization is better, but FineBI has another identity, business intelligence, so it is stronger at large-scale data processing.

3, the data analysis layer

There are actually many tools at this layer. The most commonly used is of course Excel; the ones I often use are statistical analysis and data mining tools;

Excel: the newer the version the better, that is for sure. Many people have mastered only about 5% of Excel's functions. Excel is very powerful and can handle nearly all statistical analysis, but as I often say, if you are capable of turning Excel into a statistical tool, you are better off learning specialized statistical software;

SPSS: the current version is 18, and the name has been changed to PASW Statistics. I started with version 3.0, programming analyses in a DOS environment. The evolution up to the current version shows SPSS (Statistical Package for the Social Sciences) shifting from a focus on medicine, chemistry, and similar fields toward an increasing emphasis on business analysis, and it has now become predictive analytics software;

SAS: SAS is actually more powerful than SPSS. SAS is a platform, with the EM (Enterprise Miner) mining module integrated into it. Relatively speaking SAS is harder to learn, but mastering it is more valuable: for discrete choice models, sampling problems, and orthogonal experimental design, for example, SAS is still relatively easy to use. In addition, there is plenty of publicly available SAS learning material, so studying it will pay off;

JMP: an analysis product in the SAS family;

XLSTAT: an Excel plug-in that can do most of the statistical analyses available in SPSS.

4, the presentation layer

The presentation layer is also known as data visualization. Almost every tool mentioned above offers some display capability, and Tableau and FineBI, mentioned earlier, both have visualization features. In recent years Excel's visualization has also been getting better and better, and with some plug-ins it works even better.

PPT (PowerPoint): office software used for writing data analysis reports;

XMind & Baidu Mind Map: for laying out workflows, supporting analytical thinking, and presenting the hierarchy of a data analysis;

Xcelsius: a dashboard-building and data visualization reporting tool. It can read databases directly, is modeled in Excel, and can publish to the web; its biggest feature is that it can produce dynamic reports inside PPT.

Finally, note that this classification is not meant to pigeonhole the software, only to illustrate how it is applied. Sometimes we analyze and report directly on the database; sometimes the report is the analysis, and sometimes the analysis is the presentation. And of course sometimes the presentation is also the analysis and the report, and the report is itself stored data!

Origin blog.51cto.com/14296550/2415765