摘自《Data Mining - Concepts and Techniques》

摘自《Data Mining - Concepts and Techniques》

The most popular data model for a data warehouse is a multidimensional model.Such a model can exist in the form of a star schema, a snowflake schema, or a fact constellation

schema. Let’s look at each of these schema types.

Star schema: The most common modeling paradigm is the star schema, in which the data warehouse contains (1) a large central table (fact table) containing the bulk of the data, with no redundancy, and (2) a set of smaller attendant tables (dimension tables), one for each dimension. The schema graph resembles a starburst, with the dimension tables displayed in a radial pattern around the central fact table.

Snowflake schema: The snowflake schema is a variant of the star schema model, where some dimension tables are normalized, thereby further splitting the data into additional tables. The resulting schema graph forms a shape similar to a snowflake.

Fact constellation: Sophisticated applications may require multiple fact tables to share dimension tables. This kind of schema can be viewed as a collection of stars, and hence is called a galaxy schema or a fact constellation.

猜你喜欢

转载自goaheadtw.iteye.com/blog/1734015