Zhuangshi Data Technology 07: Metadata

Insert picture description here

Hi, Minasang, I am Zhao Zhuangshi, everyone's strange girl!

It's another wonderful Saturday morning, today we will talk about-metadata!

Metadata is also a product position and concept that has only become popular in the past two years. Because it is biased towards the back, compared to cool terms such as data analysis, data BI, user stratification, and attribution, metadata is like a white flower. lotus.

Insert picture description here

01 What is metadata?

Well, why can metadata watch everything? Because it is "meta" data.

What is "metadata"-metadata?

Zhuang Shi shed tears of reunion after seeing this word for the first time, and even wanted to laugh a little. Could it be that metadata is the name of a big man who is familiar with modernism?
Insert picture description here

Okay, Zhuangshi first explained what is called "yuan" from a humanistic perspective.

Yuan still refers to the original and original meaning. Our meta-narrative, meta-data, and meta-fiction will tell you what “meta” is.

1. “Former” + “Storytelling” = Former storytelling

A complete explanation of the narrative, that is, the narration of historical meaning, experience and knowledge

2. "Meta" + "Data" = Metadata

Data about data

3. "Meta" + "Fiction" = Metafiction

In meta-fiction, the writer consciously exposes the fictional process of the novel, and produces a separation effect, so that the recipient understands that a novel is a fiction and cannot be regarded as a reality. In this way, fiction has acquired the meaning of ontology in the novel.

Well, to put it simply, Yuan is the original framework and elements of this thing. In the context of modernist large-scale industry, all mankind has entered the "order mode", so "yuan" is needed. So there is the "metamorphism" of everything.

Yes, will everyone hear a word called "metacognition" recently. There is nothing magical. To paraphrase the concept, meta-cognition is the cognition about cognition. Maybe the “first principles” of Shanyou teacher and the “friend of time” called the beast may be “meta-cognition”?

Insert picture description here

That metadata is easy to understand, which is "data about data". With metadata, it can make our data production and use more orderly.

Data production, some people will call it "background metadata": to guide data cleaning and loading.

Data usage, some people will divide it into "front-end metadata": descriptive, help us use reports and query tools more smoothly.

For the classification of metadata, we can divide it into the following three categories:

Business metadata describes metadata from the business level.

Technical metadata Various statistical information on the technical level, including data type, length, blood lineage, data analysis results, etc.

Process metadata ETL itself execution result statistics, such as how many rows are loaded, how many rows of data are discarded, and data loading time.

02 The core concept and key elements of metadata

Not much to say, let's go to the official Alibaba Cloud ppt:

Okay, if you don’t understand, Zhuangshi also drew a picture:

Insert picture description here
Insert picture description here
The original data production-use process

Insert picture description here

Related metadata involved

03 Metadata Products

There are currently three types of metadata management tools in China.

One is the specialized tools provided by companies such as IBM and CA, such as MetaStage acquired by IBM's acquisition of Ascential, and CA's DecisionBase;

The second is like DAG's MetaCenter, open source product Pentaho Metadata, they are not dependent on a certain BI product, but a third-party metadata management tool;
Insert picture description here

Open source product Pentaho Metadata diagram

Third, integrators like Puyuan and Dianthus also have their own metadata management tools: Puyuan MetaCube, Xinju Network Metadata Management System, Dianthus MetaOne, etc.

Universal metadata-driven microservice architecture:

https://cloud.tencent.com/developer/article/1080067

https://cloud.tencent.com/developer/article/1080078

Xinju Network Metadata Management System:

http://www.shsnc.com/index.php?m=content&c=index&a=lists&catid=188

Dianthus MetaOne product picture:

Insert picture description here
Insert picture description here

Specialized metadata management tools are better compatible with their own products. Once cross-system management is involved, it is not satisfactory.

04 The function and value of metadata

If you ask me, where is the function and value of metadata? At present, the industry has carried out some practices:

  1、血缘分析:向上、向下表级、字段级别的追溯数据。血缘分析可以让您轻松知道:“我正在查看的报告数据来源是什么?”、“数据经过哪些转换处理?”、“销售额”从包含税费更改为不包括税费,哪些下游字段受到了影响。血缘分析可以满足许多行业(包括医疗、金融、银行和制造业等)对所呈现数据的特殊监管及合规性要求。



 2、指标一致性分析:定期分析指标定义是否和实际情况一致。大佬会上对不齐数据是何等的尴尬。。。。



 3、实体关联查询:事实表与维度表的代理键自动关联。

05 Postscript

Break everything: from humanistic postmodernism to data postmodernism.

In the 1960s, anti-Western modern system philosophical trends emerged in Germany, France, and the United States, which was academically called "postmodernism". You may not know what postmodernism is. Take a picture and feel it:

Insert picture description here

Correct! It is Dali's "Eternal Memory".

Postmodernism is a spirit that uses disorder to oppose order, personal rants to oppose grand narratives, and deconstruction to oppose structure.

Griffin, one of the active postmodernists in contemporary America, said: “If the term postmodernism can be used in different ways to find common ground, it means that it refers to a wide range of emotions. , Rather than a common dogma—that is, an emotion that believes that humans can and must transcend modern times."

Behind all theories is the mainstream cognition + emotion of this era. Data is no exception. Let's take a look at the dispute between the two big bosses of the data warehouse in "Zhuangshixue Data Technology 01", we know that people are repeatedly jumping in order and speed.

Insert picture description here

Therefore, we have to consider today, what is the rationality of metadata today?

After the messy data is managed, is it necessary to do subtraction in the complexity and programming?

More data ≠ more information. How to turn data into information is something that every data person needs to think about continuously.

The private place of a data person is a big family that helps the data person grow, helping partners who are interested in data to clarify the learning direction and accurately improve their skills. Follow me and take you to explore the magical mysteries of data

1. Go back to "Data Products" and get <Interview Questions for Data Products from Big Factory>

2. Go back to "Data Center" and get <Dachang Data Center Information>

3. Go back to "Business Analysis" and get <Dachang Business Analysis Interview Questions>;

4. Go back to "make friends", join the exchange group, and get to know more data partners.

Guess you like

Origin blog.csdn.net/weixin_49880348/article/details/111404988