VS digital words and language and information

Language and mathematics are generated for the same purpose: to record and disseminate information.

Theory of Communication: the generation, transmission, receiving, and feedback.
The origin of words: When the multi-language and vocabulary to the human brain alone can not remember when, efficient record demand information arises.
Translation needs: people with different civilizations need to communicate or communicate.
Translation reach: the ability of different writing system in the recording of information is equivalent.

Text only carrier of information, rather than the information itself, without text, numbers can be stored in the same sense of information. This is the foundation of modern communications.

When too many characters, the first time the concept of generalization and classification began in the Chinese hieroglyphs, the intention of "day" is the sun, but it is also a sun rising from the downhill and then rising time period (one day). This clustering concept in principle and clustering today natural language processing or machine learning are very similar.

According to the text to mean clustering, will eventually bring some ambiguity, to solve this problem need to rely on context.

Redundant information is information security protection. Rosetta Stone content on the same information is repeated three times, so as long as there is a well-preserved contents, the original message will not be lost, it will have to channel coding guidance.

Is the basis of the digital counting system. Early figures do not write in form, but breaking a finger, which is why we use the decimal today. When the ten fingers found not enough, the binary system is generated. But the Mayan civilization is finished counting fingers and toes began to carry, so the Mayan civilization is to use Vigesimal, Maya a century (the Sun) is four hundred years. With respect to the decimal Vigesimal more complex, such as the multiplication table is replaced Vigesimal chess set in a 19 x 19.

For a different number of digits, said Chinese people use a ten Tsumoru trillion (trillion and represent one million trillion), on behalf of the Romans by I 1, V represents the 5, X represents the 10, L represents the 50, C represents 100 , D representative of 500, M 1000 representatives. Both representations have introduced the concept of simple coding, in China, the coding rule is multiplication, the meaning of the wording of 2 million is 2 x 100 x 10000, while in Rome, decoding and subtraction rule is: Small number appears in large numbers to the left to decrease, the right to add. For example IV represents 5-1 = 4. The rules are complex and difficult to describe large numbers and fractions. While the invention spend behind streaked on M represents a thousand times, but if you want to write one billion words, or write a blackboard. Description Digital most effective is the ancient Indians, the invention is 10 digits including zero.

Pictograph to phonetic is a leap in the manner described because the human object, evolved from the appearance of the object to an abstract concept, while unconsciously using coding information. Not only that, coding is still very reasonable, common words such as short, uncommon word length. Similarly the communication, if the channel is wider, the information need not be transferred to the compression, a narrow channel, the information needs to be compressed as much as possible before transmission, and decompressed at the receiving end. This phenomenon today video playback settings on the broadband Internet and mobile Internet exactly the same with us, the former is the result of broadband transmission, and therefore the resolution can be made higher, the latter due to the limitations of air channel bandwidth, transmission speed slower one to two orders of magnitude, so the resolution is lower.

If the words from the letters to the encoding rules of word formation is the word, then the syntax is the encoding and decoding rules of the language. However, comparatively speaking, the word can be considered limited and closed the set, and the language is unlimited and open collections. Mathematically, the former can have complete codec rule, while the latter will not have this feature. Therefore, any language has grammar rules can not cover.

Published 16 original articles · won praise 0 · Views 231

Guess you like

Origin blog.csdn.net/qq_44713502/article/details/103107199