Ling Jiu software: NLPIR big data platform for Chinese mining industries enabling

  With the increasingly frequent China's rapid economic development and foreign relations, the Chinese position in the world has gradually increased. Although Chinese is considered one of the most difficult languages ​​in the world, but in recent years, the Chinese people in the world go to school or continue to increase, these people throughout Asia, Europe, the Americas, Africa. And people do not learn Chinese students simply learn the language, culture, history major, students learn many countries in economic, trade, legal profession have begun to learn Chinese, they think Chinese will master the job and work helpful.

  Chinese characters into an information processing information processing section and two Chinese information processing, specifically includes input character, word, sentence, chapter, storage, transmission, output, recognition, conversion, compression, retrieval, analysis, understanding and generation, etc. processing technology. Chinese use computers to process information, that is, Chinese information processing, also known as the Chinese information processing.

  Chinese information processing is to collect, store, transport and use of relevant information in Chinese, is the use of computer and modern communications, lighting, layout, and other automated techniques Character information input and output sorting, processing, conversion, transmission, replication, and other an emerging science and technology seed processing. Its cross-cutting nature make it a branch of the "information science"; its comprehensive application to become an example of the "System project". It involves language, computer science, information science, engineering, psychology, mathematical statistics, acoustics, automatic identification technology, artificial intelligence, network technology, literature search to learn more. I can say it is a new multi-edge science. China to implement advanced information processing technology, Chinese information technology is an important resource development. Chinese information network has become a modern society, our nervous system, which will facilitate the rapid improvement of people's cultural and social productivity. Chinese information processing project has established modern Chinese language information system so that condensation in the language of knowledge and information play a greater efficiency, optimal use of the Chinese characters.

  Currently Chinese information processing capability gap with the international advanced level is still very great. For example: automatic segmentation and POS tagging, has yet to develop a system like the Japanese word that is widely accepted word tagging system. As can be seen from the method used, with the deepening of the study, based on statistical methods has gradually expose their own shortcomings, statistical methods can not solve all the problems, still requires a combination of rule-based approach, in order to be a breakthrough in accuracy ;

  Chinese information processing syntactic analysis and semantic analysis of the problem ; the problem of application of Chinese information processing, such as information input keyboard input and character recognition development has matured, but the realization of speech recognition is very difficult to adapt to the variations between different people a voice and a noise outside; Chinese information processing dispersed and there is low-level duplication, lack of uniform norms and standards issues; state of isolation modern Chinese research in the field of computer and no fundamental changes occur; Chinese language and minority language of information processing technology compared with the international level, there is a considerable gap.

  NLPIR intelligent semantic analysis of large data platform is based on the comprehensive needs of Chinese data mining, network integration of precise collection, natural language understanding, research text mining and semantic search, the whole chain of technology and process development platform for Internet content sharing.

  NLPIR big data semantic intelligence analysis platform mainly accurate capture, document conversion, new words found quantities word, language statistics, text clustering, text classification, abstract entity, intelligent filtering, sentiment analysis, document de-duplication, full-text search, encoding conversion functional modules and other items, the platform provides client tools, cloud services and secondary development interface and other products use the form. Among the various middleware API can be seamlessly integrated into the customer's various types of complex application systems, compatible with Windows, Linux, Android, Maemo5, FreeBSD and other operating system platforms, it can be used for Java, Python, C, C # and other developers language use.

  With the depth of information technology in various fields of application of our social life , Chinese information processing is becoming the way people work and live without means, Chinese information processing will have even broader market. This will enable the Chinese information processing and efficient Chinese search engine, real-time machine translation, text processing large-scale Chinese, Western text automatically identify cross-platform conversion, semantic understanding Pan-Chinese, Chinese e-commerce technology to achieve a major breakthrough. Chinese information processing has become a basic technology research, development, and application of China's information industry, the Internet's growing today, Chinese information processing technology will be more mature and innovative.

Guess you like

Origin www.cnblogs.com/ljrj/p/11225633.html