Professor Qin Bing of Harbin Institute of Technology | Alignment of Human Values in Big Language Models

Alignment of human values ​​in the big language model

From: HIT SCIR

Enter the NLP group —> join the NLP exchange group

43ca8675bf5f202062241dca49dbcc20.jpeg

Bing Qin Harbin Institute of Technology Social Computing and Information Retrieval Research Center

Title: Alignment of Human Values ​​in Large Language Models

Abstract: At present, the fourth wave of artificial intelligence technology represented by ChatGPT is sweeping all aspects of social and economic life. Relying on its astonishing ability, this kind of technology is rapidly affecting the development of society. However, due to its technical reasons, the use of large language models represented by ChatGPT may still cause harm to humans and society, such as generating inaccurate information to mislead humans, outputting harmful information to harm humans, etc. How to align the values ​​of large language models with humans is a major problem that needs to be solved urgently. This report will propose to align the big language model with human values ​​from four levels, which are: factual (telling facts and logic), legal (conforming to laws and regulations), reasonable (conforming to morality) and cultural (conforming to national regional culture). Based on this framework, we sort out the existing work in detail, not only summarizing the existing tasks and methods, but more importantly, finding valuable research directions and possible development trends in the future. We firmly believe that the goal of artificial intelligence is to provide theoretical support for the improvement of human well-being and the harmonious coexistence of all things! Therefore, through this report, it is hoped that more researchers will be able to improve the fidelity and value alignment of the generated content of the large language model model, so that artificial intelligence technology will be more beneficial to human society.

Lecturer: Qin Bing, professor of Computing Department of Harbin Institute of Technology, doctoral supervisor, deputy director of Institute of Natural Language Processing of Harbin Institute of Technology, director of Social Computing and Information Retrieval Research Center. National key research and development projects, the person in charge of key projects of the National Natural Science Foundation of China. Expert of the Science and Technology Innovation 2030-"New Generation Artificial Intelligence" major project management expert group of the Ministry of Science and Technology, executive director of the Chinese Information Society of China/deputy director of the Language and Knowledge Computing Committee/director of the Emotional Computing Committee, natural language processing of the Heilongjiang Computer Society Director of the special committee. Main research directions: natural language processing, knowledge graph, affective computing, text generation. Won the first prize of Qian Weichang Chinese Information Processing Science and Technology Award of Chinese Information Society, the first prize of Heilongjiang Science and Technology Award, the second prize of Heilongjiang Science and Technology Award and the second prize of Heilongjiang Technology Invention Award. Selected into the "2020 Global Women in Artificial Intelligence and AI 2000 Most Influential Scholars List" and "Forbes China 2020 Technology Women List".

PPT sharing

Harbin Institute of Technology Social Computing and Information Retrieval Research Center

Qin Bing

eb7d49c6127988ca7fb33ccdd267e235.png

0a43326602afc94274e8b0faca3a1e0e.png

0a39f70a012f2f1fe82d72d52f04d327.png

66d0a56a31d23b271352ea2e45be6e0c.png

c582505f1bcba3fb0f395ee08b32f917.png

061ee2c5124efac8051cab8b9543c760.png

ec870c59c8c29fe97523ffee29d468d3.png

2ef7b08b64ed0a262464ba98529f27fb.png

2a4418765a5114b1dd38c9083ffd9787.png

e64930611145ff6e6a288a116a5b9fc1.png

796031bc422ee0b8135dfa14daafc038.png

66c8138bcca60cf0756c8334fb642e0c.png

754634f8249b6352ee0593d496dcd691.png

46ac1502a6cdadccf53a548f021327b7.png

fe9a50871d41aba02ac00cb539f242b8.png

f2548b7cd7315ee834209e05458d4922.png

66546f239357199db4319bc12cf9468d.png

d0f6c8678d6df88cc0edf68ffe64e6a8.png

e2369a0e3e87d9920c9d8987696d58a0.png

85d4481ce2ebaeecede5cc2aa9d6a726.png

bd97e2264a6625801efc54a600fbc562.png

3f3ff4f471be994a9bd1ad3b46eb23ee.png

b2a91ac00133a3feda29aaa5d42755e8.png

2f4e4d0826694f0e2de82066f856c677.png

7e1879db44661024215212920fba73aa.png

3ecc70f5edac6774769a2a7f265ef90b.png

23208545ce9df9b1d47ebff205b48a95.png

2b8abd1587f69abf93a9ea6463519ad0.png

6d2d513dbce7b97649283641029302aa.png

1dc66de15b137d1f196cac46714882b0.png

35d8c12e404f95f91babb6662dc733a8.png

b26a8c59b716ad7bcf9a77d8688a222b.png

b3830f0c534a8c4df812be90fb78a7ef.png

4d1963582a47064cb623218ffd79e37d.png

a0ce9813fd452d1b81cd6901474136ee.png

62407f259173552f6b867199f794624a.png

8c1475a15745a0ca035b6307ca560003.png

143e691b7fdf88f08f646533dea36bc9.png

33a772e36eab7d0b8b21455c402c6ebc.png

3c60b794aab633a6d8763546cd443914.png

01f56f51e5d65827ba12a438b3d56855.png

4656b5c80649b0b451baf0caca03b0d6.png

8609b73b6962df02472b60e4f36ff395.png

5d9808cc7c916c813089c8de1b924e02.png

d3081e016626d96a4a10614e0fe9b122.png

f459380e7fddb6c32f43b7180f542a2d.png

37d517001631384c0c2c0848453ebdda.png

970ed6e5200c9b296686ac5045d175d2.png

2bf8674dbbd6978af3c7c5919a2b1b88.png

2e418626cdeed6a00ee875e3cf66c236.png

a5ff413c9db5aea8b959dbee7202a2e6.png

e48c4796dd850e50fce137f75fb2a187.png

ac08345006d998db091035ac8aec18f9.png

979dc35a10c23cdc3d5ee4669be0d0d1.png

5862b9b584a232dcfde3e45f5a2fcc50.png

502af95a73b877948fa583df5b99cebb.png

1189e63dc69e4836dad30b3859b9197b.png

8a2f6107c22232aa21cacc0abaf3297d.png

6aa7b2dc810f400e825b7140b9e2bda2.png

bc25e0c6959dbf17b02a22f3310e483b.png

14d2f020d6314c8ae4df268e1fe2f4c6.png

4a7f10721297a731270f3f29702d65b1.png

ddbd327781b3ac3768c8d8c67cc9e42f.png

dc472cebd73b46749613817a044874f2.png

153a7825743eded1b5918ba4ac27092b.png

9b5db30101e4ec4dd119b0b4d8dc82f3.png

Editor in charge of this issue: Zhao Yanyan

Editor of this issue: Yang Xin

Enter the NLP group —> join the NLP exchange group

Guess you like

Origin blog.csdn.net/qq_27590277/article/details/132095184