The Stanford 2023 AI Index Report is out! China dominates the top AI conferences, and the number of papers published by the Chinese Academy of Sciences ranks first in the world

81c18c2ef3af996454c240471256bc53.png

The 2023 Artificial Intelligence Index Report is released! This report shows that China ranks first in the world in AI top conference papers, but the number of citations is lower than that of the United States. In addition, among the world's top ten AI paper publications, China has 9 seats, catching up with MIT one after another.

Recently, Stanford released the 2023 AI Index Report.

825c19a1bb739e161a0e26c679ec34cb.jpeg

It is worth noting that the Stanford AI Index report lists the top ten institutions in the world for "AI paper publication volume", and 9 of them are from China, catching up with MIT one after another.

They are: Chinese Academy of Sciences, Tsinghua University, University of Chinese Academy of Sciences, Shanghai Jiaotong University, Zhejiang University, Harbin Institute of Technology, Beijing University of Aeronautics and Astronautics, University of Electronic Science and Technology of China, Peking University, and MIT.

0a237a0ceb1c4d9455b66baadc45c0e4.jpeg

This year's report is divided into eight main sections: Research and Development, Technology Performance, Ethics of AI Technology, Economics, Education, Policy and Governance, Diversity, and Public Perspectives.

The following extracts several key points from the report.

The cooperation between China and the United States ranks first in the world

From 2010 to 2021, although the pace of cross-border cooperation in AI papers has slowed down, since 2010, the number of artificial intelligence research cooperation between the United States and China has increased by about 4 times , which is 2.5 times more than the total number of cooperation between China and the United Kingdom.

However, from 2020 to 2021, the total number of Sino-US cooperation will only increase by 2.1%, the smallest year-on-year growth rate since 2010.

Furthermore, the total number of AI papers has more than doubled since 2010. From 200,000 in 2010 to almost 500,000 in 2021 (49,601).

cef084ace256d663493c160a03ba4497.jpeg

In terms of the type of AI papers published, in 2021, 60% of all published AI papers will be journal articles, 17% will be conference papers, and 13% will come from repositories.

While journal and repository papers have grown 3-fold and 26.6-fold, respectively, over the past 12 years, the number of conference papers has declined since 2019.

0d6a0fa29f1f59a170ae3089782ce195.jpeg

Pattern recognition, machine learning, and computer vision are still hot topics in research in the field of artificial intelligence.

a1b409204de2129414207318850dc7b5.jpeg

China continues to lead in the total number of journals, conferences and repositories.

The U.S. still leads in terms of AI conference and repository citations, but those leads are slowly being eroded. Nonetheless, most of the world's large-scale language models and multimodal models (54% in 2022) are produced by US institutions.

a82c81a6a65f189d3c9622dac895cb50.jpeg

China dominates the list of AI top conferences, but the number of citations is lower than that of the United States

In the publication of AI journal papers, China has always maintained its leading position, with 39.8% in 2021, followed by the European Union and the United Kingdom (15.1%), and then the United States (10.0%).

0dce1f6e403c868f3b664703334e0fc8.jpeg

Since 2010, the proportion of citation frequency of Chinese artificial intelligence journal papers has gradually increased, while the European Union, the United Kingdom, and the United States have all declined. China, the European Union and the United Kingdom and the United States accounted for 65.7% of the total global citations.

88b32f7309b3d72477c42dc9e724b952.jpeg

So, what about the publication of papers by the world's top conferences?

In 2021, China will account for the largest share of the number of papers published at the world's top AI conferences with 26.15%, followed by the European Union and the United Kingdom with 20.29%, and the United States with 17.23%.

e5f2ad46b4ea017dfbe79de2703afd68.jpeg

Judging from the citations of top conference papers, although China is highly productive, its citations are lower than those of the United States. The citations of the top conference papers in the United States are 23.9%, and 22.02% in China.

It can be seen from the side that China publishes the largest number of papers, but the quality is not as high as that of the United States.

39b1525b6ef2221b2693c964f6345a9c.jpeg

The United States leads the world in AI paper repository submissions with 23.48%. The lowest in China, 11.87%.

e05e38e872d5c291ce1f847d7d07a408.jpeg

9 institutions in China, AI papers published surpassed MIT

In 2021, China will account for 9 of the world's top ten institutions for the total number of papers published. The total number of papers published by different institutions is shown in the figure below. MIT ranks tenth, with 1,745 papers published.

d3f603685777e7ef61b57a750bd6bce9.jpeg

In terms of computer vision (CV), ten institutions in China rank among the top ten in the world. They are Chinese Academy of Sciences, Shanghai Jiaotong University, University of Chinese Academy of Sciences, Tsinghua University, Zhejiang University, Beihang University, Wuhan University, Beijing Institute of Technology, Harbin Institute of Technology, and Tianjin University.

f29e5e7d8c443784ac925439750341c2.jpeg

In the field of natural language processing (NLP), things are different.

The top ten institutions/companies in the world are: Chinese Academy of Sciences, Carnegie Mellon University, Microsoft, Tsinghua University, Carnegie Mellon University-Australia, Google, Peking University, University of Chinese Academy of Sciences, Ali, Amazon.

1001358d9197c985ee5a19674af0e59f.jpeg

Speech recognition ranks as follows:

625efb4c4549baf7f8b2c56744e22fb4.jpeg

Industry leads academia

Among the important artificial intelligence machine learning systems released in 2022, the language system accounts for the most, with 23, which is 6 times the number of multimodal systems.

90aed0516ecfdb6e23bd5a3ca268b2a1.jpeg

Industry leads academia in paper output.

Until 2014, most important models were published by academia. Since then, the industry has turned around. By 2022, 32 important machine learning models will be born in industry, while only 3 in academia.

8975048c6245a471434038f222007b42.jpeg

It can be seen that building state-of-the-art artificial intelligence systems increasingly requires a large amount of data, computing power, and financial resources compared with non-profit organizations and academia, while industry players certainly have more financial resources to do this. matter.

In 2022, the United States will produce the largest number of significant machine learning systems with 16, followed by the United Kingdom (8) and China (3).

Furthermore, since 2002, the U.S. has surpassed the U.K. and the European Union, China, in terms of the total number of significant machine learning systems created

805eafc22dac96eae066934e16af7d02.jpeg

Looking at the distribution of researchers behind these important AI systems, the United States has the most researchers, 285 people, more than twice that of the United Kingdom, and nearly six times that of China.

fb94978dd548b45fdc471aab381649a5.jpeg

LLM is getting bigger and bigger, and the computing power is more expensive

Large language and multimodal models, sometimes referred to as base models, are an emerging and increasingly popular type of AI model that are trained on large amounts of data and are suitable for a variety of downstream applications.

Large-scale language and multimodal models such as ChatGPT, DALL-E 2, and MakeA-Video have demonstrated impressive capabilities and started to be widely deployed in the real world.

Analyzing the country affiliation of the authors of these models, the majority of these researchers were from US institutions (54.2%).

0e0d455732a1580f05c62dd8aac0ad76.jpeg

The Stanford AI Index report also lays out a timeline for the release of large language and multimodal models.

764514917c15dc0a3f22100d488b8a85.jpeg

Large language models are getting bigger and more expensive.

The first large-scale language model, GPT-2, was released in 2019, with 1.5 billion parameters and a training cost of about $50,000. Google PaLM is one of the large language models launching in 2022, with 540 billion parameters and a cost of up to $8 million.

392227b37127fd38535965493f8d6c9d.jpeg

From the perspective of parameters and training costs, PalmM is 360 times larger and 160 times more expensive than GPT-2.

Not just PalM, but in general, large language and multimodal models are getting bigger and more expensive.

For example, DeepMind's large-scale language model Chinchilla, which will launch in May 2022, is estimated to cost $2.1 million, while BLOOM's training costs about $2.3 million.

d8668c8907c4df693583ab8b1ad2fbb3.jpeg

Progress of GAN in face generation over time, the last image was generated by Diffusion-GAN, a model that achieves state-of-the-art state-of-the-art on STL-10.

7ddb2fef4d48080e0c94fafdf798b5b7.jpeg

Last year, with the release of models such as OpenAI's DALL-E 2, Stability AI's Stable Diffusion, Midjourney, Meta's Make-AScene, and Google's Imagen, text-to-image generation models have gradually entered the public eye.

Below, enter the same prompt, "A panda playing the piano on a warm Paris night", with images generated by three publicly accessible AI text-to-image systems, DALL-E 2, Stable Diffusion, and Midjourney, respectively.

d2ad9a1d0fa7ebbaee6456c227369c11.jpeg

Of all the recently released text-to-image generation models, Google's Imagen performs the best on the COCO benchmark.

This year, the Google researchers who created Imagen also released DrawBench, a harder text-to-image benchmark designed to challenge increasingly powerful text-to-image models.

b21f4217fdac0fa9ec9c373c6176c6d0.jpeg

In addition, the report also introduced some biases in the current generative AI model. For example, when DELLE-2 prompted the CEO, everyone seemed to adopt a confident posture with crossed arms.

7230539a7f6d0bb4e9b4256025449b73.jpeg

In Midjourney, when prompted to generate "influencers," it generated 4 images of older-looking white males.

dad463be12383e9f809b7aa6af392b94.jpeg

For the full report, see:

https://aiindex.stanford.edu/wp-content/uploads/2023/04/HAI_AI-Index_Report_2023.pdf

Guess you like

Origin blog.csdn.net/weixin_41033724/article/details/130050892