MathGPT, China's first 100-billion-level large math model, goes online today and opens for public beta testing

From: TAL Education Group

On August 24, during TAL's 20th anniversary livestream, CTO Tian Mi announced that MathGPT, a 100-billion-level large model for mathematics developed in-house by TAL, had officially launched and opened for public beta testing. Users can now apply for a free trial through the official website (https://www.mathgpt.com/).

In May of this year, TAL announced that it was developing its own large mathematical model, named MathGPT. MathGPT is a large model for the vertical domain of mathematics, built around problem-solving and explanation algorithms and aimed at math enthusiasts and research institutions worldwide. It is also the first large model in China built specifically for mathematics.

Users can submit math problems to MathGPT as text or images and receive answers in a conversational format. They can also click the "random question" button to have the system generate a math problem and its solution. MathGPT is currently available on PC and mobile, in Chinese and English.

According to Tian Mi, MathGPT draws on the education, teaching, and research data that TAL has accumulated over many years and focuses on the field of mathematics. A training, inference, and deployment framework for 100-billion-level large models underpins the model's capabilities. Using high-quality educational data, the team applied continued training and supervised fine-tuning across multiple tasks such as problem calculation, explanation, and question answering, which it says yields strong performance. Alignment with human feedback is expected to further improve the model's overall quality. TAL says MathGPT has clear advantages in problem-solving accuracy, stability, and user experience.

According to the official MathGPT website, the model's mathematical computing ability covers elementary, junior high, and high school math problems, and it supports Q&A interaction.

MathGPT technical report

The MathGPT technical report shows that across six public math benchmarks (CEval-Math, AGIEval-Math, APE5K, CMMLU-Math, Gaokao Mathematics, and Math401), TAL's MathGPT achieved the highest score on several of them. MathGPT also performed well on the junior and senior high school subjects of the general-purpose C-Eval benchmark.

MathGPT on the C-Eval leaderboard for junior and senior high school subjects

For problem-solving stability and clarity of explanation, MathGPT is trained on a large volume of step-by-step solutions from well-known teachers, so its solution steps are professional and clear.

Taking a sequence problem as an example, the answer given by MathGPT has three parts: "analysis", "detailed explanation", and "key points", which is more thorough than the rough explanations of general-purpose large models. "Analysis" lays out the approach and line of thinking to help users understand the problem. "Detailed explanation" gives the specific calculation steps and the answer. The final "key points" section highlights what the problem tests, where it is difficult, and what matters most, helping users review, reflect on the intent behind the problem, and generalize from one example to other cases.

For users, working on a math problem is not only about getting the answer but also about understanding the principles and reasoning behind it. Compared with general-purpose large models, MathGPT solves problems more accurately and explains its answers more clearly, which better meets the core need of users who turn to AI products for math problems.

Alongside the MathGPT release, TAL also published an updated, representative, and challenging math evaluation set on its official website for AI experts and math enthusiasts worldwide to try and evaluate.

Tian Mi said he hopes MathGPT will play a greater role in mathematics education. TAL is willing to share with the industry its experience and methods for developing 100-billion-level large models based on large-scale, high-quality content, and to make progress together with the industry.

Building on MathGPT, TAL will continue to explore how learning works in an AI environment, to better serve learners and math enthusiasts worldwide, share its experience with the industry promptly, and use AI to drive positive change in educational technology.

As the public beta progresses, MathGPT's problem-solving ability will continue to improve. According to Tian Mi, product-level applications based on MathGPT are also being developed at an accelerated pace and will be released in the near future.



Origin blog.csdn.net/qq_27590277/article/details/132486459