First of all, we need to know that the softmax function is generally used for multi-classification. Wikipedia gives the following definition:
The intuitive understanding is that the index value of the element is the sum of the index values of all elements!
Attaching the picture on Professor Li Hongyi's PPT is easier to understand: