- A language model tells you how probable a particular sentence is.
- Building a good RNN language model requires a training set drawn from a large corpus.
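- Each word is converted to a one-hot vector. Punctuation marks and the end-of-sentence marker are tokens as well, and a word that is missing from the dictionary still gets an input token (commonly an unknown-word token).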
- At the first time step the input is the zero vector, and a softmax outputs a probability for every word in the dictionary. At each later step the input is the one-hot vector of the previous word, and the output is the probability distribution of the next word. The cross-entropy losses of all outputs are summed, and the total is then backpropagated.
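- The probability of a whole sentence is the product of the per-step outputs, that is, the probability each step assigns to the word that actually comes next.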
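
The steps above can be sketched in plain Python. This is only a minimal illustration under several assumptions, not a reference implementation: the toy dictionary `VOCAB`, the `<EOS>` and `<UNK>` token names, the weight names `Wax`, `Waa`, `Wya`, the tanh activation, and the omission of bias terms are all choices made for the example. It covers the forward pass, the summed cross-entropy loss, and the sentence probability; backpropagation is left out.

```python
import math
import random

# Hypothetical toy dictionary; a real one comes from a large corpus.
VOCAB = ["<EOS>", "<UNK>", "cats", "sleep", ",", "."]
WORD_TO_ID = {word: i for i, word in enumerate(VOCAB)}


def one_hot(index, size):
    """Return a one-hot vector with a 1.0 at position `index`."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def encode(tokens):
    """Map tokens to ids, using <UNK> for unknown words and appending <EOS>."""
    unk = WORD_TO_ID["<UNK>"]
    return [WORD_TO_ID.get(t, unk) for t in tokens] + [WORD_TO_ID["<EOS>"]]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    top = max(logits)
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def init_params(vocab_size, hidden_size, seed=0):
    """Small random weights (untrained); biases are omitted for brevity."""
    rng = random.Random(seed)

    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

    return {
        "Wax": mat(hidden_size, vocab_size),   # input -> hidden
        "Waa": mat(hidden_size, hidden_size),  # hidden -> hidden
        "Wya": mat(vocab_size, hidden_size),   # hidden -> output scores
    }


def sentence_loss_and_prob(ids, params):
    """Run the RNN over one sentence.

    Returns the summed cross-entropy loss and the sentence probability
    (the product of the probabilities given to each actual next word).
    """
    vocab_size = len(params["Wya"])
    hidden_size = len(params["Waa"])
    a = [0.0] * hidden_size   # initial hidden state
    x = [0.0] * vocab_size    # first input is the zero vector
    loss, prob = 0.0, 1.0
    for target in ids:
        pre = [p + q for p, q in zip(matvec(params["Wax"], x),
                                     matvec(params["Waa"], a))]
        a = [math.tanh(v) for v in pre]
        y_hat = softmax(matvec(params["Wya"], a))  # distribution over VOCAB
        loss += -math.log(y_hat[target])           # cross-entropy at this step
        prob *= y_hat[target]                      # multiply per-step outputs
        x = one_hot(target, vocab_size)            # next input: the true word
    return loss, prob


params = init_params(len(VOCAB), hidden_size=4)
ids = encode(["cats", "sleep", "."])
loss, prob = sentence_loss_and_prob(ids, params)
print(f"summed cross-entropy: {loss:.4f}, sentence probability: {prob:.6f}")
```

Because each step's cross-entropy is the negative log of the probability given to the true next word, the sentence probability equals `exp(-loss)`. Minimizing the summed loss therefore maximizes the probability the model assigns to the training sentences.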