Transformer Code Made Easy to Understand (A First Look)

To better understand the Transformer's code, I consulted a range of references and consolidated them here, drawing on the work of many experienced practitioners.

The code is organized into blocks that follow the Transformer architecture, so each part can be viewed and modified in a Jupyter notebook.

The schematic diagram is as follows:

[Figure: TRM schematic]

The repository already covers the material in detail, so please refer to it for a fuller understanding.
It is available on both platforms:

  1. GitHub-Transfoemer_code
  2. Gitee-Transfoemer_code

Origin blog.csdn.net/m0_56075892/article/details/123685611