https://friendlyvillain.github.io/posts/transformer-principle/
Transformer with Pytorch - Simple Record