https://123dok.net/document/yr3mlgov-overview-the-transformer-based-models-for-nlp-tasks.html