https://wa008.github.io/posts/try-of-torchview-to-accelerate-finetune-new/
Acceleration of LLM - Matrix Multiplication - Informal's blog