https://mpost.io/microsoft-has-introduced-multimodal-language-model-otter-for-visual-understanding-based-on-the-massive-instructional-visual-text-dataset-mimic-it/