https://abtuo.github.io/posts/challenges-vision-language-models/