This media is not supported in your browser
VIEW IN TELEGRAM
🍄 Video Understanding with GPT-4V(ision) 🍄
👉 #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
😎Review https://t.ly/RISMm
😎Paper arxiv.org/pdf/2310.19773.pdf
😎Project https://multimodal-vid.github.io
👉 #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
😎Review https://t.ly/RISMm
😎Paper arxiv.org/pdf/2310.19773.pdf
😎Project https://multimodal-vid.github.io
🤯22👍9🔥2👏1😱1