https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL
https://github.com/XiaomiMiMo/MiMo-VL
New Vision Language Model(VLM) that outperforms Qwen2.5-VL #models #vlm
https://github.com/XiaomiMiMo/MiMo-VL
New Vision Language Model(VLM) that outperforms Qwen2.5-VL #models #vlm
huggingface.co
XiaomiMiMo/MiMo-VL-7B-RL · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Deep Learning
https://github.com/parthsarthi03/raptor #Frameworks
https://github.com/illuin-tech/colpali
Efficient Document Retrieval with Vision Language Models #Frameworks #Models
Efficient Document Retrieval with Vision Language Models #Frameworks #Models
GitHub
GitHub - illuin-tech/colpali: The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. - illuin-tech/colpali