https://huggingface.co/XiaomiMiMo/MiMo-VL-7B-RL
https://github.com/XiaomiMiMo/MiMo-VL
New vision language model (VLM) that outperforms Qwen2.5-VL #models #vlm
Deep Learning
https://github.com/parthsarthi03/raptor #Frameworks
https://github.com/illuin-tech/colpali
Efficient Document Retrieval with Vision Language Models #Frameworks #Models
GitHub
GitHub - illuin-tech/colpali: The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
https://moondream.ai/blog/moondream-3-preview A small vision language model (VLM) designed for edge and on-device use. #Models
Moondream
A fast & powerful vision model that rocks.
https://www.perceptron.inc/blog/introducing-isaac-0-1 Another vision language model (VLM) with similar properties #Models
marketing.perceptron.inc
A layer of intelligence for the physical world.
We are a research company building the future of Physical AGI.