ictnlp/LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Language: Python
#efficient #gpt4o #gpt4v #large_language_models #large_multimodal_models #llama #llava #multimodal #multimodal_large_language_models #video #vision #vision_language_model #visual_instruction_tuning
Stars: 173 Issues: 7 Forks: 11
https://github.com/ictnlp/LLaVA-Mini
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Language: Python
#efficient #gpt4o #gpt4v #large_language_models #large_multimodal_models #llama #llava #multimodal #multimodal_large_language_models #video #vision #vision_language_model #visual_instruction_tuning
Stars: 173 Issues: 7 Forks: 11
https://github.com/ictnlp/LLaVA-Mini
GitHub
GitHub - ictnlp/LLaVA-Mini: LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images,…
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner. - GitHub - ictnlp/LLaVA-Mini: LLaVA-Mi...
liweiphys/layra
LAYRA is a ready-to-use visual RAG system with a complete UI built with Next.js and FastAPI, preserving document layout, tables, paragraphs, and graphical elements without any structural fragmentation.
Language: TypeScript
#agent #colpali #colqwen #document_parser #fastapi #gpt_4o #knowledge_base #llm #nextjs #pdf_parser #qwen #rag #visual_rag
Stars: 190 Issues: 3 Forks: 15
https://github.com/liweiphys/layra
LAYRA is a ready-to-use visual RAG system with a complete UI built with Next.js and FastAPI, preserving document layout, tables, paragraphs, and graphical elements without any structural fragmentation.
Language: TypeScript
#agent #colpali #colqwen #document_parser #fastapi #gpt_4o #knowledge_base #llm #nextjs #pdf_parser #qwen #rag #visual_rag
Stars: 190 Issues: 3 Forks: 15
https://github.com/liweiphys/layra
GitHub
GitHub - liweiphys/layra: LAYRA—an enterprise-ready, out-of-the-box solution—unlocks next-generation intelligent systems powered…
LAYRA—an enterprise-ready, out-of-the-box solution—unlocks next-generation intelligent systems powered by visual RAG and limitless visual multi-step agent workflow orchestration. - liweiphys/layra
👍1
iMoonLab/yolov13
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
Language: Python
#correlation_modelling #hypergraph_learning #object_detection #real_time_object_detection #visual_recognition #yolo #yolov13
Stars: 230 Issues: 14 Forks: 22
https://github.com/iMoonLab/yolov13
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
Language: Python
#correlation_modelling #hypergraph_learning #object_detection #real_time_object_detection #visual_recognition #yolo #yolov13
Stars: 230 Issues: 14 Forks: 22
https://github.com/iMoonLab/yolov13
GitHub
GitHub - iMoonLab/yolov13: Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception". - iMoonLab/yolov13
👍1