b7leung/MLE-Flashcards
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
#ai #artificial_intelligence #computer_science #computer_vision #flashcards #interview #interview_preparation #machine_learning #review
Stars: 121 Issues: 1 Forks: 9
https://github.com/b7leung/MLE-Flashcards
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science.
#ai #artificial_intelligence #computer_science #computer_vision #flashcards #interview #interview_preparation #machine_learning #review
Stars: 121 Issues: 1 Forks: 9
https://github.com/b7leung/MLE-Flashcards
GitHub
GitHub - b7leung/MLE-Flashcards: 200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and…
200+ detailed flashcards useful for reviewing topics in machine learning, computer vision, and computer science. - b7leung/MLE-Flashcards
❤1
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python
#computer_vision #document_ai #eccv_2022 #multimodal_pre_trained_model #nlp #ocr
Stars: 98 Issues: 2 Forks: 5
https://github.com/clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Language: Python
#computer_vision #document_ai #eccv_2022 #multimodal_pre_trained_model #nlp #ocr
Stars: 98 Issues: 2 Forks: 5
https://github.com/clovaai/donut
GitHub
GitHub - clovaai/donut: Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator…
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022 - clovaai/donut
❤1
roboflow-ai/notebooks
Set of Jupyter Notebooks linked to Roboflow Blogpost and used in our YouTube videos.
Language: Jupyter Notebook
#computer_vision #deep_learning #deep_neural_networks #image_classification #image_segmentation #object_detection #pytorch #tutorial #yolov5 #yolov6 #yolov7
Stars: 126 Issues: 1 Forks: 14
https://github.com/roboflow-ai/notebooks
Set of Jupyter Notebooks linked to Roboflow Blogpost and used in our YouTube videos.
Language: Jupyter Notebook
#computer_vision #deep_learning #deep_neural_networks #image_classification #image_segmentation #object_detection #pytorch #tutorial #yolov5 #yolov6 #yolov7
Stars: 126 Issues: 1 Forks: 14
https://github.com/roboflow-ai/notebooks
GitHub
GitHub - roboflow/notebooks: A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything…
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM ...
SkalskiP/courses
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Language: Python
#computer_vision #deep_learning #deep_neural_networks #machine_learning #mlops #multimodal #natural_language_processing #nlp #transformers #tutorial
Stars: 323 Issues: 0 Forks: 29
https://github.com/SkalskiP/courses
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Language: Python
#computer_vision #deep_learning #deep_neural_networks #machine_learning #mlops #multimodal #natural_language_processing #nlp #transformers #tutorial
Stars: 323 Issues: 0 Forks: 29
https://github.com/SkalskiP/courses
GitHub
GitHub - SkalskiP/courses: This repository is a curated collection of links to various courses and resources about Artificial Intelligence…
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI) - SkalskiP/courses
👍1
kevmo314/magic-copy
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.
Language: TypeScript
#chrome_extension #computer_vision #image_processing
Stars: 375 Issues: 2 Forks: 24
https://github.com/kevmo314/magic-copy
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.
Language: TypeScript
#chrome_extension #computer_vision #image_processing
Stars: 375 Issues: 2 Forks: 24
https://github.com/kevmo314/magic-copy
GitHub
GitHub - kevmo314/magic-copy: Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground…
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard. - kevmo314/magic-copy
Jumpat/SegmentAnythingin3D
Segment Anything in 3D with NeRFs
#3d #3d_segmentation #computer_vision #nerf #segment_anything #segmentation
Stars: 224 Issues: 2 Forks: 7
https://github.com/Jumpat/SegmentAnythingin3D
Segment Anything in 3D with NeRFs
#3d #3d_segmentation #computer_vision #nerf #segment_anything #segmentation
Stars: 224 Issues: 2 Forks: 7
https://github.com/Jumpat/SegmentAnythingin3D
GitHub
GitHub - Jumpat/SegmentAnythingin3D: Segment Anything in 3D with NeRFs (NeurIPS 2023 & IJCV 2025)
Segment Anything in 3D with NeRFs (NeurIPS 2023 & IJCV 2025) - Jumpat/SegmentAnythingin3D
X-PLUG/mPLUG-Owl
mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
Language: Python
#alpaca #chatbot #chatgpt #computer_vision #damo #gpt #gpt4 #gpt4_api #huggingface #instruction_tuning #large_language_models #llama #mplug #mplug_owl #multimodal #pretraining #pytorch #transformer #visual_reasoning #visual_recognition
Stars: 209 Issues: 1 Forks: 9
https://github.com/X-PLUG/mPLUG-Owl
mPLUG-Owl🦉: Modularization Empowers Large Language Models with Multimodality
Language: Python
#alpaca #chatbot #chatgpt #computer_vision #damo #gpt #gpt4 #gpt4_api #huggingface #instruction_tuning #large_language_models #llama #mplug #mplug_owl #multimodal #pretraining #pytorch #transformer #visual_reasoning #visual_recognition
Stars: 209 Issues: 1 Forks: 9
https://github.com/X-PLUG/mPLUG-Owl
GitHub
GitHub - X-PLUG/mPLUG-Owl: mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family - X-PLUG/mPLUG-Owl
nv-tlabs/NKSR
[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction
Language: Python
#3d_reconstruction #computer_vision #graphics #neural_kernel #point_cloud
Stars: 250 Issues: 7 Forks: 6
https://github.com/nv-tlabs/NKSR
[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction
Language: Python
#3d_reconstruction #computer_vision #graphics #neural_kernel #point_cloud
Stars: 250 Issues: 7 Forks: 6
https://github.com/nv-tlabs/NKSR
GitHub
GitHub - nv-tlabs/NKSR: [CVPR 2023 Highlight] Neural Kernel Surface Reconstruction
[CVPR 2023 Highlight] Neural Kernel Surface Reconstruction - nv-tlabs/NKSR
👎1
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language: Python
#computer_graphics #computer_vision #radiance_field
Stars: 348 Issues: 4 Forks: 14
https://github.com/graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Language: Python
#computer_graphics #computer_vision #radiance_field
Stars: 348 Issues: 4 Forks: 14
https://github.com/graphdeco-inria/gaussian-splatting
GitHub
GitHub - graphdeco-inria/gaussian-splatting: Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance…
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering" - graphdeco-inria/gaussian-splatting
👍1
cvg/glue-factory
Training library for local feature detection and matching
Language: Python
#computer_vision #deep_learning #iccv2023 #image_matching
Stars: 200 Issues: 1 Forks: 13
https://github.com/cvg/glue-factory
Training library for local feature detection and matching
Language: Python
#computer_vision #deep_learning #iccv2023 #image_matching
Stars: 200 Issues: 1 Forks: 13
https://github.com/cvg/glue-factory
GitHub
GitHub - cvg/glue-factory: Training library for local feature detection and matching
Training library for local feature detection and matching - cvg/glue-factory
hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Language: Python
#aigc #computer_vision #diffusion_models #dreamfusion #gaussian_splatting #nerf #radiance_field #text_to_3d
Stars: 134 Issues: 0 Forks: 1
https://github.com/hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Language: Python
#aigc #computer_vision #diffusion_models #dreamfusion #gaussian_splatting #nerf #radiance_field #text_to_3d
Stars: 134 Issues: 0 Forks: 1
https://github.com/hustvl/GaussianDreamer
GitHub
GitHub - hustvl/GaussianDreamer: [CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion…
[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models - hustvl/GaussianDreamer
👍2
hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie
GitHub
GitHub - hkchengrex/Cutie: [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation - hkchengrex/Cutie
lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
Language: JavaScript
#ai #artificial_intelligence #computer_vision #llama #llamacpp #llm #local_llm #machine_learning #multimodal #webapp
Stars: 284 Issues: 0 Forks: 7
https://github.com/lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
Language: JavaScript
#ai #artificial_intelligence #computer_vision #llama #llamacpp #llm #local_llm #machine_learning #multimodal #webapp
Stars: 284 Issues: 0 Forks: 7
https://github.com/lxe/llavavision
GitHub
GitHub - lxe/llavavision: A simple "Be My Eyes" web app with a llama.cpp/llava backend
A simple "Be My Eyes" web app with a llama.cpp/llava backend - lxe/llavavision
roboflow/awesome-openai-vision-api-experiments
Examples showing how to use the OpenAI vision API to run inference on images, video files and webcam streams
Language: Python
#chatgpt #computer_vision #openai
Stars: 439 Issues: 1 Forks: 20
https://github.com/roboflow/awesome-openai-vision-api-experiments
Examples showing how to use the OpenAI vision API to run inference on images, video files and webcam streams
Language: Python
#chatgpt #computer_vision #openai
Stars: 439 Issues: 1 Forks: 20
https://github.com/roboflow/awesome-openai-vision-api-experiments
GitHub
GitHub - roboflow/awesome-openai-vision-api-experiments: Must-have resource for anyone who wants to experiment with and build on…
Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥 - roboflow/awesome-openai-vision-api-experiments
❤1
spla-tam/SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Language: Python
#computer_vision #gaussian_splatting #robotics #slam
Stars: 270 Issues: 2 Forks: 20
https://github.com/spla-tam/SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
Language: Python
#computer_vision #gaussian_splatting #robotics #slam
Stars: 270 Issues: 2 Forks: 20
https://github.com/spla-tam/SplaTAM
GitHub
GitHub - spla-tam/SplaTAM: SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024) - spla-tam/SplaTAM
3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
Language: Python
#3d #aigc #computer_vision #generation
Stars: 165 Issues: 2 Forks: 2
https://github.com/3DTopia/OpenLRM
An open-source impl. of Large Reconstruction Models
Language: Python
#3d #aigc #computer_vision #generation
Stars: 165 Issues: 2 Forks: 2
https://github.com/3DTopia/OpenLRM
GitHub
GitHub - 3DTopia/OpenLRM: An open-source impl. of Large Reconstruction Models
An open-source impl. of Large Reconstruction Models - 3DTopia/OpenLRM
🔥1
robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs
A modern OCR engine (extracts text from images), written in Rust
Language: Rust
#computer_vision #machine_learning #ocr
Stars: 220 Issues: 3 Forks: 4
https://github.com/robertknight/ocrs
GitHub
GitHub - robertknight/ocrs: Rust library and CLI tool for OCR (extracting text from images)
Rust library and CLI tool for OCR (extracting text from images) - robertknight/ocrs
🥰1👏1