qunash/stable-diffusion-2-gui
Lightweight Stable Diffusion v 2.0 web UI: txt2img, img2img, inpaint, upscale4x
Language: Jupyter Notebook
#image_generation #stable_diffusion
Stars: 114 Issues: 0 Forks: 5
https://github.com/qunash/stable-diffusion-2-gui
jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer
kevmo314/magic-copy
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.
Language: TypeScript
#chrome_extension #computer_vision #image_processing
Stars: 375 Issues: 2 Forks: 24
https://github.com/kevmo314/magic-copy
OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
Zeqiang-Lai/DragGAN
Unofficial implementation of "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"
Language: Python
#draggan #image_editing #image_generation
Stars: 179 Issues: 4 Forks: 21
https://github.com/Zeqiang-Lai/DragGAN
axodox/axodox-machinelearning
This repository contains a C++ ONNX implementation of StableDiffusion.
Language: C++
#cpp #image_generation #mit_license #native #nuget #onnx #stable_diffusion
Stars: 241 Issues: 1 Forks: 8
https://github.com/axodox/axodox-machinelearning
Yujun-Shi/DragDiffusion
Official code for DragDiffusion
Language: Python
#artificial_intelligence #diffusion_models #dragdiffusion #draggan #image_editing
Stars: 288 Issues: 3 Forks: 23
https://github.com/Yujun-Shi/DragDiffusion
leejet/stable-diffusion.cpp
Stable Diffusion in pure C/C++
Language: C
#ai #cplusplus #diffusion #ggml #image_generation #latent_diffusion #stable_diffusion #text2image #txt2img
Stars: 238 Issues: 5 Forks: 12
https://github.com/leejet/stable-diffusion.cpp
dreamgaussian/dreamgaussian
Generative Gaussian Splatting for Efficient 3D Content Creation
Language: Python
#image_to_3d #text_to_3d
Stars: 307 Issues: 2 Forks: 17
https://github.com/dreamgaussian/dreamgaussian