GitHub repos

qunash/stable-diffusion-2-gui
Lightweight Stable Diffusion v 2.0 web UI: txt2img, img2img, inpaint, upscale4x
Language: Jupyter Notebook
#image_generation #stable_diffusion
Stars: 114 Issues: 0 Forks: 5
https://github.com/qunash/stable-diffusion-2-gui

GitHub

GitHub - qunash/stable-diffusion-2-gui: Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x.

Lightweight Stable Diffusion v 2.1 web UI: txt2img, img2img, depth2img, inpaint and upscale4x. - qunash/stable-diffusion-2-gui

2.3K views23:01

GitHub repos

jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller

GitHub

GitHub - jaketae/storyteller: Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech - jaketae/storyteller

2.4K views23:03

GitHub repos

NVlabs/prismer
The implementation of "Prismer: A Vision-Language Model with An Ensemble of Experts".
Language: Python
#image_captioning #language_model #multi_modal_learning #multi_task_learning #vision_and_language #vision_language_model #vqa
Stars: 479 Issues: 6 Forks: 21
https://github.com/NVlabs/prismer

GitHub

GitHub - NVlabs/prismer: The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts". - NVlabs/prismer

3.5K views23:07

GitHub repos

kevmo314/magic-copy
Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard.
Language: TypeScript
#chrome_extension #computer_vision #image_processing
Stars: 375 Issues: 2 Forks: 24
https://github.com/kevmo314/magic-copy

GitHub

GitHub - kevmo314/magic-copy: Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground…

Magic Copy is a Chrome extension that uses Meta's Segment Anything Model to extract a foreground object from an image and copy it to the clipboard. - kevmo314/magic-copy

2.2K views10:09

GitHub repos

OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat

GitHub

GitHub - OpenGVLab/InternGPT: InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now…

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin...

2.3K views22:10

GitHub repos

Zeqiang-Lai/DragGAN
Unofficial implementation of "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold"
Language: Python
#draggan #image_editing #image_generation
Stars: 179 Issues: 4 Forks: 21
https://github.com/Zeqiang-Lai/DragGAN

GitHub

GitHub - OpenGVLab/DragGAN: Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the…

Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" （DragGAN 全功能实现，在线Demo，本地部署试用，代码、模型已全部开源，支持Windows, ma...

2.2K views04:11

GitHub repos

axodox/axodox-machinelearning
This repository contains a C++ ONNX implementation of StableDiffusion.
Language: C++
#cpp #image_generation #mit_license #native #nuget #onnx #stable_diffusion
Stars: 241 Issues: 1 Forks: 8
https://github.com/axodox/axodox-machinelearning

GitHub

GitHub - axodox/axodox-machinelearning: This repository contains a pure C++ ONNX implementation of multiple offline AI models,…

This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, Midas, HED and OpenPose. - axodox/axodox-machinelearning

2.8K views16:12

GitHub repos

Yujun-Shi/DragDiffusion
Official code for DragDiffusion
Language: Python
#artificial_intelligence #diffusion_models #dragdiffusion #draggan #image_editing
Stars: 288 Issues: 3 Forks: 23
https://github.com/Yujun-Shi/DragDiffusion

GitHub

GitHub - Yujun-Shi/DragDiffusion: [CVPR2024, Highlight] Official code for DragDiffusion

[CVPR2024, Highlight] Official code for DragDiffusion - Yujun-Shi/DragDiffusion

3.0K views16:14

GitHub repos

leejet/stable-diffusion.cpp
Stable Diffusion in pure C/C++
Language: C
#ai #cplusplus #diffusion #ggml #image_generation #latent_diffusion #stable_diffusion #text2image #txt2img
Stars: 238 Issues: 5 Forks: 12
https://github.com/leejet/stable-diffusion.cpp

GitHub

GitHub - leejet/stable-diffusion.cpp: Stable Diffusion and Flux in pure C/C++

Stable Diffusion and Flux in pure C/C++. Contribute to leejet/stable-diffusion.cpp development by creating an account on GitHub.

2.7K views04:16

GitHub repos

dreamgaussian/dreamgaussian
Generative Gaussian Splatting for Efficient 3D Content Creation
Language: Python
#image_to_3d #text_to_3d
Stars: 307 Issues: 2 Forks: 17
https://github.com/dreamgaussian/dreamgaussian

GitHub

GitHub - dreamgaussian/dreamgaussian: [ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation - dreamgaussian/dreamgaussian

2.1K views10:19

About

Blog

Apps

Platform