LiheYoung/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Language: Python
#depth_estimation #image_synthesis #metric_depth_estimation #monocular_depth_estimation
Stars: 1116 Issues: 11 Forks: 62
https://github.com/LiheYoung/Depth-Anything
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Language: Python
#depth_estimation #image_synthesis #metric_depth_estimation #monocular_depth_estimation
Stars: 1116 Issues: 11 Forks: 62
https://github.com/LiheYoung/Depth-Anything
GitHub
GitHub - LiheYoung/Depth-Anything: [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model…
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation - LiheYoung/Depth-Anything
YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
GitHub
GitHub - YangLing0818/RPG-DiffusionMaster: [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating…
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) - YangLing0818/RPG-DiffusionMaster
3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
GitHub
GitHub - 3DTopia/LGM: [ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. - 3DTopia/LGM
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
GitHub
GitHub - mayuelala/FollowYourClick: [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…
[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Foll...
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
Language: Python
#auto_regressive_model #diffusion_models #image_generation #transformers
Stars: 440 Issues: 6 Forks: 10
https://github.com/FoundationVision/VAR
GitHub
GitHub - FoundationVision/VAR: [NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl.…
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction&...
AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
Language: Python
#3d_aigc #aigc #image_to_3d
Stars: 262 Issues: 4 Forks: 12
https://github.com/AiuniAI/Unique3D
GitHub
GitHub - AiuniAI/Unique3D: [NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[NeurIPS 2024] Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image - AiuniAI/Unique3D
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
GitHub
GitHub - fudan-generative-vision/hallo: Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation - fudan-generative-vision/hallo
gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery/.
Language: TypeScript
#ai #album #gpt_4o_mini #haiku #image #llm #rag
Stars: 272 Issues: 1 Forks: 23
https://github.com/gcui-art/album-ai
GitHub
GitHub - gcui-art/album-ai: AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery.
AI-First Album: Chat with your gallery using plain language! LLM Vision + RAG + Album/Gallery. - gcui-art/album-ai
C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher
This is a python implementation for stitching images.
Language: Jupyter Notebook
#image_analysis #images #python
Stars: 190 Issues: 0 Forks: 4
https://github.com/C-Naoki/image-stitcher
GitHub
GitHub - C-Naoki/image-stitcher: This is a python implementation for stitching images.
This is a python implementation for stitching images. - C-Naoki/image-stitcher
facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything
Official implementation of the paper "Watermark Anything with Localized Messages"
Language: Jupyter Notebook
#image #watermarking
Stars: 450 Issues: 0 Forks: 6
https://github.com/facebookresearch/watermark-anything
GitHub
GitHub - facebookresearch/watermark-anything: Official implementation of the paper "Watermark Anything with Localized Messages"
Official implementation of the paper "Watermark Anything with Localized Messages" - facebookresearch/watermark-anything