This media is not supported in your browser
VIEW IN TELEGRAM
π₯¬ Diffusive Sketch-Guided Text-to-Image π₯¬
π#Google unveils a universal approach for T2I (pre-trained) diffusion: free-hand, saliency-guided, etc.
πReview https://bit.ly/3XFVMj2
πProject sketch-guided-diffusion.github.io/
πPaper sketch-guided-diffusion.github.io/files/sketch-guided-preprint.pdf
π#Google unveils a universal approach for T2I (pre-trained) diffusion: free-hand, saliency-guided, etc.
πReview https://bit.ly/3XFVMj2
πProject sketch-guided-diffusion.github.io/
πPaper sketch-guided-diffusion.github.io/files/sketch-guided-preprint.pdf
π€―4β‘1β€1π1π₯1π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯« Plug 'n' play self-checkout π₯«
π#Google's new shelf-checking #AI: recognizing billions of products, even purchased/moved
πReview https://bit.ly/3J58hQe
πNews https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail
π#Google's new shelf-checking #AI: recognizing billions of products, even purchased/moved
πReview https://bit.ly/3J58hQe
πNews https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail
π€―8π7
This media is not supported in your browser
VIEW IN TELEGRAM
πDREAMIX:General Diffusive Video Editorπ
π#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos
πReview https://bit.ly/3I3Hq6B
πPaper arxiv.org/pdf/2302.01329.pdf
πProject dreamix-video-editing.github.io/
π#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos
πReview https://bit.ly/3I3Hq6B
πPaper arxiv.org/pdf/2302.01329.pdf
πProject dreamix-video-editing.github.io/
π€―24π±3π2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¦ ReBotNet: Neural Enhancement π₯¦
π#Google unveils ReBotNet, novel real-time video enhancement for live video calls & streams
πReview https://bit.ly/3z8oqhG
πPaper arxiv.org/pdf/2303.13504.pdf
πProject jeya-maria-jose.github.io/rebotnet-web
π#Google unveils ReBotNet, novel real-time video enhancement for live video calls & streams
πReview https://bit.ly/3z8oqhG
πPaper arxiv.org/pdf/2303.13504.pdf
πProject jeya-maria-jose.github.io/rebotnet-web
π₯13π3β€2π₯°2π€©2β‘1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¦ Zip-NeRF: the Anti-Aliasing NeRF π₯¦
π#Google unveils a novel version of NeRF able to fix the aliasing problem being 22x faster in training than SOTA.
πReview https://bit.ly/3L1hZ6M
πPaper arxiv.org/pdf/2304.06706.pdf
πProject https://jonbarron.info/zipnerf
π#Google unveils a novel version of NeRF able to fix the aliasing problem being 22x faster in training than SOTA.
πReview https://bit.ly/3L1hZ6M
πPaper arxiv.org/pdf/2304.06706.pdf
πProject https://jonbarron.info/zipnerf
π€―13π₯4π3
This media is not supported in your browser
VIEW IN TELEGRAM
π Track Everything Everywhere π
π#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.
πReview https://t.ly/Krvw
πPaper arxiv.org/pdf/2306.05422.pdf
πProject omnimotion.github.io/
πDemo omnimotion.github.io/#interactive_demo
πCode github.com/qianqianwang68/omnimotion
π#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.
πReview https://t.ly/Krvw
πPaper arxiv.org/pdf/2306.05422.pdf
πProject omnimotion.github.io/
πDemo omnimotion.github.io/#interactive_demo
πCode github.com/qianqianwang68/omnimotion
π₯23β€5π€―3π€©1π©1
AI with Papers - Artificial Intelligence & Deep Learning
π Drag-GAN: user-friendly image-manipulation π π Manual deforming of (real and generated) images over pose, shape, expression and layout. πReview https://bit.ly/3BFyXlR πPaper arxiv.org/pdf/2305.10973.pdf πProject vcai.mpi-inf.mpg.de/projects/DragGANβ¦
Linkedin
π₯π₯ Source Code of Drag-GAN IS OUT! | Alessandro Ferrari | 40 comments
π₯π₯ Source Code of Drag-GAN IS OUT! π₯π₯
πManual deforming of (real and generated) images over pose, shape, expression and layout. Source Code just released a few hours ago π
ππ’π π‘π₯π’π π‘ππ¬:
β Max Planck + MIT + #Google AR/VR = π€―
β Supervising handle points to moveβ¦
πManual deforming of (real and generated) images over pose, shape, expression and layout. Source Code just released a few hours ago π
ππ’π π‘π₯π’π π‘ππ¬:
β Max Planck + MIT + #Google AR/VR = π€―
β Supervising handle points to moveβ¦
π₯25π±6β€3π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πΈ Computational Burst Photography in App πΈ
π#Google unveils a novel computational burst system to democratize the professional photography via smartphone
πReview https://t.ly/5ibJX
πPaper arxiv.org/pdf/2308.01379.pdf
πProject https://motion-mode.github.io
π#Google unveils a novel computational burst system to democratize the professional photography via smartphone
πReview https://t.ly/5ibJX
πPaper arxiv.org/pdf/2308.01379.pdf
πProject https://motion-mode.github.io
π₯6π₯°3π2π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Lumiere: SOTA video-genπ₯
π#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA, tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.
πReview https://t.ly/nalJR
πPaper https://lnkd.in/d-PvrGjT
πProject https://t.ly/gK8hz
π#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA, tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.
πReview https://t.ly/nalJR
πPaper https://lnkd.in/d-PvrGjT
πProject https://t.ly/gK8hz
π₯18β€4π3π2π€©2π₯°1π€―1π©1
π§ 350+ Free #AI Courses by #Googleπ§
π350+ free courses from #Google to become professional in #AI & #Cloud. The full catalog (900+) includes a variety of activity: videos, documents, labs, coding, and quizzes. 15+ supported languages. No excuse.
β πππ§ππ«πππ’π―π ππ
β ππ§ππ«π¨ ππ¨ ππππ¬
β ππ π°π’ππ‘ ππ
β ππππ, ππ, ππ
β πππ¬π©π¨π§π¬π’ππ₯π ππ
πReview: https://t.ly/517Dr
πFull list: https://www.cloudskillsboost.google/catalog?page=1
π350+ free courses from #Google to become professional in #AI & #Cloud. The full catalog (900+) includes a variety of activity: videos, documents, labs, coding, and quizzes. 15+ supported languages. No excuse.
β πππ§ππ«πππ’π―π ππ
β ππ§ππ«π¨ ππ¨ ππππ¬
β ππ π°π’ππ‘ ππ
β ππππ, ππ, ππ
β πππ¬π©π¨π§π¬π’ππ₯π ππ
πReview: https://t.ly/517Dr
πFull list: https://www.cloudskillsboost.google/catalog?page=1
β€13π3π2πΎ2π₯1
This media is not supported in your browser
VIEW IN TELEGRAM
π Graph Neural Network in TF π
π#Google TensorFlow-GNN: novel library to build Graph Neural Networks on TensorFlow. Source Code released under Apache 2.0 license π
πReview https://t.ly/TQfg-
πCode github.com/tensorflow/gnn
πBlog blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html
π#Google TensorFlow-GNN: novel library to build Graph Neural Networks on TensorFlow. Source Code released under Apache 2.0 license π
πReview https://t.ly/TQfg-
πCode github.com/tensorflow/gnn
πBlog blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html
β€17π4π1
This media is not supported in your browser
VIEW IN TELEGRAM
βοΈ One2Avatar: Pic -> 3D Avatar βοΈ
π#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results.
πReview https://t.ly/AS1oc
πPaper arxiv.org/pdf/2402.11909.pdf
πProject zhixuany.github.io/one2avatar_webpage/
π#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results.
πReview https://t.ly/AS1oc
πPaper arxiv.org/pdf/2402.11909.pdf
πProject zhixuany.github.io/one2avatar_webpage/
π12β€3π€©3π₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ BOG: Fine Geometric Views πͺ
π #Google (+TΓΌbingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar).
πReview https://t.ly/E6T0W
πPaper https://lnkd.in/dQEq3zy6
πProject https://lnkd.in/dYYCadx9
πDemo https://lnkd.in/d92R6QME
π #Google (+TΓΌbingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar).
πReview https://t.ly/E6T0W
πPaper https://lnkd.in/dQEq3zy6
πProject https://lnkd.in/dYYCadx9
πDemo https://lnkd.in/d92R6QME
π₯8π€―4π3π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ ObjectDrop: automagical objects removal π¦
π#Google unveils ObjectDrop, the new SOTA in photorealistic object removal and insertion. Focus on shadows and reflections, impressive!
πReview https://t.ly/ZJ6NN
πPaper https://arxiv.org/pdf/2403.18818.pdf
πProject https://objectdrop.github.io/
π#Google unveils ObjectDrop, the new SOTA in photorealistic object removal and insertion. Focus on shadows and reflections, impressive!
πReview https://t.ly/ZJ6NN
πPaper https://arxiv.org/pdf/2403.18818.pdf
πProject https://objectdrop.github.io/
π14π€―8β€4π₯3πΎ2
π¦ Hyper-Detailed Image Descriptions π¦
π#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process
πReview https://t.ly/engkl
πPaper arxiv.org/pdf/2405.02793
πRepo github.com/google/imageinwords
πProject google.github.io/imageinwords
πData huggingface.co/datasets/google/imageinwords
π#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process
πReview https://t.ly/engkl
πPaper arxiv.org/pdf/2405.02793
πRepo github.com/google/imageinwords
πProject google.github.io/imageinwords
πData huggingface.co/datasets/google/imageinwords
β€11π₯3π2π€―2πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
πOmniGlue: Foundation Matcherπ
π#Google OmniGlue from #CVPR24: the first learnable image matcher powered by foundation models. Impressive OOD results!
πReview https://t.ly/ezaIc
πPaper https://arxiv.org/pdf/2405.12979
πProject hwjiang1510.github.io/OmniGlue/
πCode https://github.com/google-research/omniglue/
π#Google OmniGlue from #CVPR24: the first learnable image matcher powered by foundation models. Impressive OOD results!
πReview https://t.ly/ezaIc
πPaper https://arxiv.org/pdf/2405.12979
πProject hwjiang1510.github.io/OmniGlue/
πCode https://github.com/google-research/omniglue/
π€―10β€6π2π1
This media is not supported in your browser
VIEW IN TELEGRAM
π SOTA Multi-Garment VTOn Editing π
π#Google (+UWA) unveils M&M VTO, novel mix 'n' match virtual try-on that takes as input multiple garment images, text description for garment layout and an image of a person. It's the new SOTA both qualitatively and quantitatively. Impressive results!
πReview https://t.ly/66mLN
πPaper arxiv.org/pdf/2406.04542
πProject https://mmvto.github.io
π#Google (+UWA) unveils M&M VTO, novel mix 'n' match virtual try-on that takes as input multiple garment images, text description for garment layout and an image of a person. It's the new SOTA both qualitatively and quantitatively. Impressive results!
πReview https://t.ly/66mLN
πPaper arxiv.org/pdf/2406.04542
πProject https://mmvto.github.io
π4β€3π₯°3π₯1π€―1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯₯ OmniNOCS: largest 3D NOCS π₯₯
πOmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0π
πReview https://t.ly/xPgBn
πPaper arxiv.org/pdf/2407.08711
πProject https://omninocs.github.io/
πData github.com/google-deepmind/omninocs
πOmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0π
πReview https://t.ly/xPgBn
πPaper arxiv.org/pdf/2407.08711
πProject https://omninocs.github.io/
πData github.com/google-deepmind/omninocs
π₯4β€3π2π1π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ Diffusion Models for Transparency πͺ
πMIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects like roughness, metallic, albedo & transparency in real images. Amazing work but code not announcedπ₯Ί
πReview https://t.ly/U98_G
πPaper arxiv.org/pdf/2312.02970
πProject www.prafullsharma.net/alchemist/
πMIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects like roughness, metallic, albedo & transparency in real images. Amazing work but code not announcedπ₯Ί
πReview https://t.ly/U98_G
πPaper arxiv.org/pdf/2312.02970
πProject www.prafullsharma.net/alchemist/
π₯17π4β‘1β€1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πΊ Diffusion Game Engine πΊ
π#Google unveils GameNGen: the first game engine powered entirely by a neural #AI that enables real-time interaction with a complex environment over long trajectories at HQ. No code announced but I love it π
πReview https://t.ly/_WR5z
πPaper https://lnkd.in/dZqgiqb9
πProject https://lnkd.in/dJUd2Fr6
π#Google unveils GameNGen: the first game engine powered entirely by a neural #AI that enables real-time interaction with a complex environment over long trajectories at HQ. No code announced but I love it π
πReview https://t.ly/_WR5z
πPaper https://lnkd.in/dZqgiqb9
πProject https://lnkd.in/dJUd2Fr6
π₯10π5β€2π1