AI with Papers - Artificial Intelligence & Deep Learning

🥬 Diffusive Sketch-Guided Text-to-Image 🥬

👉#Google unveils a universal approach for T2I (pre-trained) diffusion: free-hand, saliency-guided, etc.

😎Review https://bit.ly/3XFVMj2
😎Project sketch-guided-diffusion.github.io/
😎Paper sketch-guided-diffusion.github.io/files/sketch-guided-preprint.pdf

🤯4⚡1❤1👍1🔥1🥰1

3.56K views07:51

This media is not supported in your browser

VIEW IN TELEGRAM

🥫 Plug 'n' play self-checkout 🥫

👉#Google's new shelf-checking #AI: recognizing billions of products, even purchased/moved

😎Review https://bit.ly/3J58hQe
😎News https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail

🤯8👍7

4.12K views08:42

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐓DREAMIX:General Diffusive Video Editor🐓

👉#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/

🤯24😱3👍2❤1

5.2K viewsedited 07:34

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥦 ReBotNet: Neural Enhancement 🥦

👉#Google unveils ReBotNet, novel real-time video enhancement for live video calls & streams

😎Review https://bit.ly/3z8oqhG
😎Paper arxiv.org/pdf/2303.13504.pdf
😎Project jeya-maria-jose.github.io/rebotnet-web

🔥13👍3❤2🥰2🤩2⚡1

5.69K viewsedited 08:05

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥦 Zip-NeRF: the Anti-Aliasing NeRF 🥦

👉#Google unveils a novel version of NeRF able to fix the aliasing problem being 22x faster in training than SOTA.

😎Review https://bit.ly/3L1hZ6M
😎Paper arxiv.org/pdf/2304.06706.pdf
😎Project https://jonbarron.info/zipnerf

🤯13🔥4👍3

5.51K views12:11

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌈 Track Everything Everywhere 🌈

👉#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion

🔥23❤5🤯3🤩1💩1

7.97K views07:06

AI with Papers - Artificial Intelligence & Deep Learning

🀄 Drag-GAN: user-friendly image-manipulation 🀄 👉 Manual deforming of (real and generated) images over pose, shape, expression and layout. 😎Review https://bit.ly/3BFyXlR 😎Paper arxiv.org/pdf/2305.10973.pdf 😎Project vcai.mpi-inf.mpg.de/projects/DragGAN…

🔥🔥 Source Code IS OUT! 🔥🔥

😎 More: https://t.ly/ZddLl

🔥🔥 Source Code of Drag-GAN IS OUT! | Alessandro Ferrari | 40 comments

🔥🔥 Source Code of Drag-GAN IS OUT! 🔥🔥

👉Manual deforming of (real and generated) images over pose, shape, expression and layout. Source Code just released a few hours ago 👇

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Max Planck + MIT + #Google AR/VR = 🤯
✅Supervising handle points to move…

🔥25😱6❤3🥰1🤯1

6.98K viewsedited 07:20

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

📸 Computational Burst Photography in App 📸

👉#Google unveils a novel computational burst system to democratize the professional photography via smartphone

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io

🔥6🥰3👍2🤩1

4.77K views07:12

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Lumiere: SOTA video-gen🔥

👉#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA, tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.

👉Review https://t.ly/nalJR
👉Paper https://lnkd.in/d-PvrGjT
👉Project https://t.ly/gK8hz

🔥18❤4👍3👏2🤩2🥰1🤯1💩1

6.52K viewsedited 07:45

AI with Papers - Artificial Intelligence & Deep Learning

🧠350+ Free #AI Courses by #Google🧠

👉350+ free courses from #Google to become professional in #AI & #Cloud. The full catalog (900+) includes a variety of activity: videos, documents, labs, coding, and quizzes. 15+ supported languages. No excuse.

✅𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈
✅𝐈𝐧𝐭𝐫𝐨 𝐭𝐨 𝐋𝐋𝐌𝐬
✅𝐂𝐕 𝐰𝐢𝐭𝐡 𝐓𝐅
✅𝐃𝐚𝐭𝐚, 𝐌𝐋, 𝐀𝐈
✅𝐑𝐞𝐬𝐩𝐨𝐧𝐬𝐢𝐛𝐥𝐞 𝐀𝐈

👉Review: https://t.ly/517Dr
👉Full list: https://www.cloudskillsboost.google/catalog?page=1

❤13👍3👏2🍾2🔥1

6.92K viewsedited 12:47

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍇 Graph Neural Network in TF 🍇

👉#Google TensorFlow-GNN: novel library to build Graph Neural Networks on TensorFlow. Source Code released under Apache 2.0 license 💙

👉Review https://t.ly/TQfg-
👉Code github.com/tensorflow/gnn
👉Blog blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html

❤17👍4👏1

8.55K viewsedited 08:24

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

☀️ One2Avatar: Pic -> 3D Avatar ☀️

👉#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results.

👉Review https://t.ly/AS1oc
👉Paper arxiv.org/pdf/2402.11909.pdf
👉Project zhixuany.github.io/one2avatar_webpage/

👏12❤3🤩3🔥2

7.71K views07:55

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪟 BOG: Fine Geometric Views 🪟

👉 #Google (+Tübingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar).

👉Review https://t.ly/E6T0W
👉Paper https://lnkd.in/dQEq3zy6
👉Project https://lnkd.in/dYYCadx9
👉Demo https://lnkd.in/d92R6QME

🔥8🤯4👏3🥰1

8.2K viewsedited 14:42

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💦 ObjectDrop: automagical objects removal 💦

👉#Google unveils ObjectDrop, the new SOTA in photorealistic object removal and insertion. Focus on shadows and reflections, impressive!

👉Review https://t.ly/ZJ6NN
👉Paper https://arxiv.org/pdf/2403.18818.pdf
👉Project https://objectdrop.github.io/

👍14🤯8❤4🔥3🍾2

8.16K views14:18

AI with Papers - Artificial Intelligence & Deep Learning

🦑 Hyper-Detailed Image Descriptions 🦑

👉#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process

👉Review https://t.ly/engkl
👉Paper arxiv.org/pdf/2405.02793
👉Repo github.com/google/imageinwords
👉Project google.github.io/imageinwords
👉Data huggingface.co/datasets/google/imageinwords

❤11🔥3👍2🤯2🍾1

7.94K viewsedited 16:01

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍀OmniGlue: Foundation Matcher🍀

👉#Google OmniGlue from #CVPR24: the first learnable image matcher powered by foundation models. Impressive OOD results!

👉Review https://t.ly/ezaIc
👉Paper https://arxiv.org/pdf/2405.12979
👉Project hwjiang1510.github.io/OmniGlue/
👉Code https://github.com/google-research/omniglue/

🤯10❤6👍2👏1

8.66K viewsedited 06:39

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👗 SOTA Multi-Garment VTOn Editing 👗

👉#Google (+UWA) unveils M&M VTO, novel mix 'n' match virtual try-on that takes as input multiple garment images, text description for garment layout and an image of a person. It's the new SOTA both qualitatively and quantitatively. Impressive results!

👉Review https://t.ly/66mLN
👉Paper arxiv.org/pdf/2406.04542
👉Project https://mmvto.github.io

👍4❤3🥰3🔥1🤯1😱1

8.03K views06:51

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥥 OmniNOCS: largest 3D NOCS 🥥

👉OmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0💙

👉Review https://t.ly/xPgBn
👉Paper arxiv.org/pdf/2407.08711
👉Project https://omninocs.github.io/
👉Data github.com/google-deepmind/omninocs

🔥4❤3👏2👍1🥰1🤯1

7.14K views05:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪄 Diffusion Models for Transparency 🪄

👉MIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects like roughness, metallic, albedo & transparency in real images. Amazing work but code not announced🥺

👉Review https://t.ly/U98_G
👉Paper arxiv.org/pdf/2312.02970
👉Project www.prafullsharma.net/alchemist/

🔥17👍4⚡1❤1🤯1

9.33K views11:56

AI with Papers - Artificial Intelligence & Deep Learning

0:05

This media is not supported in your browser

VIEW IN TELEGRAM

🐺 Diffusion Game Engine 🐺

👉#Google unveils GameNGen: the first game engine powered entirely by a neural #AI that enables real-time interaction with a complex environment over long trajectories at HQ. No code announced but I love it 💙

👉Review https://t.ly/_WR5z
👉Paper https://lnkd.in/dZqgiqb9
👉Project https://lnkd.in/dJUd2Fr6

🔥10👍5❤2👏1

9.84K views06:28

About

Blog

Apps

Platform