AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
237 videos
11 files
1.27K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¬ Diffusive Sketch-Guided Text-to-Image πŸ₯¬

πŸ‘‰#Google unveils a universal approach for T2I (pre-trained) diffusion: free-hand, saliency-guided, etc.

😎Review https://bit.ly/3XFVMj2
😎Project sketch-guided-diffusion.github.io/
😎Paper sketch-guided-diffusion.github.io/files/sketch-guided-preprint.pdf
🀯4⚑1❀1πŸ‘1πŸ”₯1πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯« Plug 'n' play self-checkout πŸ₯«

πŸ‘‰#Google's new shelf-checking #AI: recognizing billions of products, even purchased/moved

😎Review https://bit.ly/3J58hQe
😎News https://cloud.google.com/blog/transform/nrf-2023-google-cloud-big-show-big-moment-hybrid-retail
🀯8πŸ‘7
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“DREAMIX:General Diffusive Video EditorπŸ“

πŸ‘‰#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/
🀯24😱3πŸ‘2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¦ ReBotNet: Neural Enhancement πŸ₯¦

πŸ‘‰#Google unveils ReBotNet, novel real-time video enhancement for live video calls & streams

😎Review https://bit.ly/3z8oqhG
😎Paper arxiv.org/pdf/2303.13504.pdf
😎Project jeya-maria-jose.github.io/rebotnet-web
πŸ”₯13πŸ‘3❀2πŸ₯°2🀩2⚑1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¦ Zip-NeRF: the Anti-Aliasing NeRF πŸ₯¦

πŸ‘‰#Google unveils a novel version of NeRF able to fix the aliasing problem being 22x faster in training than SOTA.

😎Review https://bit.ly/3L1hZ6M
😎Paper arxiv.org/pdf/2304.06706.pdf
😎Project https://jonbarron.info/zipnerf
🀯13πŸ”₯4πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 Track Everything Everywhere 🌈

πŸ‘‰#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion
πŸ”₯23❀5🀯3🀩1πŸ’©1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“Έ Computational Burst Photography in App πŸ“Έ

πŸ‘‰#Google unveils a novel computational burst system to democratize the professional photography via smartphone

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io
πŸ”₯6πŸ₯°3πŸ‘2🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Lumiere: SOTA video-genπŸ”₯

πŸ‘‰#Google unveils Lumiere: Space-Time Diffusion Model for Realistic Video Generation. It's the new SOTA, tasks: Text-to-Video, Video Stylization, Cinemagraphs & Video Inpainting.

πŸ‘‰Review https://t.ly/nalJR
πŸ‘‰Paper https://lnkd.in/d-PvrGjT
πŸ‘‰Project https://t.ly/gK8hz
πŸ”₯18❀4πŸ‘3πŸ‘2🀩2πŸ₯°1🀯1πŸ’©1
🧠350+ Free #AI Courses by #Google🧠

πŸ‘‰350+ free courses from #Google to become professional in #AI & #Cloud. The full catalog (900+) includes a variety of activity: videos, documents, labs, coding, and quizzes. 15+ supported languages. No excuse.

βœ…π†πžπ§πžπ«πšπ­π’π―πž π€πˆ
βœ…πˆπ§π­π«π¨ 𝐭𝐨 π‹π‹πŒπ¬
βœ…π‚π• 𝐰𝐒𝐭𝐑 𝐓𝐅
βœ…πƒπšπ­πš, πŒπ‹, π€πˆ
βœ…π‘πžπ¬π©π¨π§π¬π’π›π₯𝐞 π€πˆ

πŸ‘‰Review: https://t.ly/517Dr
πŸ‘‰Full list: https://www.cloudskillsboost.google/catalog?page=1
❀13πŸ‘3πŸ‘2🍾2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‡ Graph Neural Network in TF πŸ‡

πŸ‘‰#Google TensorFlow-GNN: novel library to build Graph Neural Networks on TensorFlow. Source Code released under Apache 2.0 license πŸ’™

πŸ‘‰Review https://t.ly/TQfg-
πŸ‘‰Code github.com/tensorflow/gnn
πŸ‘‰Blog blog.research.google/2024/02/graph-neural-networks-in-tensorflow.html
❀17πŸ‘4πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
β˜€οΈ One2Avatar: Pic -> 3D Avatar β˜€οΈ

πŸ‘‰#Google presents a new approach to generate animatable photo-realistic avatars from only a few/one image. Impressive results.

πŸ‘‰Review https://t.ly/AS1oc
πŸ‘‰Paper arxiv.org/pdf/2402.11909.pdf
πŸ‘‰Project zhixuany.github.io/one2avatar_webpage/
πŸ‘12❀3🀩3πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺŸ BOG: Fine Geometric Views πŸͺŸ

πŸ‘‰ #Google (+TΓΌbingen) unveils Binary Opacity Grids, a novel method to reconstruct triangle meshes from multi-view images able to capture fine geometric detail such as leaves, branches & grass. New SOTA, real-time on Google Pixel 8 Pro (and similar).

πŸ‘‰Review https://t.ly/E6T0W
πŸ‘‰Paper https://lnkd.in/dQEq3zy6
πŸ‘‰Project https://lnkd.in/dYYCadx9
πŸ‘‰Demo https://lnkd.in/d92R6QME
πŸ”₯8🀯4πŸ‘3πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’¦ ObjectDrop: automagical objects removal πŸ’¦

πŸ‘‰#Google unveils ObjectDrop, the new SOTA in photorealistic object removal and insertion. Focus on shadows and reflections, impressive!

πŸ‘‰Review https://t.ly/ZJ6NN
πŸ‘‰Paper https://arxiv.org/pdf/2403.18818.pdf
πŸ‘‰Project https://objectdrop.github.io/
πŸ‘14🀯8❀4πŸ”₯3🍾2
πŸ¦‘ Hyper-Detailed Image Descriptions πŸ¦‘

πŸ‘‰#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process

πŸ‘‰Review https://t.ly/engkl
πŸ‘‰Paper arxiv.org/pdf/2405.02793
πŸ‘‰Repo github.com/google/imageinwords
πŸ‘‰Project google.github.io/imageinwords
πŸ‘‰Data huggingface.co/datasets/google/imageinwords
❀11πŸ”₯3πŸ‘2🀯2🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€OmniGlue: Foundation MatcherπŸ€

πŸ‘‰#Google OmniGlue from #CVPR24: the first learnable image matcher powered by foundation models. Impressive OOD results!

πŸ‘‰Review https://t.ly/ezaIc
πŸ‘‰Paper https://arxiv.org/pdf/2405.12979
πŸ‘‰Project hwjiang1510.github.io/OmniGlue/
πŸ‘‰Code https://github.com/google-research/omniglue/
🀯10❀6πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘— SOTA Multi-Garment VTOn Editing πŸ‘—

πŸ‘‰#Google (+UWA) unveils M&M VTO, novel mix 'n' match virtual try-on that takes as input multiple garment images, text description for garment layout and an image of a person. It's the new SOTA both qualitatively and quantitatively. Impressive results!

πŸ‘‰Review https://t.ly/66mLN
πŸ‘‰Paper arxiv.org/pdf/2406.04542
πŸ‘‰Project https://mmvto.github.io
πŸ‘4❀3πŸ₯°3πŸ”₯1🀯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯₯ OmniNOCS: largest 3D NOCS πŸ₯₯

πŸ‘‰OmniNOCS by #Google (+Georgia) is a unified NOCS (Normalized Object Coordinate Space) dataset that contains data across different domains with 90+ object classes. The largest NOCS dataset to date. Data & Code available under Apache 2.0πŸ’™

πŸ‘‰Review https://t.ly/xPgBn
πŸ‘‰Paper arxiv.org/pdf/2407.08711
πŸ‘‰Project https://omninocs.github.io/
πŸ‘‰Data github.com/google-deepmind/omninocs
πŸ”₯4❀3πŸ‘2πŸ‘1πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ„ Diffusion Models for Transparency πŸͺ„

πŸ‘‰MIT (+ #Google) unveils Alchemist, a novel method to control material attributes of objects like roughness, metallic, albedo & transparency in real images. Amazing work but code not announcedπŸ₯Ί

πŸ‘‰Review https://t.ly/U98_G
πŸ‘‰Paper arxiv.org/pdf/2312.02970
πŸ‘‰Project www.prafullsharma.net/alchemist/
πŸ”₯17πŸ‘4⚑1❀1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐺 Diffusion Game Engine 🐺

πŸ‘‰#Google unveils GameNGen: the first game engine powered entirely by a neural #AI that enables real-time interaction with a complex environment over long trajectories at HQ. No code announced but I love it πŸ’™

πŸ‘‰Review https://t.ly/_WR5z
πŸ‘‰Paper https://lnkd.in/dZqgiqb9
πŸ‘‰Project https://lnkd.in/dJUd2Fr6
πŸ”₯10πŸ‘5❀2πŸ‘1