AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🪬 META's Animated Drawings is out! 🪬

👉#META unveils an easy-to-use method for animating human-like figures drawn by children.

😎Review https://bit.ly/3mGeQQv
😎Paper arxiv.org/pdf/2303.12741.pdf
😎Project fairanimateddrawings.com
🌻DDS: diffusive text-based image editing🌻

πŸ‘‰Google unveils a novel text-based image editing for modifications of an input image towards a text description.

😎Review https://bit.ly/3L52UBl
😎Paper arxiv.org/pdf/2304.07090.pdf
😎Project delta-denoising-score.github.io
🪅 Inpaint Anything: Segmentation + Inpainting 🪅

👉Remove / Fill / Replace anything (also via prompt). "Inpaint Anything" is a new paradigm of "clicking & filling".

😎Review https://bit.ly/43JNREE
😎Paper arxiv.org/pdf/2304.06790.pdf
😎Code github.com/geekyutao/Inpaint-Anything
Hi friends,
right now I'm flying to NY for a business trip!

👉 Is there anyone studying/working @NYU? I'd love to visit the campus and (possibly) attend a few lessons on AI/CV/MATH on Monday (or this Friday).

Send me a DM -> @argovision
❀15πŸ‘6🍾5🀯3🀩1
🔥 Track Anything: SAM-powered tracking 🔥

👉 SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM

😎Review https://bit.ly/44jwI4W
😎Paper arxiv.org/pdf/2304.11968.pdf
😎Code github.com/gaomingqi/Track-Anything
🌱 Segment Everything Everywhere 🌱

πŸ‘‰ Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)

😎Review https://bit.ly/3LEiOmx
😎Paper arxiv.org/pdf/2304.06718.pdf
😎Demo huggingface.co/spaces/xdecoder/SEEM
😎Code github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
🦒 Look mom, I'm a giraffe 🦒

👉 A patent for transposing adversarial patches onto knitted fabric: be undetectable, or be assigned an incorrect category such as "animal" (giraffe, zebra, etc.)

😎 More: https://bit.ly/3LzjSGV
❀20πŸ‘4🀩4πŸ”₯3πŸ’©3πŸ‘1😱1
🐊 RelPose++: SOTA 6D from 2-8 pics 🐊

πŸ‘‰CMU unveils a novel neural method for 6D camera poses from only 2-8 images

😎Review https://bit.ly/42ioJ6K
😎Paper arxiv.org/pdf/2305.04926.pdf
😎Project amyxlase.github.io/relpose-plus-plus
😎Code github.com/amyxlase/relpose-plus-plus
🦕 6D Non-Prehensile Manipulation 🦕

👉#META (+CMU) unveils HACMan, a novel approach to 6D non-prehensile manipulation of objects

😎Review https://bit.ly/3NP1jl1
😎Paper arxiv.org/pdf/2305.03942.pdf
😎Project hacman-2023.github.io
🛸 Virtual Occlusions in #AR 🛸

👉Niantic (#pokemongo) presents a novel approach that makes virtual assets appear ‘sitting among’ real-world objects

😎Review https://bit.ly/3o04wn6
😎Paper arxiv.org/pdf/2305.07014.pdf
😎Project nianticlabs.github.io/implicit-depth
😎Code github.com/nianticlabs/implicit-depth
🍿 De-Aging Harrison Ford via SD 🍿

πŸ‘‰Stable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπŸ‘‡

😎 More: https://bit.ly/41EzaQK
🪰 #3D Auto-Reconstruction 🪰

👉AutoRecon: automated discovery & reconstruction of objects from multi-view pics.

😎Review https://bit.ly/3MxI0f4
😎Paper arxiv.org/pdf/2305.08810.pdf
😎Project zju3dv.github.io/autorecon/
😎Code github.com/zju3dv/AutoRecon
👚 Multi-Layered 3D Garments Animation 👚

👉S-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind

😎Review https://bit.ly/435b42F
😎Paper arxiv.org/pdf/2305.10418.pdf
😎Project mmlab-ntu.github.io/project/layersnet
🎫 100% Mask-Free VIS 🎫

πŸ‘‰ETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.

😎Review https://bit.ly/3Wg7CQB
😎Paper arxiv.org/pdf/2303.15904.pdf
😎Project www.vis.xyz/pub/maskfreevis/
😎Code github.com/SysCV/maskfreevis
🀄 Drag-GAN: user-friendly image-manipulation 🀄

👉 Manual deforming of (real and generated) images over pose, shape, expression and layout.

😎Review https://bit.ly/3BFyXlR
😎Paper arxiv.org/pdf/2305.10973.pdf
😎Project vcai.mpi-inf.mpg.de/projects/DragGAN
😎Code github.com/XingangPan/DragGAN
🗺️ AI-generated stereotypical men 🗺️

👉A thread about generating a stereotypical person from each of 15 countries around the world. And yes, Italians love pizza.

😎 More https://bit.ly/3oo0t4c
🀣6❀3🀯1
🍢 AVOS Multiscale Encoder-Decoder ViT 🍢

πŸ‘‰ MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS

😎Review https://bit.ly/3MohFi1
😎Paper arxiv.org/pdf/2304.05930.pdf
😎Project rkyuca.github.io/medvt
😎Code github.com/rkyuca/medvt
🌊 Neural Dynamic Image-Based Rendering 🌊

πŸ‘‰ DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.

😎Review https://t.ly/90Kw
😎Paper arxiv.org/pdf/2211.11082.pdf
😎Project https://dynibar.github.io/
😎Code github.com/google/dynibar
❀9πŸ‘3πŸ₯°1🀯1
🦁 Open Semantic Segmentation 🦁

πŸ‘‰SSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch

😎Review https://t.ly/ZE9q
😎Paper arxiv.org/pdf/2305.17091.pdf
😎Code github.com/SegmentationBLWX/sssegmentation
🎗️ 4D Humans with Transformers 🎗️

👉Novel approach to reconstruct and track humans (even in unusual poses)

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2305.20091.pdf
😎Project shubham-goel.github.io/4dhumans/#
😎Code github.com/shubham-goel/4D-Humans