AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Hi friends,
right now I'm flying to NY for a business trip!

👉 Is there anyone studying/working @NYU? I'd love to visit the campus and (possibly) attend a few lectures on AI/CV/MATH on Monday (or this Friday)

Send me a DM -> @argovision
❤15👍6🍾5🤯3🤩1
🔥 Track Anything: SAM-powered tracking 🔥

👉 SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM

😎Review https://bit.ly/44jwI4W
😎Paper arxiv.org/pdf/2304.11968.pdf
😎Code github.com/gaomingqi/Track-Anything
🔥17👍4🤯2😱2🥰1
🌱 Segment Everything Everywhere 🌱

👉 Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)

😎Review https://bit.ly/3LEiOmx
😎Paper arxiv.org/pdf/2304.06718.pdf
😎Demo huggingface.co/spaces/xdecoder/SEEM
😎Code github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
🔥13❤4🤯1🤩1
🦒 Look mom, I'm a giraffe 🦒

👉 A patent for transposing adversarial patches onto a knitted fabric, so that the wearer goes undetected or is assigned an incorrect category such as "animal" (giraffe, zebra, etc.)

😎 More: https://bit.ly/3LzjSGV
❤20👍4🤩4🔥3💩3👏1😱1
🐊 RelPose++: SOTA 6D from 2-8 pics 🐊

👉CMU unveils a novel neural method for estimating 6D camera poses from only 2-8 images

😎Review https://bit.ly/42ioJ6K
😎Paper arxiv.org/pdf/2305.04926.pdf
😎Project amyxlase.github.io/relpose-plus-plus
😎Code github.com/amyxlase/relpose-plus-plus
🔥16🤩1
🦕 6D Non-Prehensile Manipulation 🦕

👉#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects

😎Review https://bit.ly/3NP1jl1
😎Paper arxiv.org/pdf/2305.03942.pdf
😎Project hacman-2023.github.io
👍6🔥4🤯3😱1
🛸 Virtual Occlusions in #AR 🛸

👉Niantic (#pokemongo) presents a novel approach that makes virtual assets appear to be ‘sitting among’ real-world objects

😎Review https://bit.ly/3o04wn6
😎Paper arxiv.org/pdf/2305.07014.pdf
😎Project nianticlabs.github.io/implicit-depth
😎Code github.com/nianticlabs/implicit-depth
🔥11🤯5👍3⚡1🤩1
🍿 De-Aging Harrison Ford via SD 🍿

👉Stable Diffusion for Hollywood: a preview of the next autotune of the entertainment industry. A discussion👇

😎 More: https://bit.ly/41EzaQK
🤯19🔥9👍6💩3⚡1😱1
🪰 #3D Auto-Reconstruction 🪰

👉AutoRecon: automated discovery & reconstruction of objects from multi-view pics.

😎Review https://bit.ly/3MxI0f4
😎Paper arxiv.org/pdf/2305.08810.pdf
😎Project zju3dv.github.io/autorecon/
😎Code github.com/zju3dv/AutoRecon
🔥11❤4🤯3🥰1
👚 Multi-Layered 3D Garments Animation 👚

👉S-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind

😎Review https://bit.ly/435b42F
😎Paper arxiv.org/pdf/2305.10418.pdf
😎Project mmlab-ntu.github.io/project/layersnet
🔥6😱2❤1👍1
🎫 100% Mask-Free VIS 🎫

👉ETH Zurich unveils MaskFreeVIS: novel high-performing video instance segmentation (VIS) without any mask annotations.

😎Review https://bit.ly/3Wg7CQB
😎Paper arxiv.org/pdf/2303.15904.pdf
😎Project www.vis.xyz/pub/maskfreevis/
😎Code github.com/SysCV/maskfreevis
🔥6👍4🤯2❤1😱1
🀄 Drag-GAN: user-friendly image-manipulation 🀄

👉 Manually deforming (real and generated) images to control pose, shape, expression and layout.

😎Review https://bit.ly/3BFyXlR
😎Paper arxiv.org/pdf/2305.10973.pdf
😎Project vcai.mpi-inf.mpg.de/projects/DragGAN
😎Code github.com/XingangPan/DragGAN
🔥34🤯18❤6👍4😱1
🗺️ AI-generated stereotypical men 🗺️

👉A thread about generating a stereotypical person from each of 15 countries around the world. And yes, Italians love pizza.

😎 More https://bit.ly/3oo0t4c
🤣6❤3🤯1
🍶 AVOS Multiscale Encoder-Decoder ViT 🍶

👉 MED-VT, the world's first Multiscale Encoder-Decoder Video Transformer for automatic video object segmentation (AVOS)

😎Review https://bit.ly/3MohFi1
😎Paper arxiv.org/pdf/2304.05930.pdf
😎Project rkyuca.github.io/medvt
😎Code github.com/rkyuca/medvt
👍13🥰1
🌊 Neural Dynamic Image-Based Rendering 🌊

👉 DynIBaR: synthesizing novel views from a monocular video depicting a complex dynamic scene.

😎Review https://t.ly/90Kw
😎Paper arxiv.org/pdf/2211.11082.pdf
😎Project https://dynibar.github.io/
😎Code github.com/google/dynibar
❤9👍3🥰1🤯1
🦁 Open Semantic Segmentation 🦁

👉SSSegmentation: an open-source supervised semantic segmentation toolbox based on #PyTorch

😎Review https://t.ly/ZE9q
😎Paper arxiv.org/pdf/2305.17091.pdf
😎Code github.com/SegmentationBLWX/sssegmentation
🔥10❤4⚡1👍1🤯1🤩1🍾1
🎗️ 4D Humans with Transformers 🎗️

👉Novel approach to reconstruct and track humans (even in unusual poses)

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2305.20091.pdf
😎Project shubham-goel.github.io/4dhumans/#
😎Code github.com/shubham-goel/4D-Humans
🤯10👍7🔥5❤2⚡1
🗽 Neuralangelo Digital Twins. INSANE 🗽

👉 A novel framework from #Nvidia for Hi-Fi 3D digital twins.

😎Review https://t.ly/rxoF4
😎Project research.nvidia.com/labs/dir/neuralangelo
😎Paper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
🔥15👍4🤯1
🦜 ColorDiffuser: Text-to-Video Colorization 🦜

👉HK University unveils ColorDiffuser: adapting a pre-trained text-to-image latent diffusion model for video colorization

😎Review https://t.ly/XGv_
😎Paper arxiv.org/pdf/2306.01732.pdf
😎Project colordiffuser.github.io/
😎Code github.com/ColorDiffuser/ColorDiffuser
🤯8❤2🤩1
🌻 Extending Mona Lisa with AI 🌻

👉 A guy on Reddit extends the Mona Lisa painting with #Photoshop AI. The result is surprising.

😎More https://t.ly/j_2r
🤯20👍5🤩4🔥3😱2🤣2⚡1