AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI, with papers. Fresh daily updates on Deep Learning, Machine Learning, and Computer Vision (with papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
👩‍🚀 HD Avatar via Text & Pose 👩‍🚀

👉 Generating expressive #3D avatars from nothing but text descriptions & pose guidance

😎Review https://t.ly/wrSMH
😎Paper arxiv.org/pdf/2308.03610.pdf
😎Project avatarverse3d.github.io
🐘 Controllable Synthetic Data (extending ImageNet) 🐘

👉#META's PUG, a new generation of interactive environments for representation learning. Extending ImageNet!

😎Review https://t.ly/nCYs0
😎Paper arxiv.org/pdf/2308.03977.pdf
😎Project pug.metademolab.com
😎Code github.com/facebookresearch/PUG
🌈 Tracking by Persistent Dynamic View Synthesis 🌈

👉A novel method that simultaneously tackles dynamic-scene novel-view synthesis and 6-DOF tracking of all dense scene elements

😎Review https://t.ly/Bc535
😎Paper arxiv.org/pdf/2308.09713.pdf
😎Project dynamic3dgaussians.github.io
😎Code github.com/JonathonLuiten/Dynamic3DGaussians
🛒 Digital Twins for AutoRetail Checkout 🛒

👉From #Nvidia, a novel approach that uses 3D assets to train 2D detection and tracking models for AutoRetail Checkout

😎Review https://t.ly/Ea7kt
😎Paper arxiv.org/pdf/2308.09708.pdf
😎Code github.com/yorkeyao/Automated-Retail-Checkout
🥬SportsMOT + MixSort = Sport MOT🥬

👉Nanjing University just released an MOT dataset for sports scenes + the SOTA code/model for tracking (MixSort)

😎Review https://t.ly/NHUxL
😎Paper arxiv.org/pdf/2304.05170.pdf
😎Code github.com/MCG-NJU/MixSort
😎Project deeperaction.github.io/datasets/sportsmot.html
⚡️Feature Matching at Light Speed⚡️

👉LightGlue is a lightweight feature matcher with high accuracy and blazing fast inference

😎Review https://t.ly/jkecX
😎Paper arxiv.org/pdf/2306.13643.pdf
😎Code github.com/cvg/LightGlue
🕹️ CoDeF: Video Content Deformation Fields 🕹️

👉CoDeF is a new type of video representation for video-editing tasks

😎Review https://t.ly/PIVl-
😎Paper arxiv.org/pdf/2308.07926.pdf
😎Project https://qiuyu96.github.io/CoDeF
😎Code https://github.com/qiuyu96/CoDeF
Hello everybody,
a lot of you asked me to open the comments to better enjoy the posts. I want to follow your suggestion and hope you will enjoy this new mood!

🔥 NO SPAM
🔥 NO COMMERCIAL
🔥 NO DISRESPECTFUL MESSAGES

🧡JUST AI & SCIENCE

⚠️ BAN AT THE FIRST VIOLATION ⚠️
🦠 Instance-Level Semantics of Cells 🦠

👉TYC: novel dataset for understanding instance-level semantics & motions of cells in microstructures

😎Review https://t.ly/y-4VZ
😎Paper arxiv.org/pdf/2308.12116.pdf
😎Project christophreich1996.github.io/tyc_dataset/
😎Code github.com/ChristophReich1996/TYC-Dataset
😎Data tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3930
🌵POCO: 3D HPS + Confidence🌵

👉 Novel framework for human pose and shape (HPS) estimation: #3D human body + confidence in a single feed-forward pass

😎Review https://t.ly/cDePe
😎Paper arxiv.org/pdf/2308.12965.pdf
😎Project https://poco.is.tue.mpg.de
🌆 NeO360: NeRF for Sparse Outdoor 🌆

👉#Toyota (+GIT) unveils NeO360: 360° outdoor scenes from a single or a few posed RGB images

😎Review https://t.ly/JDJZg
😎Paper arxiv.org/pdf/2308.12967.pdf
😎Project zubair-irshad.github.io/projects/neo360.html
🥕 Scenimefy: I-2-I for anime 🥕

👉S-Lab unveils a novel semi-supervised I-2-I translation framework + HD dataset for anime

😎Review https://t.ly/IsdEG
😎Paper arxiv.org/pdf/2308.12968.pdf
😎Code https://github.com/Yuxinn-J/Scenimefy
😎Project https://yuxinn-j.github.io/projects/Scenimefy.html
Watch Your Steps: Editing by Text

👉The new SOTA in text-driven image & scene editing via denoising diffusion models

😎Review https://t.ly/fv9wn
😎Paper arxiv.org/pdf/2308.08947.pdf
😎Project ashmrz.github.io/WatchYourSteps
💡 Relighting NeRF 💡

👉Neural implicit radiance representation for free viewpoint relighting of an object lit by a moving point light

😎Review https://t.ly/J-3_L
😎Project nrhints.github.io
😎Code github.com/iamNCJ/NRHints
😎Paper nrhints.github.io/pdfs/nrhints-sig23.pdf
🪶 ReST: Multi-Camera MOT 🪶

👉Novel reconfigurable two-step graph model for multi-camera multi-object tracking (MC-MOT) in video

😎Review https://t.ly/3C5tb
😎Paper arxiv.org/pdf/2308.13229.pdf
😎Code github.com/chengche6230/ReST
🌲MagicEdit: Magic Video Edit🌲

👉MagicEdit: explicitly disentangles content, structure & motion for high-fidelity and temporally coherent video editing

😎Report https://t.ly/tREX4
😎Paper arxiv.org/pdf/2308.14749.pdf
😎Project magic-edit.github.io
😎Code github.com/magic-research/magic-edit
✂️ VideoCutLER: Simple UVIS ✂️

👉VideoCutLER is a simple unsupervised video instance segmentation (UVIS) method that does not rely on optical flow

😎Review https://t.ly/PBBjG
😎Paper arxiv.org/pdf/2308.14710.pdf
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
😎Code github.com/facebookresearch/CutLER/tree/main/videocutler
🐦 3D Pigeons Pose & Tracking 🐦

👉 3D-MuPPET: estimate and track 3D poses of pigeons from multiple views

😎Review https://t.ly/jfAJJ
😎Paper arxiv.org/pdf/2308.15316.pdf
😎Code github.com/alexhang212/3D-MuPPET/