AI with Papers - Artificial Intelligence & Deep Learning
All the AI with papers. Fresh daily updates on Deep Learning, Machine Learning, and Computer Vision (with papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🕹️ CoDeF: Video Content Deformation Fields 🕹️

👉CoDeF is a new type of video representation for video-editing tasks

😎Review https://t.ly/PIVl-
😎Paper arxiv.org/pdf/2308.07926.pdf
😎Project https://qiuyu96.github.io/CoDeF
😎Code https://github.com/qiuyu96/CoDeF
❀18πŸ”₯4πŸ‘2πŸ₯°1🀯1😱1
Hello everybody,
many of you asked me to open the comments to better enjoy the posts. I'm following your suggestion, and I hope you enjoy this new format!

🔥 NO SPAM
🔥 NO COMMERCIAL CONTENT
🔥 NO DISRESPECTFUL MESSAGES

🧑JUST AI & SCIENCE

⚠️ BAN AT THE FIRST VIOLATION ⚠️
❀44πŸ‘28πŸ”₯6πŸ‘1🀯1🍾1
AI with Papers - Artificial Intelligence & Deep Learning pinned Β«Hello everybody, a lot of you asked me to open the comments to better enjoy the posts. I want to follow your suggestion, hope you will enjoy this new mood! πŸ”₯ NO SPAM πŸ”₯ NO COMMERCIAL πŸ”₯ NO UNRESPECTFUL MESSAGEs 🧑JUST AI & SCIENCE ⚠️ BAN AT THE FIRST…»
This media is not supported in your browser
VIEW IN TELEGRAM
🦠 Instance-Level Semantics of Cells 🦠

πŸ‘‰TYC: novel dataset for understanding instance-level semantics & motions of cells in microstructures

😎Review https://t.ly/y-4VZ
😎Paper arxiv.org/pdf/2308.12116.pdf
😎Project christophreich1996.github.io/tyc_dataset/
😎Code github.com/ChristophReich1996/TYC-Dataset
😎Data tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/3930
🌡POCO: 3D HPS + Confidence🌡

πŸ‘‰ Novel framework for HPS: #3D human body + confidence in a single feed-forward pass

😎Review https://t.ly/cDePe
😎Paper arxiv.org/pdf/2308.12965.pdf
😎Project https://poco.is.tue.mpg.de
🌆 NeO360: NeRF for Sparse-View Outdoor Scenes 🌆

👉#Toyota (+GIT) unveils NeO360: novel-view synthesis of 360° outdoor scenes from a single or a few posed RGB images

😎Review https://t.ly/JDJZg
😎Paper arxiv.org/pdf/2308.12967.pdf
😎Project zubair-irshad.github.io/projects/neo360.html
❀13πŸ‘3πŸ”₯2πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🥕 Scenimefy: I-2-I for anime 🥕

👉S-Lab unveils a novel semi-supervised image-to-image (I-2-I) translation framework + HD dataset for anime

😎Review https://t.ly/IsdEG
😎Paper arxiv.org/pdf/2308.12968.pdf
😎Code https://github.com/Yuxinn-J/Scenimefy
😎Project https://yuxinn-j.github.io/projects/Scenimefy.html
🐨 Watch Your Steps: Editing by Text 🐨

πŸ‘‰The novel SOTA in image & scene (text) editing via denoising diffusion models

😎Review https://t.ly/fv9wn
😎Paper arxiv.org/pdf/2308.08947.pdf
😎Project ashmrz.github.io/WatchYourSteps
❀4πŸ‘3🀯3πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
💡 Relighting NeRF 💡

👉Neural implicit radiance representation for free-viewpoint relighting of an object lit by a moving point light

😎Review https://t.ly/J-3_L
😎Project nrhints.github.io
😎Code github.com/iamNCJ/NRHints
😎Paper nrhints.github.io/pdfs/nrhints-sig23.pdf
🀯3πŸ‘2❀1⚑1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🪶 ReST: Multi-Camera MOT 🪶

👉Novel reconfigurable two-step graph model for multi-camera multi-object tracking (MC-MOT)

😎Review https://t.ly/3C5tb
😎Paper arxiv.org/pdf/2308.13229.pdf
😎Code github.com/chengche6230/ReST
🌲MagicEdit: Magic Video Edit🌲

πŸ‘‰MagicEdit: explicit disentangling content, structure & motion for Hi-Fi and temporally coherent video editing

😎Report https://t.ly/tREX4
😎Paper arxiv.org/pdf/2308.14749.pdf
😎Project magic-edit.github.io
😎Code github.com/magic-research/magic-edit
✂️ VideoCutLER: Simple UVIS ✂️

👉VideoCutLER is a simple unsupervised video instance segmentation (UVIS) method that does not rely on optical flow

😎Review https://t.ly/PBBjG
😎Paper arxiv.org/pdf/2308.14710.pdf
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
😎Code github.com/facebookresearch/CutLER/tree/main/videocutler
🐦 3D Pigeons Pose & Tracking 🐦

πŸ‘‰ 3D-MuPPET: estimate and track 3D poses of pigeons with multiple-views

😎Review https://t.ly/jfAJJ
😎Paper arxiv.org/pdf/2308.15316.pdf
😎Code github.com/alexhang212/3D-MuPPET/
🀣17🀯14πŸ‘4πŸ₯°2❀1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🎍RoboTAP: Dense Tracking for Few-Shot Imitation🎍

πŸ‘‰RoboTAP: novel dense tracking representation for robotic arm

😎Review https://t.ly/MCO_V
😎Paper arxiv.org/pdf/2308.15975.pdf
😎Project https://robotap.github.io/
😎Code github.com/deepmind/tapnet
⛺FACET: Fairness in Computer Vision⛺

👉#META AI releases a large, publicly available dataset for classification, detection & segmentation. It is designed to evaluate potential performance disparities & challenges across sensitive demographic attributes

😎Review https://t.ly/mKn-t
😎Paper arxiv.org/pdf/2309.00035.pdf
😎Dataset https://facet.metademolab.com/
♊️ Doppelgangers in Structures ♊️

👉A novel learning-based approach for visual disambiguation: distinguishing illusory matches to produce correct, disambiguated #3D reconstructions

😎Review https://t.ly/9yLot
😎Paper arxiv.org/pdf/2309.02420.pdf
😎Code github.com/RuojinCai/Doppelgangers
😎Project doppelgangers-3d.github.io/
🏃 Tracking Anything with Decoupled VOS 🏃

👉A novel VOS approach that extends SAM for open-world video segmentation with no user input required

😎Review https://t.ly/xeobR
😎Paper arxiv.org/pdf/2309.03903.pdf
😎Project hkchengrex.com/Tracking-Anything-with-DEVA
😎Code github.com/hkchengrex/Tracking-Anything-with-DEVA
😎Colab https://colab.research.google.com/drive/1OsyNVoV_7ETD1zIE8UWxL3NXxu12m_YZ
🪷 Diffusive Consistent Video Editing 🪷

👉Weizmann Institute of Science unveils TokenFlow, a novel framework that leverages a text-to-image diffusion model for text-driven video editing

😎Review https://t.ly/ru8km
😎Paper arxiv.org/pdf/2307.10373.pdf
😎Project diffusion-tokenflow.github.io
😎Code github.com/omerbt/TokenFlow
❀9πŸ‘6πŸ”₯2🀯1😱1😒1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥🔥 #META's DINOv2 is now commercial! 🔥🔥

👉Universal features for image classification, instance retrieval, video understanding, depth estimation & semantic segmentation. Now suitable for commercial use.

😎Review https://t.ly/LNrGy
😎Paper arxiv.org/pdf/2304.07193.pdf
😎Code github.com/facebookresearch/dinov2
😎Demo dinov2.metademolab.com/
🧄FreeMan: towards #3D Humans 🧄

👉FreeMan: the first large-scale, real-world, multi-view dataset for #3D human pose estimation. 11M frames!

😎Review https://t.ly/ICxpA
😎Paper arxiv.org/pdf/2309.05073.pdf
😎Project wangjiongw.github.io/freeman