AI with Papers - Artificial Intelligence & Deep Learning
All the AI with papers. Fresh daily updates on Deep Learning, Machine Learning, and Computer Vision (with papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🏸 Segment Anything in HQ 🏸

👉HQ-SAM: SAM with the ability to accurately segment objects while maintaining its promptable design, efficiency, and zero-shot generalizability

😎Review https://t.ly/GxX5B
😎Paper arxiv.org/pdf/2306.01567.pdf
😎Models github.com/SysCV/SAM-HQ
🔥18👍4🤯1😱1😍1
🌈 Track Everything Everywhere 🌈

👉#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.

😎Review https://t.ly/Krvw
😎Paper arxiv.org/pdf/2306.05422.pdf
😎Project omnimotion.github.io/
😎Demo omnimotion.github.io/#interactive_demo
😎Code github.com/qianqianwang68/omnimotion
🔥23❤5🤯3🤩1💩1
👁️ Scene Five: Through Her Eyes 👁️

👉 #3D reconstruction of the scene a person is observing, using only the reflections in their eyes

😎Review https://t.ly/uBO6
😎Paper arxiv.org/pdf/2306.09348.pdf
😎Project https://world-from-eyes.github.io/
🤯28🔥12💩2🤩1
🧿 NeRF-Supervised Deep Stereo 🧿

👉A novel pipeline for training deep stereo networks WITHOUT any ground truth

😎Review https://t.ly/c7j-
😎Project nerfstereo.github.io/
😎Dataset https://amsacta.unibo.it/id/eprint/7218/
😎Code github.com/fabiotosi92/NeRF-Supervised-Deep-Stereo
😎Paper https://openaccess.thecvf.com/content/CVPR2023/papers/Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf
🥰8🤩3❤1👍1💩1😍1
🫣 Text-Guided Adversarial Makeup 🫣

👉Novel facial privacy protection via adversarial latent codes. Makeup vs Face Recognition.

😎Review https://t.ly/pBCP
😎Paper arxiv.org/pdf/2306.10008.pdf
😎Code github.com/fahadshamshad/Clip2Protect
❤6👍1🔥1🥰1💩1
🦷 Few-Shot Geometry-Aware Keypoints 🦷

👉UBC (+Flawless AI) unveils the new SOTA in semantic keypoint localization. Suitable for faces, animals, cars, mouths, teeth & more

😎Review https://t.ly/-0qN
😎Paper arxiv.org/pdf/2303.17216.pdf
😎Project xingzhehe.github.io/FewShot3DKP/
🤯10👍4❤2⚡2👏2🤩2🔥1
🚔 Fooling Neural Forensic Classifiers 🚔

👉Adversarial faces that fool forensic classifiers while remaining undetectable by humans

😎Review https://t.ly/33Cc
😎Paper arxiv.org/pdf/2306.13091.pdf
😎Project koushiksrivats.github.io/face_attribute_attack
😎Code github.com/koushiksrivats/face_attribute_attack
😢6❤4👏2😱2🍾2👍1🤯1😍1
PanoHead: 3D Full-Head Synthesis

👉#ByteDance (+UW-M) unveils PanoHead: 360° view-consistent portraits from a single-view image

😎Review https://t.ly/MrLNR
😎Paper arxiv.org/pdf/2303.13071.pdf
😎Project sizhean.github.io/panohead
😎Code github.com/sizhean/panohead
🔥7❤4🤯3😱1
🔮SAM-PT: Segment Anything+Tracking🔮

👉SAM-PT is the first method to utilize sparse point propagation for Video Object Segmentation (VOS).

😎Review https://t.ly/QLMG
😎Paper arxiv.org/pdf/2307.01197.pdf
😎Project www.vis.xyz/pub/sam-pt/
😎Code github.com/SysCV/sam-pt
🔥14❤7🤯3👍1😱1
🪩 DISCO: Human Dance Generation 🪩

👉NTU (+ #Microsoft) unveils DISCO: a big step towards human dance generation.

😎Review https://t.ly/cNGX
😎Paper arxiv.org/pdf/2307.00040.pdf
😎Project disco-dance.github.io/
😎Code github.com/Wangt-CN/DisCo
🔥13🥰4😍2⚡1👍1🍾1
🛣️ STAR: 3D-tracking w/ attention paradigm 🛣️

👉#Mercedes STAR: e2e 3D object tracking that follows the tracking-by-attention paradigm

😎Review https://t.ly/JoGj
😎Paper arxiv.org/pdf/2306.17602.pdf
😎Project simondoll.github.io/publications/star_track
👍14🔥1🥰1👏1
🍡 Text2Cinemagraphs: Cinemagraph from text 🍡

👉CMU (+ #Snap) unveils a fully automated method for creating cinemagraphs from text descriptions

😎Review https://t.ly/BwZs6
😎Paper arxiv.org/pdf/2307.03190.pdf
😎Project text2cinemagraph.github.io/website
😎Code github.com/text2cinemagraph/text2cinemagraph
❤12🤯3😱1🤩1
🔥 Test-Time Training on fire 🔥

👉Extends test-time training (TTT) to the streaming setting. Suitable for panoptic and instance segmentation, and colorization.

😎Review https://t.ly/eZYA
😎Paper arxiv.org/pdf/2307.05014.pdf
😎Project https://video-ttt.github.io/
😎Code github.com/renwang435/video-ttt-release
🔥10👍3⚡1🤯1
🃏 Deepfake via casual self-scan 🃏

👉TAU presents a novel approach to reenact an identity using only a casual self-scan

😎Review https://t.ly/9T8Wi
😎Paper arxiv.org/pdf/2307.06307.pdf
😎Project arielazary.github.io/PGR
🤯7👍6❤5🔥1😱1
🎪 Extreme Human Pose Estimation 🎪

👉RePoGen: novel synthetic data generator of extreme/realistic poses of humans

😎Review https://t.ly/ecBvM
😎Paper arxiv.org/pdf/2307.06737.pdf
😎Project mirapurkrabek.github.io/RePoGen-paper
😎Code github.com/MiraPurkrabek/RePoGen
🔥12👍2👏1🤯1😱1
💡 DATID-3D: Text-to-3D Generation 💡

👉 A novel domain adaptation method for 3D via text-to-image diffusion. 🤗-Demo available!

😎Review https://t.ly/TCL-B
😎Paper arxiv.org/pdf/2211.16374.pdf
😎Project gwang-kim.github.io/datid_3d/
😎Code github.com/gwang-kim/DATID-3D
🤗 huggingface.co/spaces/gwang-kim/DATID-3D
😎Colab colab.research.google.com/drive/1e9NSVB7x_hjz-nr4K0jO4rfTXILnNGtA?usp=sharing
🤯5
🧯 Neural Focal Modulation VAR 🧯

👉A novel architecture for video recognition that models both local/global context

😎Review https://t.ly/rF_fk
😎Paper arxiv.org/pdf/2307.06947.pdf
😎Project talalwasim.github.io/Video-FocalNets
😎Code github.com/TalalWasim/Video-FocalNets
🔥8⚡1👏1🤩1
🐈 Gen-AI as representation learner 🐈

👉DreamTeacher: a novel self-supervised feature representation learning framework that uses generative networks to pre-train downstream image backbones

😎Review https://t.ly/RL8iG
😎Paper arxiv.org/pdf/2307.07487.pdf
😎Project research.nvidia.com/labs/toronto-ai/DreamTeacher
🔥9👍2🤯1
☔ #SelfDriving? It's all about weather! ☔

👉Novel self-supervised monocular depth estimation (MDE) method to handle adverse weather in real-world autonomous driving

😎Review https://t.ly/tcLQW
😎Paper arxiv.org/pdf/2307.08357.pdf
😎Project kieran514.github.io/Robust-Depth-Project/
❤7👍3🤯1😱1