AI with Papers - Artificial Intelligence & Deep Learning
14.8K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
😍 CLIP/GPT3-driven Affective Faces 😍

👉Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

😎Review https://bit.ly/3HERna0
😎Paper arxiv.org/pdf/2301.10939.pdf
😎Project realtalk.cs.columbia.edu
😎Code github.com/scottgeng00/realtalk
This media is not supported in your browser
VIEW IN TELEGRAM
🎷Audio-Visual Semantic Segmentation🎷

👉A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

😎Review https://bit.ly/3wFY6dw
😎Paper arxiv.org/pdf/2301.13190.pdf
😎Project opennlplab.github.io/AVSBench
😎Code github.com/OpenNLPLab/AVSBench
This media is not supported in your browser
VIEW IN TELEGRAM
🚛 Text-driven Video Neural Editing 🚛

👉A novel text-guided video editing with both appearance/shape

😎Review https://bit.ly/3YcfMJO
😎Paper arxiv.org/pdf/2301.13173.pdf
😎Project text-video-edit.github.io/
This media is not supported in your browser
VIEW IN TELEGRAM
Mono-STAR: Unified Track/3D

👉Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

😎Review https://bit.ly/3Dxvxmx
😎Paper arxiv.org/pdf/2301.13244.pdf
😎Project github.com/changhaonan/Mono-STAR-demo
🛋️🛋️ 100% Accurated #3D Labeling 🛋️🛋️

👉#Amazon unveils a novel tool for fine-grained 3D part labeling. Up to 100% accuracy! Paper only😢

😎Review https://bit.ly/3kYpQHQ
😎Paper https://arxiv.org/pdf/2301.10460.pdf
This media is not supported in your browser
VIEW IN TELEGRAM
💧FLOW360: 360° Neural Optical Flow💧

👉 The first perceptually realistic 360° video benchmark dataset + SLOF method for OF tracking

😎Review https://bit.ly/3wMZZoX
😎Paper arxiv.org/pdf/2301.11880.pdf
😎Project https://siamlof.github.io
This media is not supported in your browser
VIEW IN TELEGRAM
🐓DREAMIX:General Diffusive Video Editor🐓

👉#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/
This media is not supported in your browser
VIEW IN TELEGRAM
🦚 MOSE: coMplex video Object SEgmentation 🦚

👉Novel Dataset for VOS is out! SOTA method on DAVIS is only 59.4% on MOSE

😎Review https://bit.ly/40yzSzW
😎Paper arxiv.org/pdf/2302.01872.pdf
😎Project henghuiding.github.io/MOSE/
😎Code github.com/henghuiding/MOSE-api
This media is not supported in your browser
VIEW IN TELEGRAM
🌘 Gen-1: next-gen Generative #AI 🌘

👉#Runway unveils Gen-1: the next step forward for Generative AI. Registration available for beta -> hurry up!

😎Review https://bit.ly/3YqQYh8
😎Paper arxiv.org/pdf/2302.03011.pdf
😎Project https://research.runwayml.com/gen1
This media is not supported in your browser
VIEW IN TELEGRAM
🗿DirectMHP: Multi-Head Pose Estimation🗿

👉Novel E2E multi-person head pose estimation (MPHPE) under full-range angles

😎Review https://bit.ly/3HJubXg
😎Paper arxiv.org/pdf/2302.01110.pdf
😎Code github.com/hnuzhy/DirectMHP
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 LEGO-Net: Objects in Rooms 🧱

👉Transformer-based iterative method for rearrangement of objects in messy rooms

😎Review https://bit.ly/3HR0fs6
😎Paper arxiv.org/pdf/2301.09629.pdf
😎Project ivl.cs.brown.edu/#/projects/lego-net
This media is not supported in your browser
VIEW IN TELEGRAM
🎃 In-N-Out: 3D-aware OOD video editing 🎃

👉Novel 3D-aware video editing able to manipulate OOD objects (e.g. heavy makeup, accessories)

😎Review https://bit.ly/3jN0CMu
😎Paper arxiv.org/pdf/2302.04871.pdf
😎Project https://in-n-out-3d.github.io
This media is not supported in your browser
VIEW IN TELEGRAM
🥸 MEGANE: Generative Morphable Eyeglass 🥸

👉#META unveils the most advanced #3D compositional morphable AI for eyeglasses (HD geometry/photometric interaction)

😎Review https://bit.ly/3jOWifu
😎Paper arxiv.org/pdf/2302.04868.pdf
😎Project junxuan-li.github.io/megane
This media is not supported in your browser
VIEW IN TELEGRAM
💘 3D-aware Blending with NeRF 💘

👉Novel 3D-aware blending method via generative NeRFs

😎Review https://bit.ly/3lBEJA2
😎Paper arxiv.org/pdf/2302.06608.pdf
😎Project blandocs.github.io/blendnerf
😎Code github.com/naver-ai/BlendNeRF
This media is not supported in your browser
VIEW IN TELEGRAM
🌅 Semantics-guided natural synthesis 🌅

👉Alibaba #AI unveils a novel semantics-guided synthesis of natural scenes

😎Review https://bit.ly/4115MVJ
😎Paper arxiv.org/pdf/2302.07224.pdf
😎Project zju3dv.github.io/paintingnature
This media is not supported in your browser
VIEW IN TELEGRAM
🦞 SOTA ALERT: YOWOv2 is out! 🦞

👉 The 2nd-gen of YOWO, real-time detection of spatio-temporal actions

😎Review https://bit.ly/3IscY60
😎Paper arxiv.org/pdf/2302.06848v1.pdf
😎Code github.com/yjh0410/YOWOv2
This media is not supported in your browser
VIEW IN TELEGRAM
📬 DIVOTrack: crossview MOT dataset 📬

👉 DIVOTrack + CrossMOT: the ultimate solution for MOT in realistic scenario

😎Review https://bit.ly/3YSFZgL
😎Paper arxiv.org/pdf/2302.07676.pdf
😎Code github.com/shengyuhao/DIVOTrack
This media is not supported in your browser
VIEW IN TELEGRAM
🦩 One-Shot Face via LSs of StyleGAN2 🦩

👉 Novel video generation framework with edits, facial motions, deformations & identity

😎Review https://bit.ly/3xuChhF
😎Paper arxiv.org/pdf/2302.07848.pdf
😎Project trevineoorloff.github.io/FaceVideoReenactment_HybridLatents.io/