AI with Papers - Artificial Intelligence & Deep Learning

😍 CLIP/GPT3-driven Affective Faces 😍

👉Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

😎Review https://bit.ly/3HERna0
😎Paper arxiv.org/pdf/2301.10939.pdf
😎Project realtalk.cs.columbia.edu
😎Code github.com/scottgeng00/realtalk

🔥12❤5👍1🥰1🤩1

4.89K viewsedited 08:12

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐦 Physics-inspired Computer Vision 🐦

👉UCLA unveils PhyCV, the first Physics-inspired Computer Vision Library

😎Review https://bit.ly/3HEWozI
😎Code github.com/JalaliLabUCLA/phycv
😎Project photonics.ucla.edu/2022/05/12/jalali-lab-open-sources-phycv-a-physics-inspired-computer-vision-library/

🤯7❤5👍4😱1

5.12K viewsedited 11:27

AI with Papers - Artificial Intelligence & Deep Learning

0:05

This media is not supported in your browser

VIEW IN TELEGRAM

🎷Audio-Visual Semantic Segmentation🎷

👉A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

😎Review https://bit.ly/3wFY6dw
😎Paper arxiv.org/pdf/2301.13190.pdf
😎Project opennlplab.github.io/AVSBench
😎Code github.com/OpenNLPLab/AVSBench

🤯10👍3🔥2❤1😱1

6.17K views08:48

AI with Papers - Artificial Intelligence & Deep Learning

0:05

This media is not supported in your browser

VIEW IN TELEGRAM

🚛 Text-driven Video Neural Editing 🚛

👉A novel text-guided video editing with both appearance/shape

😎Review https://bit.ly/3YcfMJO
😎Paper arxiv.org/pdf/2301.13173.pdf
😎Project text-video-edit.github.io/

🔥12👍1

4.39K views13:37

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

⭐ Mono-STAR: Unified Track/3D ⭐

👉Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

😎Review https://bit.ly/3Dxvxmx
😎Paper arxiv.org/pdf/2301.13244.pdf
😎Project github.com/changhaonan/Mono-STAR-demo

⚡5👍4🔥4❤1

4.5K viewsedited 08:21

AI with Papers - Artificial Intelligence & Deep Learning

🛋️🛋️ 100% Accurated #3D Labeling 🛋️🛋️

👉#Amazon unveils a novel tool for fine-grained 3D part labeling. Up to 100% accuracy! Paper only😢

😎Review https://bit.ly/3kYpQHQ
😎Paper https://arxiv.org/pdf/2301.10460.pdf

🤯10❤2👍1

4.76K viewsedited 13:16

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💧FLOW360: 360° Neural Optical Flow💧

👉 The first perceptually realistic 360° video benchmark dataset + SLOF method for OF tracking

😎Review https://bit.ly/3wMZZoX
😎Paper arxiv.org/pdf/2301.11880.pdf
😎Project https://siamlof.github.io

👍7🤯2🔥1

4.99K viewsedited 08:41

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐓DREAMIX:General Diffusive Video Editor🐓

👉#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/

🤯24😱3👍2❤1

5.2K viewsedited 07:34

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧩 Text-Guided #3D Texturing 🧩

👉 Text-Guided HQ textures via iterative diffusion-based process

😎Review https://bit.ly/3ldC6Ez
😎Project texturepaper.github.io/TEXTurePaper
😎Code github.com/TEXTurePaper/TEXTurePaper
😎Paper texturepaper.github.io/TEXTurePaper/static/paper.pdf

🔥8🤯2👍1

4.5K views07:51

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦚 MOSE: coMplex video Object SEgmentation 🦚

👉Novel Dataset for VOS is out! SOTA method on DAVIS is only 59.4% on MOSE

😎Review https://bit.ly/40yzSzW
😎Paper arxiv.org/pdf/2302.01872.pdf
😎Project henghuiding.github.io/MOSE/
😎Code github.com/henghuiding/MOSE-api

❤7👍2🔥2

4.82K viewsedited 13:09

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌘 Gen-1: next-gen Generative #AI 🌘

👉#Runway unveils Gen-1: the next step forward for Generative AI. Registration available for beta -> hurry up!

😎Review https://bit.ly/3YqQYh8
😎Paper arxiv.org/pdf/2302.03011.pdf
😎Project https://research.runwayml.com/gen1

🤯10😱3❤1👍1🔥1🤩1

4.84K viewsedited 07:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🗿DirectMHP: Multi-Head Pose Estimation🗿

👉Novel E2E multi-person head pose estimation (MPHPE) under full-range angles

😎Review https://bit.ly/3HJubXg
😎Paper arxiv.org/pdf/2302.01110.pdf
😎Code github.com/hnuzhy/DirectMHP

🔥13👍1

4.77K views07:58

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧱 LEGO-Net: Objects in Rooms 🧱

👉Transformer-based iterative method for rearrangement of objects in messy rooms

😎Review https://bit.ly/3HR0fs6
😎Paper arxiv.org/pdf/2301.09629.pdf
😎Project ivl.cs.brown.edu/#/projects/lego-net

🔥11🤯4

4.93K views14:09

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎃 In-N-Out: 3D-aware OOD video editing 🎃

👉Novel 3D-aware video editing able to manipulate OOD objects (e.g. heavy makeup, accessories)

😎Review https://bit.ly/3jN0CMu
😎Paper arxiv.org/pdf/2302.04871.pdf
😎Project https://in-n-out-3d.github.io

🔥4❤2🤯2👍1

5K views07:41

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🥸 MEGANE: Generative Morphable Eyeglass 🥸

👉#META unveils the most advanced #3D compositional morphable AI for eyeglasses (HD geometry/photometric interaction)

😎Review https://bit.ly/3jOWifu
😎Paper arxiv.org/pdf/2302.04868.pdf
😎Project junxuan-li.github.io/megane

🔥9🤯3👍2🤩1

5.38K views12:58

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💘 3D-aware Blending with NeRF 💘

👉Novel 3D-aware blending method via generative NeRFs

😎Review https://bit.ly/3lBEJA2
😎Paper arxiv.org/pdf/2302.06608.pdf
😎Project blandocs.github.io/blendnerf
😎Code github.com/naver-ai/BlendNeRF

❤8

4.53K viewsedited 12:53

AI with Papers - Artificial Intelligence & Deep Learning

0:03

This media is not supported in your browser

VIEW IN TELEGRAM

🌅 Semantics-guided natural synthesis 🌅

👉Alibaba #AI unveils a novel semantics-guided synthesis of natural scenes

😎Review https://bit.ly/4115MVJ
😎Paper arxiv.org/pdf/2302.07224.pdf
😎Project zju3dv.github.io/paintingnature

👍5🔥1🤯1

4.51K viewsedited 13:52

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦞 SOTA ALERT: YOWOv2 is out! 🦞

👉 The 2nd-gen of YOWO, real-time detection of spatio-temporal actions

😎Review https://bit.ly/3IscY60
😎Paper arxiv.org/pdf/2302.06848v1.pdf
😎Code github.com/yjh0410/YOWOv2

🔥17👍2

4.51K viewsedited 07:54

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

📬 DIVOTrack: crossview MOT dataset 📬

👉 DIVOTrack + CrossMOT: the ultimate solution for MOT in realistic scenario

😎Review https://bit.ly/3YSFZgL
😎Paper arxiv.org/pdf/2302.07676.pdf
😎Code github.com/shengyuhao/DIVOTrack

🔥6👍2🤯1

4.53K views13:13

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🦩 One-Shot Face via LSs of StyleGAN2 🦩

👉 Novel video generation framework with edits, facial motions, deformations & identity

😎Review https://bit.ly/3xuChhF
😎Paper arxiv.org/pdf/2302.07848.pdf
😎Project trevineoorloff.github.io/FaceVideoReenactment_HybridLatents.io/

🤯3😱2⚡1👍1

4.59K views07:48

About

Blog

Apps

Platform