AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🍄 BTS: Density Fields from Single View 🍄

👉Volumetric scene representation from a single image in challenging conditions

😎Review https://bit.ly/3wjHDvH
😎Paper arxiv.org/pdf/2301.07668.pdf
😎Project fwmb.github.io/bts/
🔥7👍1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
StyleGAN-T: unlocking Power of GANs

👉#Nvidia unveils StyleGAN-T to regain competitiveness to GANs vs. Diffusive Models

😎Review https://bit.ly/3HtKxEA
😎Paper arxiv.org/pdf/2301.09515.pdf
😎Project sites.google.com/view/stylegan-t
😎Code github.com/autonomousvision/stylegan-t
🔥9👍4🤯41
This media is not supported in your browser
VIEW IN TELEGRAM
🪀 NeRF in Time, Space and Appearance 🪀

👉From Berkeley k-planes: a white-box model for radiance fields in arbitrary dimensions

😎Review https://bit.ly/3J8GiiS
😎Paper arxiv.org/pdf/2301.10241.pdf
😎Project sarafridov.github.io/K-Planes/
😎Code github.com/sarafridov/K-Planes
👍2🤯1🍾1
Media is too big
VIEW IN TELEGRAM
🔥 Neural Tracking via Weighted OF 🔥

👉The new SOTA in planar neural tracking is INSANE!

😎Review https://bit.ly/404gcDs
😎Paper arxiv.org/pdf/2301.10057.pdf
😎Code github.com/serycjon/WOFT
😎Project cmp.felk.cvut.cz/~serycjon/WOFT
🤯153👍3😱1
This media is not supported in your browser
VIEW IN TELEGRAM
Detecting Vulnerable Pedestrian

👉 BGSU opens a novel pedestrian dataset for vulnerable people

😎Review https://bit.ly/3JjVmu2
😎Paper arxiv.org/pdf/2212.06218.pdf
😎Data github.com/devvansh1997/BGVP
👍61🔥1
🧠 SERENA: LLM for Mental Health Support 🧠

👉Interactive #AI (in "#chatgpt" style) designed for mental health counseling

😎Review https://bit.ly/3wtbW37
😎Paper arxiv.org/pdf/2301.09412.pdf
😎Project https://serena.chat/
👍92
This media is not supported in your browser
VIEW IN TELEGRAM
🐕 MAV3D: #3D Video from Text 🐕

👉#META unveils a novel #AI for generating #3D dynamic videos from text

😎Review https://bit.ly/3XN0zin
😎Paper arxiv.org/pdf/2301.11280.pdf
😎Project make-a-video3d.github.io
🔥8👍3🤣31
This media is not supported in your browser
VIEW IN TELEGRAM
🔥CutLER: Unsupervised Segmentation 🔥

👉Novel paper by #META on detection & instance segmentation without human annotations

😎Review https://bit.ly/3DlFiUG
😎Paper arxiv.org/pdf/2301.11320.pdf
😎Code github.com/facebookresearch/CutLER
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
10👍4🔥4🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
😍 CLIP/GPT3-driven Affective Faces 😍

👉Columbia unveils a neural framework for facial expressions retrieval given the context of the speaker

😎Review https://bit.ly/3HERna0
😎Paper arxiv.org/pdf/2301.10939.pdf
😎Project realtalk.cs.columbia.edu
😎Code github.com/scottgeng00/realtalk
🔥125👍1🥰1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐦 Physics-inspired Computer Vision 🐦

👉UCLA unveils PhyCV, the first Physics-inspired Computer Vision Library

😎Review https://bit.ly/3HEWozI
😎Code github.com/JalaliLabUCLA/phycv
😎Project photonics.ucla.edu/2022/05/12/jalali-lab-open-sources-phycv-a-physics-inspired-computer-vision-library/
🤯75👍4😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🎷Audio-Visual Semantic Segmentation🎷

👉A novel problem in #AI: pixel-level segmentation of objects that produce sound in the image frame

😎Review https://bit.ly/3wFY6dw
😎Paper arxiv.org/pdf/2301.13190.pdf
😎Project opennlplab.github.io/AVSBench
😎Code github.com/OpenNLPLab/AVSBench
🤯10👍3🔥21😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🚛 Text-driven Video Neural Editing 🚛

👉A novel text-guided video editing with both appearance/shape

😎Review https://bit.ly/3YcfMJO
😎Paper arxiv.org/pdf/2301.13173.pdf
😎Project text-video-edit.github.io/
🔥12👍1
This media is not supported in your browser
VIEW IN TELEGRAM
Mono-STAR: Unified Track/3D

👉Real-time 3D unified framework for semantic fusion, tracking, non-rigid deformation, and topological changes

😎Review https://bit.ly/3Dxvxmx
😎Paper arxiv.org/pdf/2301.13244.pdf
😎Project github.com/changhaonan/Mono-STAR-demo
5👍4🔥41
🛋️🛋️ 100% Accurated #3D Labeling 🛋️🛋️

👉#Amazon unveils a novel tool for fine-grained 3D part labeling. Up to 100% accuracy! Paper only😢

😎Review https://bit.ly/3kYpQHQ
😎Paper https://arxiv.org/pdf/2301.10460.pdf
🤯102👍1
This media is not supported in your browser
VIEW IN TELEGRAM
💧FLOW360: 360° Neural Optical Flow💧

👉 The first perceptually realistic 360° video benchmark dataset + SLOF method for OF tracking

😎Review https://bit.ly/3wMZZoX
😎Paper arxiv.org/pdf/2301.11880.pdf
😎Project https://siamlof.github.io
👍7🤯2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🐓DREAMIX:General Diffusive Video Editor🐓

👉#Google unveils the first diffusion-based method able to perform text-based motion/appearance editing of general videos

😎Review https://bit.ly/3I3Hq6B
😎Paper arxiv.org/pdf/2302.01329.pdf
😎Project dreamix-video-editing.github.io/
🤯24😱3👍21
This media is not supported in your browser
VIEW IN TELEGRAM
🦚 MOSE: coMplex video Object SEgmentation 🦚

👉Novel Dataset for VOS is out! SOTA method on DAVIS is only 59.4% on MOSE

😎Review https://bit.ly/40yzSzW
😎Paper arxiv.org/pdf/2302.01872.pdf
😎Project henghuiding.github.io/MOSE/
😎Code github.com/henghuiding/MOSE-api
7👍2🔥2
This media is not supported in your browser
VIEW IN TELEGRAM
🌘 Gen-1: next-gen Generative #AI 🌘

👉#Runway unveils Gen-1: the next step forward for Generative AI. Registration available for beta -> hurry up!

😎Review https://bit.ly/3YqQYh8
😎Paper arxiv.org/pdf/2302.03011.pdf
😎Project https://research.runwayml.com/gen1
🤯10😱31👍1🔥1🤩1