AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🍡 Text2Cinemagraphs: Cinemagraph from text 🍡

👉CMU (+ #Snap) unveils a fully automated method for creating cinemagraphs from text descriptions

😎Review https://t.ly/BwZs6
😎Paper arxiv.org/pdf/2307.03190.pdf
😎Project text2cinemagraph.github.io/website
😎Code github.com/text2cinemagraph/text2cinemagraph
12🤯3😱1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Test-Time Training on fire 🔥

👉Extending the TTT to the streaming setting. Suitable for Panoptic, Instance & Colorization.

😎Review https://t.ly/eZYA
😎Paper arxiv.org/pdf/2307.05014.pdf
😎Project https://video-ttt.github.io/
😎Code github.com/renwang435/video-ttt-release
🔥10👍31🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🃏 Deepfake via casual self-scan 🃏

👉TAU presents a novel approach to reenact an ID using only a casual self-scan

😎Review https://t.ly/9T8Wi
😎Paper arxiv.org/pdf/2307.06307.pdf
😎Project arielazary.github.io/PGR
🤯7👍65🔥1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🎪 Extreme Human Pose Estimation 🎪

👉RePoGen: novel synthetic data generator of extreme/realistic poses of humans

😎Review https://t.ly/ecBvM
😎Paper arxiv.org/pdf/2307.06737.pdf
😎Project mirapurkrabek.github.io/RePoGen-paper
😎Code github.com/MiraPurkrabek/RePoGen
🔥12👍2👏1🤯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🧯Neural Focal Modulation VAR🧯

👉A novel architecture for video recognition that models both local/global context

😎Review https://t.ly/rF_fk
😎Paper arxiv.org/pdf/2307.06947.pdf
😎Project talalwasim.github.io/Video-FocalNets
😎Code github.com/TalalWasim/Video-FocalNets
🔥81👏1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐈 Gen-AI as representation learner 🐈

👉DreamTeacher: novel self-supervised feats. representation learning framework that utilizes gen-nets for pre-training downstream image backbones

😎Review https://t.ly/RL8iG
😎Paper arxiv.org/pdf/2307.07487.pdf
😎Project research.nvidia.com/labs/toronto-ai/DreamTeacher
🔥9👍2🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
#SelfDriving? It's all about weather!

👉Novel self-supervised MDE method to handle adverse weather in real-world autonomous driving

😎Review https://t.ly/tcLQW
😎Paper arxiv.org/pdf/2307.08357.pdf
😎Project kieran514.github.io/Robust-Depth-Project/
7👍3🤯1😱1
🦙 Llama-2: the Open-Source "ChatGPT" 🦙

👉GenAI, #Meta unveils Llama-2: a collection of LLMs ranging in scale 7-70B params. Challenging with #chatgpt, but open.

😎Review https://t.ly/bLJgP
😎Paper https://t.ly/AOXru
😎Project https://ai.meta.com/llama
🤯192🔥1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍉 AltFreezing: new SOTA in detecting deepfake 🍉

👉#Microsoft unveils AltFreezing: spatial/temporal artifacts in one model for more general face forgery detection

😎Review https://t.ly/mkIKX
😎Paper https://t.ly/z4KnJ
😎Code github.com/ZhendongWang6/AltFreezing
😱6👍5😍4🤯2🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🪟META's Ultra-HD Data for #AR🪟

👉Aria Digital Twin: egocentric dataset for detection/tracking, reconstruction/understanding, S2R learning, pose and more.

😎Review https://t.ly/MRPt1
😎Paper arxiv.org/pdf/2306.06362.pdf
😎Project www.projectaria.com/datasets/adt
😎Code github.com/facebookresearch/projectaria_tools
🔥10👍1
This media is not supported in your browser
VIEW IN TELEGRAM
👩‍🦰 Ultra-Realistic Neural Hair 👩‍🦰

👉A novel method to reconstruct the hair geometry at a strand level from monocular video or multi-view images

😎Review https://t.ly/6xZyp
😎Paper arxiv.org/pdf/2306.05872.pdf
😎Project samsunglabs.github.io/NeuralHaircut
😎Code github.com/SamsungLabs/NeuralHaircut
🤯17🤩5😍5👍21
This media is not supported in your browser
VIEW IN TELEGRAM
💪 Muscles in Action with #AI 💪

👉Muscles in Action (MIA): learn to incorporate muscle activity into human motion representations

😎Review https://t.ly/hUKub
😎Paper arxiv.org/pdf/2212.02978.pdf
😎Project musclesinaction.cs.columbia.edu
🔥7👍2👏2🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🪤 PAPR: Proximity Attention Point Render 🪤

👉PAPR: fast point-based scene representation with differentiable renderer approach

😎Review https://t.ly/yoI0g
😎Paper arxiv.org/pdf/2307.11086.pdf
😎Project https://zvict.github.io/papr
👍2🥰2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🪛 CAD-based Object Segmentation 🪛

👉 A novel three-stage approach to segment unseen objects in RGB images using their CAD models

😎Review https://t.ly/RtHLN
😎Paper arxiv.org/pdf/2307.11067.pdf
😎Code https://github.com/nv-nguyen/cnos
🔥7🤯41😱1🤩1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🛵 ALPR via CTS-Matching 🛵

👉UIT unveils a neural approach (#YOLO5 + tracking + rotation) to improve the license plate recognition accuracy

😎Review https://t.ly/VP4BP
😎Paper arxiv.org/pdf/2307.11336.pdf
😎Code github.com/chequanghuy/Character-Time-series-Matching
🔥92🤯1😱1🤣1
This media is not supported in your browser
VIEW IN TELEGRAM
🥬 Generative AI’s Next Frontiers 🥬

👉Hair simulation, 2D->3D animation, and much more. ~20 papers from #NVIDIA accepted into #SIGGRAPH2023

😎 Review https://t.ly/wgGin
🤯13👍3🤩3🥰1😱1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🦀 simPLE: learning to grasp only with CAD 🦀

👉simPLE learns to pick, regrasp & place objects precisely, given only the object CAD model and no prior experience

😎Review https://t.ly/ab5pA
😎Paper arxiv.org/pdf/2307.13133.pdf
😎Project mcube.mit.edu/research/simPLE.html
4🔥2👍1👏1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐧 Track Anything in HQ 🐧

👉Video multi-object segmenter (VMOS) and a mask refiner (MR) to track anything

😎Review https://t.ly/hAvF2
😎Paper arxiv.org/pdf/2307.13974.pdf
😎Code github.com/jiawen-zhu/HQTrack
🔥5🤯2👍1🤩1
🥬Consensus-Adaptive RANSAC🥬

👉Novel RANSAC that learns to explore the parameter space via a novel attention layer

😎Review https://t.ly/eSLmD
😎Paper arxiv.org/pdf/2307.14030.pdf
😎Code github.com/cavalli1234/CA-RANSAC
🔥7🤯3😱1🍾1