AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🧯Neural Focal Modulation VAR🧯

👉A novel architecture for video recognition that models both local/global context

😎Review https://t.ly/rF_fk
😎Paper arxiv.org/pdf/2307.06947.pdf
😎Project talalwasim.github.io/Video-FocalNets
😎Code github.com/TalalWasim/Video-FocalNets
🔥81👏1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🐈 Gen-AI as representation learner 🐈

👉DreamTeacher: novel self-supervised feats. representation learning framework that utilizes gen-nets for pre-training downstream image backbones

😎Review https://t.ly/RL8iG
😎Paper arxiv.org/pdf/2307.07487.pdf
😎Project research.nvidia.com/labs/toronto-ai/DreamTeacher
🔥9👍2🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
#SelfDriving? It's all about weather!

👉Novel self-supervised MDE method to handle adverse weather in real-world autonomous driving

😎Review https://t.ly/tcLQW
😎Paper arxiv.org/pdf/2307.08357.pdf
😎Project kieran514.github.io/Robust-Depth-Project/
7👍3🤯1😱1
🦙 Llama-2: the Open-Source "ChatGPT" 🦙

👉GenAI, #Meta unveils Llama-2: a collection of LLMs ranging in scale 7-70B params. Challenging with #chatgpt, but open.

😎Review https://t.ly/bLJgP
😎Paper https://t.ly/AOXru
😎Project https://ai.meta.com/llama
🤯192🔥1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍉 AltFreezing: new SOTA in detecting deepfake 🍉

👉#Microsoft unveils AltFreezing: spatial/temporal artifacts in one model for more general face forgery detection

😎Review https://t.ly/mkIKX
😎Paper https://t.ly/z4KnJ
😎Code github.com/ZhendongWang6/AltFreezing
😱6👍5😍4🤯2🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🪟META's Ultra-HD Data for #AR🪟

👉Aria Digital Twin: egocentric dataset for detection/tracking, reconstruction/understanding, S2R learning, pose and more.

😎Review https://t.ly/MRPt1
😎Paper arxiv.org/pdf/2306.06362.pdf
😎Project www.projectaria.com/datasets/adt
😎Code github.com/facebookresearch/projectaria_tools
🔥10👍1
This media is not supported in your browser
VIEW IN TELEGRAM
👩‍🦰 Ultra-Realistic Neural Hair 👩‍🦰

👉A novel method to reconstruct the hair geometry at a strand level from monocular video or multi-view images

😎Review https://t.ly/6xZyp
😎Paper arxiv.org/pdf/2306.05872.pdf
😎Project samsunglabs.github.io/NeuralHaircut
😎Code github.com/SamsungLabs/NeuralHaircut
🤯17🤩5😍5👍21
This media is not supported in your browser
VIEW IN TELEGRAM
💪 Muscles in Action with #AI 💪

👉Muscles in Action (MIA): learn to incorporate muscle activity into human motion representations

😎Review https://t.ly/hUKub
😎Paper arxiv.org/pdf/2212.02978.pdf
😎Project musclesinaction.cs.columbia.edu
🔥7👍2👏2🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🪤 PAPR: Proximity Attention Point Render 🪤

👉PAPR: fast point-based scene representation with differentiable renderer approach

😎Review https://t.ly/yoI0g
😎Paper arxiv.org/pdf/2307.11086.pdf
😎Project https://zvict.github.io/papr
👍2🥰2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🪛 CAD-based Object Segmentation 🪛

👉 A novel three-stage approach to segment unseen objects in RGB images using their CAD models

😎Review https://t.ly/RtHLN
😎Paper arxiv.org/pdf/2307.11067.pdf
😎Code https://github.com/nv-nguyen/cnos
🔥7🤯41😱1🤩1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🛵 ALPR via CTS-Matching 🛵

👉UIT unveils a neural approach (#YOLO5 + tracking + rotation) to improve the license plate recognition accuracy

😎Review https://t.ly/VP4BP
😎Paper arxiv.org/pdf/2307.11336.pdf
😎Code github.com/chequanghuy/Character-Time-series-Matching
🔥92🤯1😱1🤣1
This media is not supported in your browser
VIEW IN TELEGRAM
🥬 Generative AI’s Next Frontiers 🥬

👉Hair simulation, 2D->3D animation, and much more. ~20 papers from #NVIDIA accepted into #SIGGRAPH2023

😎 Review https://t.ly/wgGin
🤯13👍3🤩3🥰1😱1💩1
This media is not supported in your browser
VIEW IN TELEGRAM
🦀 simPLE: learning to grasp only with CAD 🦀

👉simPLE learns to pick, regrasp & place objects precisely, given only the object CAD model and no prior experience

😎Review https://t.ly/ab5pA
😎Paper arxiv.org/pdf/2307.13133.pdf
😎Project mcube.mit.edu/research/simPLE.html
4🔥2👍1👏1🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐧 Track Anything in HQ 🐧

👉Video multi-object segmenter (VMOS) and a mask refiner (MR) to track anything

😎Review https://t.ly/hAvF2
😎Paper arxiv.org/pdf/2307.13974.pdf
😎Code github.com/jiawen-zhu/HQTrack
🔥5🤯2👍1🤩1
🥬Consensus-Adaptive RANSAC🥬

👉Novel RANSAC that learns to explore the parameter space via a novel attention layer

😎Review https://t.ly/eSLmD
😎Paper arxiv.org/pdf/2307.14030.pdf
😎Code github.com/cavalli1234/CA-RANSAC
🔥7🤯3😱1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🍡 DWPose: 2-stage Pose Distillation 🍡

👉 Tsinghua (+IDEA) unveils a novel two-stage pose Distillation for whole-body pose estimation.

😎Review https://t.ly/BSi20
😎Paper arxiv.org/pdf/2307.15880.pdf
😎Code github.com/IDEA-Research/DWPose
🤯72👍1🔥1🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
👗 Multimodal Neural Designer 👗

👉 Multimodal #AI that can generate novel fashion images conditioned on text, keypoints, and sketches

😎Review https://t.ly/zVk70
😎Paper arxiv.org/pdf/2304.02051.pdf
😎Code github.com/aimagelab/multimodal-garment-designer
🥰64🤩3🔥21
This media is not supported in your browser
VIEW IN TELEGRAM
📸 Computational Burst Photography in App 📸

👉#Google unveils a novel computational burst system to democratize the professional photography via smartphone

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io
🔥6🥰3👍2🤩1
This media is not supported in your browser
VIEW IN TELEGRAM
🎠Neural Closed-Loop Simulator🎠

👉A neural sensor simulator that takes a single recorded log captured by a sensor-equipped vehicle and converts it into a realistic closed-loop multi-sensor simulation

😎Review https://t.ly/EcRLc
😎Paper arxiv.org/pdf/2308.01898.pdf
😎Project https://waabi.ai/unisim/
🤯8🤩32👍2🔥1👏1
🙏 A quick poll for helping me in improving the quality of the contents about #computervision.

Please give me a feedback here: https://t.ly/qXb4C

Thanks :)
17👍7🥰1