AI with Papers - Artificial Intelligence & Deep Learning
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🥬 Generative AI’s Next Frontiers 🥬

👉Hair simulation, 2D->3D animation, and much more. ~20 papers from #NVIDIA accepted into #SIGGRAPH2023

😎 Review https://t.ly/wgGin
🦀 simPLE: learning to grasp only with CAD 🦀

👉simPLE learns to pick, regrasp & place objects precisely, given only the object CAD model and no prior experience

😎Review https://t.ly/ab5pA
😎Paper arxiv.org/pdf/2307.13133.pdf
😎Project mcube.mit.edu/research/simPLE.html
🐧 Track Anything in HQ 🐧

👉HQTrack combines a video multi-object segmenter (VMOS) with a mask refiner (MR) to track anything in video

😎Review https://t.ly/hAvF2
😎Paper arxiv.org/pdf/2307.13974.pdf
😎Code github.com/jiawen-zhu/HQTrack
🥬Consensus-Adaptive RANSAC🥬

👉A RANSAC variant that learns to explore the parameter space via a novel attention layer

😎Review https://t.ly/eSLmD
😎Paper arxiv.org/pdf/2307.14030.pdf
😎Code github.com/cavalli1234/CA-RANSAC
🍡 DWPose: 2-stage Pose Distillation 🍡

👉 Tsinghua (+IDEA) unveils a novel two-stage distillation method for whole-body pose estimation.

😎Review https://t.ly/BSi20
😎Paper arxiv.org/pdf/2307.15880.pdf
😎Code github.com/IDEA-Research/DWPose
👗 Multimodal Neural Designer 👗

👉 Multimodal #AI that can generate novel fashion images conditioned on text, keypoints, and sketches

😎Review https://t.ly/zVk70
😎Paper arxiv.org/pdf/2304.02051.pdf
😎Code github.com/aimagelab/multimodal-garment-designer
📸 Computational Burst Photography in App 📸

👉#Google unveils a novel computational burst system to democratize professional photography on smartphones

😎Review https://t.ly/5ibJX
😎Paper arxiv.org/pdf/2308.01379.pdf
😎Project https://motion-mode.github.io
🎠Neural Closed-Loop Simulator🎠

👉A neural sensor simulator that takes a single recorded log captured by a sensor-equipped vehicle and converts it into a realistic closed-loop multi-sensor simulation

😎Review https://t.ly/EcRLc
😎Paper arxiv.org/pdf/2308.01898.pdf
😎Project https://waabi.ai/unisim/
🙏 A quick poll to help me improve the quality of the content about #computervision.

Please give me your feedback here: https://t.ly/qXb4C

Thanks :)
🪛 HANDAL: Real-World Manipulable Objects 🪛

👉 #Nvidia unveils the HANDAL dataset for category-level object pose estimation and affordance prediction

😎Review https://t.ly/MXZDI
😎Paper arxiv.org/pdf/2308.01477.pdf
😎Dataset wenbowen123.github.io/handaldataset
🎨 Interactive Neural Painting 🎨

👉 Novel AI-powered tool to help artists complete their artworks

😎Review https://t.ly/ELUb0
😎Paper arxiv.org/pdf/2307.16441.pdf
😎Project helia95.github.io/inp-website
😎Supp helia95.github.io/inp-website/supp_mat.html
👩‍🚀 HD Avatar via Text & Pose 👩‍🚀

👉 Generating expressive #3D avatars from nothing but text descriptions & pose guidance

😎Review https://t.ly/wrSMH
😎Paper arxiv.org/pdf/2308.03610.pdf
😎Project avatarverse3d.github.io
🐘 Controllable Synthetic Data (extending ImageNet) 🐘

👉#META's PUG, a new generation of interactive environments for representation learning. Extending ImageNet!

😎Review https://t.ly/nCYs0
😎Paper arxiv.org/pdf/2308.03977.pdf
😎Project pug.metademolab.com
😎Code github.com/facebookresearch/PUG
🌈 Tracking by Persistent Dynamic View Synthesis 🌈

👉A novel method that simultaneously addresses dynamic-scene novel-view synthesis and 6-DOF tracking of all dense scene elements

😎Review https://t.ly/Bc535
😎Paper arxiv.org/pdf/2308.09713.pdf
😎Project dynamic3dgaussians.github.io
😎Code github.com/JonathonLuiten/Dynamic3DGaussians
🛒 Digital Twins for AutoRetail Checkout 🛒

👉From #Nvidia, a novel approach that uses 3D assets to train 2D detection and tracking models for AutoRetail Checkout

😎Review https://t.ly/Ea7kt
😎Paper arxiv.org/pdf/2308.09708.pdf
😎Code github.com/yorkeyao/Automated-Retail-Checkout
🥎SportsMOT + MixSort = Sport MOT🥎

👉Nanjing just released a multi-object tracking (MOT) dataset for sports scenes + the SOTA code/model for tracking (MixSort)

😎Review https://t.ly/NHUxL
😎Paper arxiv.org/pdf/2304.05170.pdf
😎Code github.com/MCG-NJU/MixSort
😎Project deeperaction.github.io/datasets/sportsmot.html
⚡️Feature Matching at Light Speed⚡️

👉LightGlue is a lightweight feature matcher with high accuracy and blazing fast inference

😎Review https://t.ly/jkecX
😎Paper arxiv.org/pdf/2306.13643.pdf
😎Code github.com/cvg/LightGlue
🕹️ CoDeF: Video Content Deformation Fields 🕹️

👉CoDeF is a new type of video representation for video-editing tasks

😎Review https://t.ly/PIVl-
😎Paper arxiv.org/pdf/2308.07926.pdf
😎Project https://qiuyu96.github.io/CoDeF
😎Code https://github.com/qiuyu96/CoDeF