AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
πŸ‡ SplineCam: Neural Decision Boundary πŸ‡

πŸ‘‰#META -> SplineCam: a step towards neural visualization / interpretability

😎Review https://bit.ly/3mgoOaH
😎Paper arxiv.org/pdf/2302.12828.pdf
😎Project imtiazhumayun.github.io/splinecam
😎Code github.com/AhmedImtiazPrio/SplineCAM
🀯8πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘‘ ControNet: Conditional Control of Diffusion πŸ‘‘

πŸ‘‰Controlling Stable Diffusion via conditional inputs like edges, segmentation, keypoints, etc. Extra: a super-nice tutorial.

😎Review https://bit.ly/3YgjrWt
😎Paper arxiv.org/pdf/2302.05543.pdf
😎Code github.com/lllyasviel/ControlNet
😎Tutorial https://github.com/Mikubill/sd-webui-controlnet/discussions/204
🀯15πŸ‘8πŸ”₯3❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›Έ TAU: video traffic analytics via UAVs πŸ›Έ

πŸ‘‰ Prince Sultan University unveils TAU: AI-integrated video analytics framework from UAVs' POV

😎Review https://bit.ly/3EQIh8F
😎Paper arxiv.org/pdf/2303.00337.pdf
😎Project github.com/bilel-bj/TAU
πŸ”₯10πŸ‘3πŸ₯°1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🩻 Independent Tokens for 3D Human 🩻

πŸ‘‰Tencent open-sourcing a novel method to estimate #3D human pose and shape from monocular videos

😎Review https://bit.ly/3Zz0uiH
😎Paper arxiv.org/pdf/2303.00298.pdf
😎Code github.com/yangsenius/INT_HMR_Model
😎Project yangsenius.github.io/INT_HMR_Model/index.html
πŸ”₯5πŸ‘1😒1
This media is not supported in your browser
VIEW IN TELEGRAM
🌸 3DGP: ImageNet in #3D 🌸

πŸ‘‰ Snap unveils 3DGP: a novel 3D generator with Generic Priors

😎Review https://bit.ly/3KWHUgG
😎Paper arxiv.org/pdf/2303.01416.pdf
😎Project snap-research.github.io/3dgp/
😎Code github.com/snap-research/3dgp
πŸ”₯8⚑1πŸ‘1
Media is too big
VIEW IN TELEGRAM
πŸ—ΊοΈ S-NeRF: NeRF for Street Views πŸ—ΊοΈ

πŸ‘‰S-NeRF: novel view synthesis of streets & foreground moving vehicles jointly

😎Review https://bit.ly/3KZUN9w
😎Paper arxiv.org/pdf/2303.00749.pdf
😎Project ziyang-xie.github.io/s-nerf/
😎Code (soon)
πŸ‘9πŸ”₯3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 MobileBrick: #3D object on mobile 🧱

πŸ‘‰#Apple (+Oxford) exploiting #LEGO bricks to open the most precise #3D dataset ever. Suitable for mobile #AR

😎Review https://bit.ly/3ZqbiAh
😎Paper arxiv.org/pdf/2303.01932.pdf
😎Project code.active.vision/MobileBrick/
😎Code github.com/ActiveVisionLab/MobileBrick
πŸ”₯6πŸ‘2😱1
This media is not supported in your browser
VIEW IN TELEGRAM
⚠️ BREAKING: Stability Acquires Init ML ⚠️

πŸ‘‰ Stability AI (#stablediffusion) announces the acquisition of Clipdrop makers 🀯

πŸ‘‰ More: https://bit.ly/3JhKkVO
🀯7❀2πŸ‘2πŸ₯°1πŸ‘1😱1🍾1
🐠🐑 DeepSeeColor: Under the Sea in Colors 🐑🐠

πŸ‘‰ DeepSeeColor: real-time adaptive color correction under the sea

😎Review https://bit.ly/3mxGuyP
😎Paper arxiv.org/pdf/2303.04025.pdf
😎Code warp.whoi.edu/deepseecolor
πŸ₯°16πŸ‘4πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🧩 X-Avatar: Expressive Human Avatars 🧩

πŸ‘‰ETHZ (+Microsoft) unveils X-Avatar: digital expressive humans for life-like experiences (#AR / #VR)

😎Review https://bit.ly/3J6YDLH
😎Paper arxiv.org/pdf/2303.04805.pdf
😎Project https://skype-line.github.io/projects/X-Avatar/
😎Code https://github.com/Skype-line/X-Avatar
πŸ”₯4🀯3πŸ‘1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš€ Visual ChatGPT: Talking, Drawing & Editing πŸš€

πŸ‘‰ Microsoft extends ChatGPT to handle visual information

😎Review https://bit.ly/3LkSJJD
😎Paper arxiv.org/pdf/2303.04671.pdf
😎Code github.com/microsoft/visual-chatgpt
🀯13πŸ”₯8❀7πŸ‘3😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🐒 GigaGAN: ultra-GAN Synthesis 🐒

πŸ‘‰ Novel method that far exceeds the previous limits of GAN -> ultra HD images!

😎Review https://bit.ly/3ZYnZBX
😎Paper arxiv.org/pdf/2303.05511.pdf
😎Project mingukkang.github.io/GigaGAN
πŸ”₯30❀6🀩1
πŸŽ€ The baby is born: GPT-4 is out! πŸŽ€

πŸ‘‰GPT-4 is the new LLM (accepting image and text inputs, emitting text outputs) with human-level performance on various professional and academic benchmarks

😎More: https://bit.ly/3LntuWL
πŸ”₯40🀯6😱5πŸ’©5❀4πŸ‘1πŸ‘1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🐍ViperGPT: Visual Inference / Reasoning🐍

πŸ‘‰ViperGPT: vision/language -> code to produce results for any query

😎Review https://bit.ly/3n6yw01
😎Paper arxiv.org/pdf/2303.08128.pdf
😎Project viper.cs.columbia.edu/
😎Code github.com/cvlab-columbia/viper
🀯13πŸ”₯6❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’€ SDFStudio: Unified Surface Reconstruction πŸ’€

πŸ‘‰A unified (and open) framework for neural implicit surface reconstruction

😎Review https://bit.ly/3yUvuP2
😎Project autonomousvision.github.io/sdfstudio
😎Code github.com/autonomousvision/sdfstudio
😎Doc github.com/autonomousvision/sdfstudio/blob/master/docs/sdfstudio-methods.md
🀯5πŸ‘2πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“˜ e-book: Understanding Deep Learning πŸ“˜

πŸ‘‰From MIT Press, a free & fresh book for mastering the Deep Learning & #AI

😎Download https://bit.ly/3JUHMNv
❀14πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›Œ CC3D: Layout-Conditioned 3D πŸ›Œ

πŸ‘‰A novel step towards the realistic multi-object scene generation

😎Review https://bit.ly/3JXG0v3
😎Paper arxiv.org/pdf/2303.12074.pdf
😎Project sherwinbahmani.github.io/cc3d
😎Code github.com/sherwinbahmani/cc3d
πŸ”₯5❀2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 #3D editing via text instructions 🌈

πŸ‘‰A novel method for editing NeRF scenes with text-instructions

😎Review https://bit.ly/3K3putM
😎Paper arxiv.org/pdf/2303.12789.pdf
😎Project instruct-nerf2nerf.github.io
πŸ”₯11🀯3❀1πŸ‘1