AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI, with papers. Fresh updates every day on Deep Learning, Machine Learning, and Computer Vision (with papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🍉#AI finds where IG photos are taken🍉

👉Brilliant work by Depoorter, a Belgian artist who deals with #privacy, #AI & #socialmedia

Highlights:
✅Recorded open cameras for weeks
✅Scraped all #Instagram photos
✅Matched Instagram photos against the footage

More: https://bit.ly/3eL5dfc
😱18 👍13 🥰2
🈯SAMURAI: in-the-wild Shape/Material🈯

πŸ‘‰#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Parametrization for varying distances
βœ…Camera multiplex optimization
βœ…Posterior scaling of input images
βœ…Explicit meshes extraction with BRDF
βœ…Code/data soon available ->#NeurIPS

More: https://bit.ly/3BKWgf3
πŸ‘8πŸ”₯1
🟨 Lang<->Pics in 100+ Languages 🟨

πŸ‘‰#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…PaLI: Pathways Lang & Image model
βœ…Answering, captioning, reasoning, etc
βœ…From Eng. to 109 lang. understanding
βœ…The new SOTA on several datasets

More: https://bit.ly/3QMslHC
πŸ”₯6πŸ‘1πŸ’―1
🍐PeRFception: Largest IR Dataset🍐

πŸ‘‰#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…POSTECH + NVIDIA + Caltech = 🀯
βœ…Size: -96.4% from original dataset!
βœ…2D/3D image/object class/semantic
βœ…Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA
❀9❀‍πŸ”₯1πŸ‘1😍1
🐸 CHARL-E: Stable Diffusion in 1 click 🐸

πŸ‘‰CHARL-E packages Stable Diffusion into a simple app.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…No setup, dependencies, or internet
βœ…Images with 1-click on #macbook
βœ…Suitable only for M1/M2 processor
βœ…Source code under MIT license

More: https://bit.ly/3xv2z3G
πŸ”₯11πŸ‘3❀‍πŸ”₯1❀1
🍋YOLOPv2: Better Driving Perception🍋

👉YOLOPv2: simultaneous object detection, road segmentation & lane detection

Highlights:
✅E2E perception net with better backbone
✅Efficient ELAN for reasonable memory
✅Stable when adapting to different scenarios
✅SOTA on BDD100K, +50% faster!
✅Source code under MIT license

More: https://bit.ly/3LvYGBh
🔥12
🍈SegNeXt: new SOTA in Semantic Seg.🍈

πŸ‘‰SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel tailored network architecture
βœ…Spatial attention via multi-scale feats
βœ…Encoder + conv. better than transformers
βœ…SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
πŸ”₯9πŸ‘1
🦪StereoVoxelNet: RT Obstacle Detection🦪

👉Novel deep neural approach to detect occupancy directly from stereo images

Highlights:
✅Occupancy voxels via deep learning
✅RT on Jetson-TX2 (-98% CPU of SOTA)
✅Optimization via octrees / sparse conv.
✅Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
👍10 🥰1
🚜 NeRF-Factory: a NeRF collection 🚜

πŸ‘‰PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF: Project | Paper | Code
βœ…NeRF++: Paper | Code
βœ…DVGO: Project | Paper v1/v2 | Code
βœ…Plenoxels: Project | Paper | Code
βœ…Mip-NeRF: Project | Paper | Code
βœ…Mip-NeRF360: Project | Paper | Code
βœ…Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
πŸ‘7🀯1
🥶 Lumos by #Nvidia: Relighting Portrait 🥶

👉The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/
❤11 👍1
🍜 SURF-GAN: NeRF - >StyleGAN 🍜

πŸ‘‰ Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
πŸ‘4❀2❀‍πŸ”₯1
🔥#Google just announced "TensorStore"🔥

👉Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore
🔥14 👍2
🦠 Motion Transformer for #selfdriving 🦠

πŸ‘‰The 1st place solution for 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR
πŸ”₯17πŸ‘3
💹 Image Synthesis @160+ FPS! 💹

👉Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf
👍3 🤯2 🔥1 💯1
👛 #Nvidia GET3D: #3D generative #AI 👛

👉AI-generated textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
❤‍🔥7 👍5
🔥🔥 IDE-3D: source code is out! 🔥🔥

👉Novel, photorealistic, 3D-aware facial generator: source code just released!

😎Review https://bit.ly/3BNrO2C
😎Project mrtornado24.github.io/IDE-3D/
😎Code github.com/MrTornado24/IDE-3D
😎Paper arxiv.org/pdf/2205.15517.pdf
🤯8 👍5 🔥3 🤩3
🔥Diffusion Model of Neural Checkpoints🔥

👉Conditional diffusion model trained on millions of checkpoints of a given task/architecture 🤯

😎Review https://bit.ly/3SBR4Qb
😎Project www.wpeebles.com/Gpt
😎Code github.com/wpeebles/G.pt
😎Paper arxiv.org/pdf/2209.12892.pdf
🤯5 ❤1
🔥 Semantic VISOR dataset is out! 🔥

👉Segmenting hands / active objects in egocentric video (millions of masks)

😎Review https://bit.ly/3LOBLBv
😎Project epic-kitchens.github.io/VISOR/
😎Paper arxiv.org/pdf/2209.13064.pdf
🤯8 🔥4 👍1
🥇🥇 Olympic Games in 2028? 🥇🥇

👉In a few years, the fastest runner on Earth will not be a human 🥶

😎Review https://bit.ly/3Rme3O3
😱8 👍3 👎1