AI with Papers - Artificial Intelligence & Deep Learning
All the AI, with papers. Fresh daily updates on Deep Learning, Machine Learning, and Computer Vision (with papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🉐#AI finds where IG photos are taken🉐

👉Brilliant work by Depoorter, a Belgian artist who works with #privacy, #AI & #socialmedia

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Recorded open cameras for weeks
Scraped #Instagram photos tagged at those locations
Matched Instagram photos against the footage

More: https://bit.ly/3eL5dfc
😱18👍13🥰2
🈯SAMURAI: in-the-wild Shape/Material🈯

👉#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Parametrization for varying distances
Camera multiplex optimization
Posterior scaling of input images
Explicit meshes extraction with BRDF
Code/data available soon -> #NeurIPS

More: https://bit.ly/3BKWgf3
👍8🔥1
🟨 Lang<->Pics in 100+ Languages 🟨

👉#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
PaLI: Pathways Lang & Image model
Answering, captioning, reasoning, etc
From Eng. to 109 lang. understanding
The new SOTA on several datasets

More: https://bit.ly/3QMslHC
🔥6👍1💯1
🍐PeRFception: Largest Implicit Representation Dataset🍐

👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
POSTECH + NVIDIA + Caltech = 🤯
Size: -96.4% from original dataset!
2D image / 3D object classification, semantic segmentation
Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA
9❤‍🔥1👍1😍1
🐸 CHARL-E: Stable Diffusion in 1 click 🐸

👉CHARL-E packages Stable Diffusion into a simple app.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
No setup, dependencies, or internet
Images with 1-click on #macbook
Runs only on M1/M2 processors
Source code under MIT license

More: https://bit.ly/3xv2z3G
🔥11👍3❤‍🔥11
🍋YOLOPv2: Better Driving Perception🍋

👉YOLOPv2: simultaneous object detection, drivable-area segmentation & lane detection

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
E2E perception net with better backbone
Efficient ELAN for reasonable memory
More stable when adapting to varied scenarios
SOTA on BDD100K, +50% faster!
Source code under MIT license

More: https://bit.ly/3LvYGBh
🔥12
🍈SegNeXt: new SOTA in Semantic Seg.🍈

👉SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Novel tailored network architecture
Spatial attention via multi-scale feats
Encoder + conv. better than transformers
SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
🔥9👍1
🦪StereoVoxelNet: RT Obstacle Detection🦪

👉Novel deep neural approach to detect occupancy from stereo images directly

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
Occupancy voxels via deep learning
RT on Jetson-TX2 (-98% CPU of SOTA)
Optimization via octrees / sparse conv.
Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
👍10🥰1
🚜 NeRF-Factory: a NeRF collection 🚜

👉PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
NeRF: Project | Paper | Code
NeRF++: Paper | Code
DVGO: Project | Paper v1/v2 | Code
Plenoxels: Project | Paper | Code
Mip-NeRF: Project | Paper | Code
Mip-NeRF360: Project | Paper | Code
Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
👍7🤯1
🥶 Lumos by #Nvidia: Relighting Portrait 🥶

👉The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/
11👍1
🍜 SURF-GAN: NeRF -> StyleGAN 🍜

👉 Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
👍42❤‍🔥1
🔥#Google just announced "TensorStore"🔥

👉Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore
🔥14👍2
🦠 Motion Transformer for #selfdriving 🦠

👉The 1st place solution for 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR
🔥17👍3
💹 Image Synthesis @160+ FPS! 💹

👉Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf
👏3🤯2🔥1💯1
👛 #Nvidia GET3D: #3D generative #AI 👛

👉AI-generated textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf
❤‍🔥7👍5
🔥🔥 IDE-3D: source code is out! 🔥🔥

👉Novel, photorealistic, 3D-aware facial generator: source code just released!

😎Review https://bit.ly/3BNrO2C
😎Project mrtornado24.github.io/IDE-3D/
😎Code github.com/MrTornado24/IDE-3D
😎Paper arxiv.org/pdf/2205.15517.pdf
🤯8👍5🔥3🤩3
🔥Diffusion Model of Neural Checkpoints🔥

👉Conditional diffusion model trained on millions of checkpoints of a given task/architecture 🤯

😎Review https://bit.ly/3SBR4Qb
😎Project www.wpeebles.com/Gpt
😎Code github.com/wpeebles/G.pt
😎Paper arxiv.org/pdf/2209.12892.pdf
🤯51
🔥 Semantic VISOR dataset is out! 🔥

👉Segmenting hands / active objects in egocentric video (millions of masks)

😎Review https://bit.ly/3LOBLBv
😎Project epic-kitchens.github.io/VISOR/
😎Paper arxiv.org/pdf/2209.13064.pdf
🤯8🔥4👍1
🥇🥇 Olympic Games in 2028? 🥇🥇

👉 In a few years, the fastest runner on earth will not be a human 🥶

😎Review https://bit.ly/3Rme3O3
😱8👍3👎1
🔥 SOTA ALERT: new Text-to-Video #AI 🔥

👉#META unveils a novel Text-to-Video (T2V) generation #AI

😎Review https://bit.ly/3E1ZDzG
😎Project https://makeavideo.studio/
😎Paper makeavideo.studio/Make-A-Video.pdf
🤯9👍6😱1💩1