AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
236 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

💜 #Selfdriving in the '80s. Damn Romantic 💜

👉The first self-driving car with people on board, 1986. So slow and lovely.

More: https://bit.ly/3BtRDon

🏵️ TORAS: SOTA #AI for annotation 🏵️

👉TORAS: web-based, AI-powered, cooperative annotation platform.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅SOTA AI tools -> significant speedup
✅"Recipes" to define how to annotate
✅Repo with folder structure for storage
✅Also on-prem for (commercial) firms

More: https://bit.ly/3L78YI2

💮MAXIM: Multi-Axis MLP for Vision💮

👉#Google open-sources MAXIM, a multi-axis MLP for low-level vision

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Denoising, deblurring, dehazing, etc.
✅Multi-axis gated MLP, linear complexity
✅Cross gating block, separate features
✅SOTA results on several datasets!

More: https://bit.ly/3Dmp8LI

🔥 A Survey on Diffusion Models 🔥

👉A comprehensive review of denoising diffusion models in #computervision 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Overview of diffusion models
✅Hot trend in generative AI
✅A multi-perspective categorization
✅Current limitations / new directions

More: https://bit.ly/3RYG5zP

#AI finds where IG photos are taken

👉Brilliant work by Depoorter, a Belgian artist who works on #privacy, #AI & #socialmedia

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Recorded open cameras for weeks
✅Scraped all #Instagram photos
✅Matched Instagram photos against the footage

More: https://bit.ly/3eL5dfc

🈯SAMURAI: in-the-wild Shape/Material🈯

👉#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parametrization for varying distances
✅Camera multiplex optimization
✅Posterior scaling of input images
✅Explicit mesh extraction with BRDF
✅Code/data soon available ->#NeurIPS

More: https://bit.ly/3BKWgf3

🟨 Lang<->Pics in 100+ Languages 🟨

👉#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅PaLI: Pathways Lang & Image model
✅Answering, captioning, reasoning, etc.
✅From Eng. to 109 lang. understanding
✅The new SOTA on several datasets

More: https://bit.ly/3QMslHC

PeRFception: Largest IR Dataset

👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅POSTECH + NVIDIA + Caltech = 🤯
✅Size: -96.4% from original dataset!
✅2D/3D image/object class/semantic
✅Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA

CHARL-E: Stable Diffusion in 1 click

👉CHARL-E packages Stable Diffusion into a simple app.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅No setup, dependencies, or internet
✅Images with 1-click on #macbook
✅Runs only on M1/M2 processors
✅Source code under MIT license

More: https://bit.ly/3xv2z3G

YOLOPv2: Better Driving Perception

👉YOLOPv2: simultaneous object detection, road segmentation & lane detection

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅E2E perception net with better backbone
✅Efficient ELAN for reasonable memory
✅Stability for adapting to scenarios
✅SOTA on BDD100K, +50% faster!
✅Source code under MIT license

More: https://bit.ly/3LvYGBh

SegNeXt: new SOTA in Semantic Seg.

👉SOTA (by a large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel tailored network architecture
✅Spatial attention via multi-scale feats
✅Encoder + conv. better than transformers
✅SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH

🦪StereoVoxelNet: RT Obstacle Detection🦪

👉Novel deep neural approach to detect occupancy directly from stereo images

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Occupancy voxels via deep learning
✅RT on Jetson-TX2 (-98% CPU of SOTA)
✅Optimization via octrees / sparse conv.
✅Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3

🚜 NeRF-Factory: a NeRF collection 🚜

👉PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF: Project | Paper | Code
✅NeRF++: Paper | Code
✅DVGO: Project | Paper v1/v2 | Code
✅Plenoxels: Project | Paper | Code
✅Mip-NeRF: Project | Paper | Code
✅Mip-NeRF360: Project | Paper | Code
✅Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC

🥶 Lumos by #Nvidia: Relighting Portrait 🥶

👉The new SOTA in relighting without requiring a light stage

😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/

SURF-GAN: NeRF -> StyleGAN

👉Editable portraits by injecting the NeRF's prior into StyleGAN

😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN

🔥#Google just announced "TensorStore"🔥

👉Novel open-source C++ / #Python library for storage/manipulation of high-dim data

😎Review https://bit.ly/3DLwbha
😎Project https://bit.ly/3C4T2TR
😎Code github.com/google/tensorstore

🦠 Motion Transformer for #selfdriving 🦠

👉The 1st place solution for the 2022 #waymo "motion prediction" challenge

😎Review https://bit.ly/3f8G4LD
😎Paper arxiv.org/pdf/2209.10033.pdf
😎Code github.com/sshaoshuai/MTR

💹 Image Synthesis @160+ FPS! 💹

👉Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

😎Review https://bit.ly/3r3ZNij
😎Paper arxiv.org/pdf/2206.07695.pdf
😎Project katjaschwarz.github.io/voxgraf

👛 #Nvidia GET3D: #3D generative #AI 👛

👉AI-based Textured 3D meshes with complex topology, rich geometry & hi-fi textures

😎Review https://bit.ly/3SgnT5h
😎Code github.com/nv-tlabs/GET3D
😎Project nv-tlabs.github.io/GET3D/
😎Paper nv-tlabs.github.io/GET3D/assets/paper.pdf