AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸŸจ Lang<->Pics in 100+ Languages ๐ŸŸจ

๐Ÿ‘‰#Google PaLI: unified lang-image #AI to perform tasks in 109 languages ๐Ÿคฏ

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…PaLI: Pathways Lang & Image model
โœ…Answering, captioning, reasoning, etc
โœ…From Eng. to 109 lang. understanding
โœ…The new SOTA on several datasets

More: https://bit.ly/3QMslHC
๐Ÿ”ฅ6๐Ÿ‘1๐Ÿ’ฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸPeRFception: Largest IR Dataset๐Ÿ

๐Ÿ‘‰#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…POSTECH + NVIDIA + Caltech = ๐Ÿคฏ
โœ…Size: -96.4% from original dataset!
โœ…2D/3D image/object class/semantic
โœ…Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA
โค9โคโ€๐Ÿ”ฅ1๐Ÿ‘1๐Ÿ˜1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿธ CHARL-E: Stable Diffusion in 1 click ๐Ÿธ

๐Ÿ‘‰CHARL-E packages Stable Diffusion into a simple app.

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…No setup, dependencies, or internet
โœ…Images with 1-click on #macbook
โœ…Suitable only for M1/M2 processor
โœ…Source code under MIT license

More: https://bit.ly/3xv2z3G
๐Ÿ”ฅ11๐Ÿ‘3โคโ€๐Ÿ”ฅ1โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‹YOLOPv2: Better Driving Perception๐Ÿ‹

๐Ÿ‘‰YOLOPv2: simultaneous object, road segmentation & lane detection

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…E2E perception net with better backbone
โœ…Efficient ELAN for reasonable memory
โœ…Stability for adapting to scenarios
โœ…SOTA on BDD100K, +50% faster!
โœ…Source code under MIT license

More: https://bit.ly/3LvYGBh
๐Ÿ”ฅ12
๐ŸˆSegNeXt: new SOTA in Semantic Seg.๐Ÿˆ

๐Ÿ‘‰SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID ๐Ÿคฏ

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Novel tailored network architecture
โœ…Spatial attention via multi-scale feats
โœ…Encoder + conv. better than transformers
โœ…SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH
๐Ÿ”ฅ9๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ŸฆชStereoVoxelNet: RT Obstacles Detection๐Ÿฆช

๐Ÿ‘‰Novel deep neural approach to detect occupancy from stereo images directly

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…Occupancy voxels via deep learning
โœ…RT on Jetson-TX2 (-98% CPU of SOTA)
โœ…Optimization via octrees / sparse conv.
โœ…Real-world stereo in/outdoor dataset

More: https://bit.ly/3BylAn3
๐Ÿ‘10๐Ÿฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿšœ NeRF-Factory: a NeRF collection ๐Ÿšœ

๐Ÿ‘‰PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets

๐‡๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โœ…NeRF: Project | Paper | Code
โœ…NeRF++: Paper | Code
โœ…DVGO: Project | Paper v1/v2 | Code
โœ…Plenoxels: Project | Paper | Code
โœ…Mip-NeRF: Project | Paper | Code
โœ…Mip-NeRF360: Project | Paper | Code
โœ…Ref-NeRF: Project | Paper | Code

More: https://bit.ly/3qUgmgC
๐Ÿ‘7๐Ÿคฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅถ Lumos by #Nvidia: Relighting Portrait ๐Ÿฅถ

๐Ÿ‘‰The new SOTA in relighting without requiring a light stage

๐Ÿ˜ŽReview https://bit.ly/3dCH9ej
๐Ÿ˜ŽProject deepimagination.cc/Lumos
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.10510.pdf
๐Ÿ˜ŽDemo http://imaginaire.cc/Lumos/
โค11๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿœ SURF-GAN: NeRF - >StyleGAN ๐Ÿœ

๐Ÿ‘‰ Editable portraits by injecting the NeRF's prior into StyleGAN

๐Ÿ˜ŽReview https://bit.ly/3SohEw3
๐Ÿ˜ŽProject jgkwak95.github.io/surfgan
๐Ÿ˜ŽPaper arxiv.org/pdf/2207.10257.pdf
๐Ÿ˜ŽCode github.com/jgkwak95/SURF-GAN
๐Ÿ‘4โค2โคโ€๐Ÿ”ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ#Google just announced "TensorStore"๐Ÿ”ฅ

๐Ÿ‘‰Novel open-source C++ / #Python library for storage/manipulation of high-dim data

๐Ÿ˜ŽReview https://bit.ly/3DLwbha
๐Ÿ˜ŽProject https://bit.ly/3C4T2TR
๐Ÿ˜ŽCode github.com/google/tensorstore
๐Ÿ”ฅ14๐Ÿ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฆ  Motion Transformer for #selfdriving ๐Ÿฆ 

๐Ÿ‘‰The 1st place solution for 2022 #waymo "motion prediction" challenge

๐Ÿ˜ŽReview https://bit.ly/3f8G4LD
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.10033.pdf
๐Ÿ˜ŽCode github.com/sshaoshuai/MTR
๐Ÿ”ฅ17๐Ÿ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ’น Image Synthesis @160+ FPS! ๐Ÿ’น

๐Ÿ‘‰Super-fast, 3D-Aware Image Synthesis with Sparse Voxels -> up to 167 FPS!

๐Ÿ˜ŽReview https://bit.ly/3r3ZNij
๐Ÿ˜ŽPaper arxiv.org/pdf/2206.07695.pdf
๐Ÿ˜ŽProject katjaschwarz.github.io/voxgraf
๐Ÿ‘3๐Ÿคฏ2๐Ÿ”ฅ1๐Ÿ’ฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ‘› #Nvidia GET3D: #3D generative #AI ๐Ÿ‘›

๐Ÿ‘‰AI-based Textured 3D meshes with complex topology, rich geometry & hi-fi textures

๐Ÿ˜ŽReview https://bit.ly/3SgnT5h
๐Ÿ˜ŽCode github.com/nv-tlabs/GET3D
๐Ÿ˜ŽProject nv-tlabs.github.io/GET3D/
๐Ÿ˜ŽPaper nv-tlabs.github.io/GET3D/assets/paper.pdf
โคโ€๐Ÿ”ฅ7๐Ÿ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ๐Ÿ”ฅ IDE-3D: source code is out! ๐Ÿ”ฅ๐Ÿ”ฅ

๐Ÿ‘‰Novel, photorealistic, 3D-aware facial generator: source code just released!

๐Ÿ˜ŽReview https://bit.ly/3BNrO2C
๐Ÿ˜ŽProject mrtornado24.github.io/IDE-3D/
๐Ÿ˜ŽCode github.com/MrTornado24/IDE-3D
๐Ÿ˜ŽPaper arxiv.org/pdf/2205.15517.pdf
๐Ÿคฏ8๐Ÿ‘5๐Ÿ”ฅ3๐Ÿคฉ3
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅDiffusion Model of Neural Checkpoints๐Ÿ”ฅ

๐Ÿ‘‰Conditional diffusion model on Millions of checkpoints of a given task/architecture ๐Ÿคฏ

๐Ÿ˜ŽReview https://bit.ly/3SBR4Qb
๐Ÿ˜ŽProject www.wpeebles.com/Gpt
๐Ÿ˜ŽCode github.com/wpeebles/G.pt
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.12892.pdf
๐Ÿคฏ5โค1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ Semantic VISOR dataset is out! ๐Ÿ”ฅ

๐Ÿ‘‰Segmenting hands / active objects in egocentric video (millions masks)

๐Ÿ˜ŽReview https://bit.ly/3LOBLBv
๐Ÿ˜ŽProject epic-kitchens.github.io/VISOR/
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.13064.pdf
๐Ÿคฏ8๐Ÿ”ฅ4๐Ÿ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿฅ‡๐Ÿฅ‡ Olympic Games in 2028? ๐Ÿฅ‡๐Ÿฅ‡

๐Ÿ‘‰ In a few years, the fastest runner on earth will not be a human ๐Ÿฅถ

๐Ÿ˜ŽReview https://bit.ly/3Rme3O3
๐Ÿ˜ฑ8๐Ÿ‘3๐Ÿ‘Ž1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅ SOTA ALERT: new Text-to-Video #AI ๐Ÿ”ฅ

๐Ÿ‘‰#META unveils a novel Text-to-Video (T2V) generation #AI

๐Ÿ˜ŽReview https://bit.ly/3E1ZDzG
๐Ÿ˜ŽProject https://makeavideo.studio/
๐Ÿ˜ŽPaper makeavideo.studio/Make-A-Video.pdf
๐Ÿคฏ9๐Ÿ‘6๐Ÿ˜ฑ1๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿ”ฅDreamFusion: Text-to-3D via Diffusion๐Ÿ”ฅ

๐Ÿ‘‰DeepDream-like procedure to create #3D assets just from a given text

๐Ÿ˜ŽReview https://bit.ly/3BYY5nu
๐Ÿ˜ŽPaper arxiv.org/pdf/2209.14988.pdf
๐Ÿ˜ŽProject dreamfusion3d.github.io/gallery.html
๐Ÿคฏ12๐Ÿ‘5๐Ÿ’ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Ÿงช Light Field Neural Rendering ๐Ÿงช

๐Ÿ‘‰Two-stage transformer capable of non-Lambertian effects (reflection, refraction, translucency)

๐Ÿ˜ŽReview https://bit.ly/3CpIFdm
๐Ÿ˜ŽPaper arxiv.org/pdf/2112.09687.pdf
๐Ÿ˜ŽProject light-field-neural-rendering.github.io
๐Ÿ˜ŽCode github.com/google-research/google-research/tree/master/light_field_neural_rendering
๐Ÿคฏ14๐Ÿ‘1