AI with Papers - Artificial Intelligence & Deep Learning

🦎 VMT: Video Mask Transfiner 🦎

👉Novel highly efficient ViT structure for video instance segmentation.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅HD & more temporally stable mask
✅Higher resolution features for VIS
✅Detecting error-prone spatio-temporal regions
✅Auto-refinement on training data!

More: https://bit.ly/3RKXtb4


🤯 #StableDiffusion + #Dallemini = BOOM! 🤯

👉A #colab notebook that combines Stable Diffusion + DALL-E Mini (Craiyon)

More: https://bit.ly/3TTOshR


🐠VIS - Deformable Transformers 🐠

👉DeVIS: VIS method with efficiency and performance of deformable ViT

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Temp. multi-scale D-Attention
✅Instance-aware object queries
✅Mask: DA + multi-scale feats map
✅Improved multi-cue clip tracking
✅SOTA on YouTube-VIS 2021/OVIS

More: https://bit.ly/3TQv1Xc


🌈 X-NeRF: Cross-Spectral NeRF 🌈

👉Cross-Spectral NeRF from cams with different light spectrums

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅First ever cross-spectral NeRF
✅Avoiding non-trivial calib/match
✅Normalized Cross-Device Coords
✅Novel dataset w/ RGB, MS, & IR

More: https://bit.ly/3RqHnUo


👹TT-GNeRF: generative NeRF for Faces👹

👉TT-GNeRF: a novel 3D-aware GAN based on generative NeRF for faces

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + Uni_Trento + #Snap 🤯
✅DAEM for disentanglement of 3D model
✅"Training-as-Init, Optimizing-for-Tuning"
✅Consistency++, preserving non-target ROI
✅Unsupervised optimization of geometry

More: https://bit.ly/3ARZmMw


🎪 SOTA in Arbitrary Shape Text Detection 🎪

👉Novel unified coarse-to-fine Transformer for arbitrary shape text detection

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Coarse-to-fine arbitrary text detection
✅Accurate text detection, NO post-process
✅Boundary proposal generation mechanism
✅Innovative boundary transformer (iterative)
✅Boundary energy loss (BEL) for refinement

More: https://bit.ly/3D6Ryt4


🐲 Open-Source Self-Driving projects 🐲

👉A free repo with many autonomous vehicle-related projects

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Basic/Advanced Lane/Line Detection
✅Driving behavior by training & validating
✅Autopilot: predicting steering angle

More: https://bit.ly/3qqJ7RB


🥤K-VIL: Keypoint-based visual imitation🥤

👉K-VIL: auto-incremental extraction of object-centric task representation.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient task-relevant keypoints
✅Embodiment-independent tasks
✅Adaptation of tasks to new scenes
✅Input: only a small set of demo clips
✅Novel keypoint-based controller

More: https://bit.ly/3eIrxpP


💜 #Selfdriving in the '80s. Damn Romantic 💜

👉The first self-driving car with people on board, 1986. So slow and lovely.

More: https://bit.ly/3BtRDon


🏵️ TORAS: SOTA #AI for annotation 🏵️

👉TORAS: web-based, AI-powered, cooperative annotation platform.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅SOTA AI tools -> significant speedup
✅"Recipes" to define how to annotate
✅Repo with folder structure for storage
✅Also on-prem for (commercial) firms

More: https://bit.ly/3L78YI2


💮MAXIM: Multi-Axis MLP for Vision💮

👉#Google open-sources MAXIM, a multi-axis MLP for low-level vision

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Denoising, deblurring, dehazing, etc
✅Multi-axis gated MLP, linear complexity
✅Cross gating block, separate features
✅SOTA results on several datasets!

More: https://bit.ly/3Dmp8LI


🔥 A Survey on Diffusion Models 🔥

👉A comprehensive review of denoising diffusion models in #computervision 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Overview on diffusion models
✅Hot trend in generative AI
✅A multi-perspective categorization
✅Current limitations / new directions

More: https://bit.ly/3RYG5zP


🦋Transf-Codebook HD-Face Restoration🦋

👉S-Lab unveils CodeFormer: hyper-detailed face restoration from degraded clips

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Face restoration as a code prediction
✅Discrete CB prior in small proxy space
✅Controllable transformation for LQ->HQ
✅Robustness…

🔥🔥 UPDATE 🔥🔥

Code Released: https://github.com/sczhou/CodeFormer


🉐#AI finds where IG photos are taken🉐

👉Brilliant work by Depoorter, a Belgian artist who deals with #privacy, #AI & #socialmedia

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Recorded open cameras for weeks
✅Scraped all #Instagram photos
✅Matching Instagram vs. footage

More: https://bit.ly/3eL5dfc


🈯SAMURAI: in-the-wild Shape/Material🈯

👉#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parametrization for varying distances
✅Camera multiplex optimization
✅Posterior scaling of input images
✅Explicit mesh extraction with BRDF
✅Code/data soon available ->#NeurIPS

More: https://bit.ly/3BKWgf3


🟨 Lang<->Pics in 100+ Languages 🟨

👉#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅PaLI: Pathways Lang & Image model
✅Answering, captioning, reasoning, etc
✅From Eng. to 109 lang. understanding
✅The new SOTA on several datasets

More: https://bit.ly/3QMslHC


🍐PeRFception: Largest Implicit-Representation Dataset🍐

👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅POSTECH + NVIDIA + Caltech = 🤯
✅Size: -96.4% from original dataset!
✅2D/3D image/object class/semantic
✅Ready-to-use pipeline for implicit dataset

More: https://bit.ly/3eW9hJA


🐸 CHARL-E: Stable Diffusion in 1 click 🐸

👉CHARL-E packages Stable Diffusion into a simple app.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅No setup, dependencies, or internet
✅Images with 1-click on #macbook
✅Runs only on M1/M2 processors
✅Source code under MIT license

More: https://bit.ly/3xv2z3G


🍋YOLOPv2: Better Driving Perception🍋

👉YOLOPv2: simultaneous object detection, road segmentation & lane detection

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅E2E perception net with better backbone
✅Efficient ELAN for reasonable memory
✅Stability for adapting to scenarios
✅SOTA on BDD100K, +50% faster!
✅Source code under MIT license

More: https://bit.ly/3LvYGBh


🍈SegNeXt: new SOTA in Semantic Seg.🍈

👉SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel tailored network architecture
✅Spatial attention via multi-scale feats
✅Encoder + conv. better than transformers
✅SOTA on several datasets (ADE20K, etc.)

More: https://bit.ly/3UrZhrH

