This media is not supported in your browser
VIEW IN TELEGRAM
👹TT-GNeRF: generative NeRF for Faces👹
👉TT-GNeRF: a novel 3D-aware GANs based on generative NeRF for faces
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + Uni_Trento + #Snap 🤯
✅DAEM for disentanglement of 3D model
✅"Training-as-Init, Optimizing-for-Tuning"
✅Consistency++, preserving non-target ROI
✅Unsupervised optimization of geometry
More: https://bit.ly/3ARZmMw
👉TT-GNeRF: a novel 3D-aware GANs based on generative NeRF for faces
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + Uni_Trento + #Snap 🤯
✅DAEM for disentanglement of 3D model
✅"Training-as-Init, Optimizing-for-Tuning"
✅Consistency++, preserving non-target ROI
✅Unsupervised optimization of geometry
More: https://bit.ly/3ARZmMw
🔥4❤1👍1
🎪 SOTA in Arbitrary Shape Text Detection 🎪
👉Novel unified coarse-to-fine Transformer for arbitrary shape text detection
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Coarse-to-fine arbitrary text detection
✅Accurate text detection, NO post-process
✅Boundary proposal generation mechanism
✅Innovative boundary transformer (iterative)
✅Boundary energy loss (BEL) for refinement
More: https://bit.ly/3D6Ryt4
👉Novel unified coarse-to-fine Transformer for arbitrary shape text detection
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Coarse-to-fine arbitrary text detection
✅Accurate text detection, NO post-process
✅Boundary proposal generation mechanism
✅Innovative boundary transformer (iterative)
✅Boundary energy loss (BEL) for refinement
More: https://bit.ly/3D6Ryt4
❤8👍2😢1
This media is not supported in your browser
VIEW IN TELEGRAM
🐲 Open-Source Self-Driving projects 🐲
👉A free repo with many autonomous vehicle-related projects
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Basic/Advance Lane/Line Detection
✅Driving behavior by training & validating
✅Autopilot: predicting steering angle
More: https://bit.ly/3qqJ7RB
👉A free repo with many autonomous vehicle-related projects
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Basic/Advance Lane/Line Detection
✅Driving behavior by training & validating
✅Autopilot: predicting steering angle
More: https://bit.ly/3qqJ7RB
🔥22👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🥤K-VIL: Keypoint-based visual imitation🥤
👉K-VIL: auto-incremental extraction of object-centric task representation.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient task-relevant keypoints
✅Embodiment-independent tasks
✅Adaptation of tasks to new scenes
✅Input: only a small set of demo clips
✅Novel keypoint-based controller
More: https://bit.ly/3eIrxpP
👉K-VIL: auto-incremental extraction of object-centric task representation.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient task-relevant keypoints
✅Embodiment-independent tasks
✅Adaptation of tasks to new scenes
✅Input: only a small set of demo clips
✅Novel keypoint-based controller
More: https://bit.ly/3eIrxpP
🔥7👍1
This media is not supported in your browser
VIEW IN TELEGRAM
💜 #Selfdriving in 80's. Damn Romantic 💜
👉The first self-driving car with people on board, 1986. So slow and lovely.
More: https://bit.ly/3BtRDon
👉The first self-driving car with people on board, 1986. So slow and lovely.
More: https://bit.ly/3BtRDon
❤9👏4👍3
This media is not supported in your browser
VIEW IN TELEGRAM
🏵️ TORAS: SOTA #AI for annotation 🏵️
👉TORAS: web-based AI-powered, cooperative, annotation platform.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅SOTA AI tools -> significant speedup
✅"Recipes" to define how to annotate
✅Repo with folder structure for storage
✅Also on-prem for (commercial) firms
More: https://bit.ly/3L78YI2
👉TORAS: web-based AI-powered, cooperative, annotation platform.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅SOTA AI tools -> significant speedup
✅"Recipes" to define how to annotate
✅Repo with folder structure for storage
✅Also on-prem for (commercial) firms
More: https://bit.ly/3L78YI2
🔥9🤯2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
💮MAXIM: Multi-Axis MLP for Vision💮
👉#Google opens MAXIM, a multi-axis MLP for low-level vision
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Denoising, deblurring, dehazing, etc
✅Multi-axis gated MLP, linear complexity
✅Cross gating block, separate features
✅SOTA results on several datasets!
More: https://bit.ly/3Dmp8LI
👉#Google opens MAXIM, a multi-axis MLP for low-level vision
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Denoising, deblurring, dehazing, etc
✅Multi-axis gated MLP, linear complexity
✅Cross gating block, separate features
✅SOTA results on several datasets!
More: https://bit.ly/3Dmp8LI
🔥12❤1👎1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 A Survey on Diffusion Models 🔥
👉A comprehensive review of denoising diffusion models in #computervision 🤯
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Overview on diffusion models
✅Hot trend for the generative AI
✅A multi-perspective categorization
✅Current limitations / new directions
More: https://bit.ly/3RYG5zP
👉A comprehensive review of denoising diffusion models in #computervision 🤯
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Overview on diffusion models
✅Hot trend for the generative AI
✅A multi-perspective categorization
✅Current limitations / new directions
More: https://bit.ly/3RYG5zP
❤5👍3🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🉐#AI finds where IG photos are taken🉐
👉Brilliant work of Depoorter, Belgium artist that handles #privacy, #AI & #socialmedia
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Recorded open cameras for weeks
✅Scraped all #Instagram photos
✅Matching Instagram vs. footage
More: https://bit.ly/3eL5dfc
👉Brilliant work of Depoorter, Belgium artist that handles #privacy, #AI & #socialmedia
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Recorded open cameras for weeks
✅Scraped all #Instagram photos
✅Matching Instagram vs. footage
More: https://bit.ly/3eL5dfc
😱18👍13🥰2
This media is not supported in your browser
VIEW IN TELEGRAM
🈯SAMURAI: in-the-wild Shape/Material🈯
👉#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parametrization for varying distances
✅Camera multiplex optimization
✅Posterior scaling of input images
✅Explicit meshes extraction with BRDF
✅Code/data soon available ->#NeurIPS
More: https://bit.ly/3BKWgf3
👉#Google SAMURAI: shape, BRDF, per-image pose & illumination. Relightable #3D assets for #AR/#VR.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parametrization for varying distances
✅Camera multiplex optimization
✅Posterior scaling of input images
✅Explicit meshes extraction with BRDF
✅Code/data soon available ->#NeurIPS
More: https://bit.ly/3BKWgf3
👍8🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🟨 Lang<->Pics in 100+ Languages 🟨
👉#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🤯
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅PaLI: Pathways Lang & Image model
✅Answering, captioning, reasoning, etc
✅From Eng. to 109 lang. understanding
✅The new SOTA on several datasets
More: https://bit.ly/3QMslHC
👉#Google PaLI: unified lang-image #AI to perform tasks in 109 languages 🤯
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅PaLI: Pathways Lang & Image model
✅Answering, captioning, reasoning, etc
✅From Eng. to 109 lang. understanding
✅The new SOTA on several datasets
More: https://bit.ly/3QMslHC
🔥6👍1💯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍐PeRFception: Largest IR Dataset🍐
👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅POSTECH + NVIDIA + Caltech = 🤯
✅Size: -96.4% from original dataset!
✅2D/3D image/object class/semantic
✅Ready-to-use pipeline for implicit dataset
More: https://bit.ly/3eW9hJA
👉#Nvidia, a new frontier in data collection via Plenoxels: same info, -96.4% in size.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅POSTECH + NVIDIA + Caltech = 🤯
✅Size: -96.4% from original dataset!
✅2D/3D image/object class/semantic
✅Ready-to-use pipeline for implicit dataset
More: https://bit.ly/3eW9hJA
❤9❤🔥1👍1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🐸 CHARL-E: Stable Diffusion in 1 click 🐸
👉CHARL-E packages Stable Diffusion into a simple app.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅No setup, dependencies, or internet
✅Images with 1-click on #macbook
✅Suitable only for M1/M2 processor
✅Source code under MIT license
More: https://bit.ly/3xv2z3G
👉CHARL-E packages Stable Diffusion into a simple app.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅No setup, dependencies, or internet
✅Images with 1-click on #macbook
✅Suitable only for M1/M2 processor
✅Source code under MIT license
More: https://bit.ly/3xv2z3G
🔥11👍3❤🔥1❤1
This media is not supported in your browser
VIEW IN TELEGRAM
🍋YOLOPv2: Better Driving Perception🍋
👉YOLOPv2: simultaneous object, road segmentation & lane detection
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅E2E perception net with better backbone
✅Efficient ELAN for reasonable memory
✅Stability for adapting to scenarios
✅SOTA on BDD100K, +50% faster!
✅Source code under MIT license
More: https://bit.ly/3LvYGBh
👉YOLOPv2: simultaneous object, road segmentation & lane detection
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅E2E perception net with better backbone
✅Efficient ELAN for reasonable memory
✅Stability for adapting to scenarios
✅SOTA on BDD100K, +50% faster!
✅Source code under MIT license
More: https://bit.ly/3LvYGBh
🔥12
🍈SegNeXt: new SOTA in Semantic Seg.🍈
👉SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel tailored network architecture
✅Spatial attention via multi-scale feats
✅Encoder + conv. better than transformers
✅SOTA on several datasets (ADE20K, etc.)
More: https://bit.ly/3UrZhrH
👉SOTA (by large margin) on ADE20K, Cityscapes, COCO-Stuff, Pascal VOC, Pascal Context, and iSAID 🤯
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel tailored network architecture
✅Spatial attention via multi-scale feats
✅Encoder + conv. better than transformers
✅SOTA on several datasets (ADE20K, etc.)
More: https://bit.ly/3UrZhrH
🔥9👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🦪StereoVoxelNet: RT Obstacles Detection🦪
👉Novel deep neural approach to detect occupancy from stereo images directly
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Occupancy voxels via deep learning
✅RT on Jetson-TX2 (-98% CPU of SOTA)
✅Optimization via octrees / sparse conv.
✅Real-world stereo in/outdoor dataset
More: https://bit.ly/3BylAn3
👉Novel deep neural approach to detect occupancy from stereo images directly
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Occupancy voxels via deep learning
✅RT on Jetson-TX2 (-98% CPU of SOTA)
✅Optimization via octrees / sparse conv.
✅Real-world stereo in/outdoor dataset
More: https://bit.ly/3BylAn3
👍10🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🚜 NeRF-Factory: a NeRF collection 🚜
👉PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF: Project | Paper | Code
✅NeRF++: Paper | Code
✅DVGO: Project | Paper v1/v2 | Code
✅Plenoxels: Project | Paper | Code
✅Mip-NeRF: Project | Paper | Code
✅Mip-NeRF360: Project | Paper | Code
✅Ref-NeRF: Project | Paper | Code
More: https://bit.ly/3qUgmgC
👉PyTorch-reimplemented NeRF library with 7 popular models/implementations & 7 datasets
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF: Project | Paper | Code
✅NeRF++: Paper | Code
✅DVGO: Project | Paper v1/v2 | Code
✅Plenoxels: Project | Paper | Code
✅Mip-NeRF: Project | Paper | Code
✅Mip-NeRF360: Project | Paper | Code
✅Ref-NeRF: Project | Paper | Code
More: https://bit.ly/3qUgmgC
👍7🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🥶 Lumos by #Nvidia: Relighting Portrait 🥶
👉The new SOTA in relighting without requiring a light stage
😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/
👉The new SOTA in relighting without requiring a light stage
😎Review https://bit.ly/3dCH9ej
😎Project deepimagination.cc/Lumos
😎Paper arxiv.org/pdf/2209.10510.pdf
😎Demo http://imaginaire.cc/Lumos/
❤11👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🍜 SURF-GAN: NeRF - >StyleGAN 🍜
👉 Editable portraits by injecting the NeRF's prior into StyleGAN
😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
👉 Editable portraits by injecting the NeRF's prior into StyleGAN
😎Review https://bit.ly/3SohEw3
😎Project jgkwak95.github.io/surfgan
😎Paper arxiv.org/pdf/2207.10257.pdf
😎Code github.com/jgkwak95/SURF-GAN
👍4❤2❤🔥1