This media is not supported in your browser
VIEW IN TELEGRAM
A lightweight vision library for performing large scale object detection & instance segmentation
Github: https://github.com/obss/sahi
Paper: https://arxiv.org/abs/2202.06934v1
Kaggle notebook: https://www.kaggle.com/remekkinas/sahi-slicing-aided-hyper-inference-yv5-and-yx
Dataset: https://paperswithcode.com/dataset/xview
👉👉@computer_science_and_programming
Github: https://github.com/obss/sahi
Paper: https://arxiv.org/abs/2202.06934v1
Kaggle notebook: https://www.kaggle.com/remekkinas/sahi-slicing-aided-hyper-inference-yv5-and-yx
Dataset: https://paperswithcode.com/dataset/xview
👉👉@computer_science_and_programming
👍151👎8
Reading suggestions to keep you up-to-date with the latest and classic breakthroughs in AI and Data Science.
https://towardsdatascience.com/ai-papers-to-read-in-2022-c6edd4302247
https://towardsdatascience.com/ai-papers-to-read-in-2022-c6edd4302247
Medium
AI Papers to Read in 2022
Reading suggestions to keep you up-to-date with the latest and classic breakthroughs in AI and Data Science.
👍223👎9
💬 A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution
Github: https://github.com/mjq11302010044/tatt
Paper: https://arxiv.org/abs/2203.09388v2
Dataset: https://deepchecks.com/blog/
@computer_science_and_programming
Github: https://github.com/mjq11302010044/tatt
Paper: https://arxiv.org/abs/2203.09388v2
Dataset: https://deepchecks.com/blog/
@computer_science_and_programming
👍127👎6
This media is not supported in your browser
VIEW IN TELEGRAM
NAFSSR: Stereo Image Super-Resolution Using NAFNet
Github: https://github.com/megvii-research/NAFNet
Paper: https://arxiv.org/abs/2204.08714v1
Demo: https://colab.research.google.com/drive/1dkO5AyktmBoWwxBwoKFUurIDn0m4qDXT?usp=sharing
Dataset: https://paperswithcode.com/dataset/kitti
@computer_science_and_programming
Github: https://github.com/megvii-research/NAFNet
Paper: https://arxiv.org/abs/2204.08714v1
Demo: https://colab.research.google.com/drive/1dkO5AyktmBoWwxBwoKFUurIDn0m4qDXT?usp=sharing
Dataset: https://paperswithcode.com/dataset/kitti
@computer_science_and_programming
👍273👎11
🧊 Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
Github: https://github.com/dvlab-research/focalsconv
Paper: https://arxiv.org/abs/2204.12463
Dataset: https://paperswithcode.com/dataset/nuscenes
@computer_science_and_programming
Github: https://github.com/dvlab-research/focalsconv
Paper: https://arxiv.org/abs/2204.12463
Dataset: https://paperswithcode.com/dataset/nuscenes
@computer_science_and_programming
👍116👎10❤1
RefineMask: Towards High-Quality Instance Segmentation
with Fine-Grained Features (CVPR 2021)
Paper:
https://openaccess.thecvf.com/content/CVPR2021/papers/Zhang_RefineMask_Towards_High-Quality_Instance_Segmentation_With_Fine-Grained_Features_CVPR_2021_paper.pdf
Source:
https://github.com/zhanggang001/RefineMask
@computer_science_and_programming
with Fine-Grained Features (CVPR 2021)
Paper:
https://openaccess.thecvf.com/content/CVPR2021/papers/Zhang_RefineMask_Towards_High-Quality_Instance_Segmentation_With_Fine-Grained_Features_CVPR_2021_paper.pdf
Source:
https://github.com/zhanggang001/RefineMask
@computer_science_and_programming
👍137👎10😁1
This media is not supported in your browser
VIEW IN TELEGRAM
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
Github: https://github.com/ShoufaChen/AdaptFormer
Paper: https://arxiv.org/abs/2205.13535v1
Dataset: https://paperswithcode.com/dataset/something-something-v2
@computer_science_and_programming
Github: https://github.com/ShoufaChen/AdaptFormer
Paper: https://arxiv.org/abs/2205.13535v1
Dataset: https://paperswithcode.com/dataset/something-something-v2
@computer_science_and_programming
👍102👎1
Squeezeformer: An Efficient Transformer for Automatic Speech Recognition
Github: https://github.com/kssteven418/squeezeformer
Paper: https://arxiv.org/abs/2206.00888v1
Dataset: https://paperswithcode.com/dataset/librispeech
@computer_science_and_programming
Github: https://github.com/kssteven418/squeezeformer
Paper: https://arxiv.org/abs/2206.00888v1
Dataset: https://paperswithcode.com/dataset/librispeech
@computer_science_and_programming
👍127👎6
MIT, Introduction to Deep Learning, 2022 Lecture series
Website:
http://introtodeeplearning.com/
Lecture:
https://www.youtube.com/watch?v=7sB052Pz0sQ&list=PLtBw6njQRU-rwp5__7C0oIVt26ZgjG9NI
@computer_science_and_programming
Website:
http://introtodeeplearning.com/
Lecture:
https://www.youtube.com/watch?v=7sB052Pz0sQ&list=PLtBw6njQRU-rwp5__7C0oIVt26ZgjG9NI
@computer_science_and_programming
👍273👎7
CVPR 2022 open access
All accepted papers list:
https://openaccess.thecvf.com/CVPR2022?day=2022-06-21
@computer_science_and_programming
All accepted papers list:
https://openaccess.thecvf.com/CVPR2022?day=2022-06-21
@computer_science_and_programming
👍85👎2
Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech
IMS Toucan is a toolkit for teaching, training and using state-of-the-art Speech Synthesis models.
Github: https://github.com/DigitalPhonetics/IMS-Toucan
https://github.com/rballester/tntorch
Pre-Generated Audios: https://multilingualtoucan.github.io/
Cloning prosody across speakers: https://toucanprosodycloningdemo.github.io/
Interactive Demo: https://huggingface.co/spaces/Flux9665/IMS-Toucan
Paper: https://arxiv.org/abs/2206.12229v1
@computer_science_and_programming
IMS Toucan is a toolkit for teaching, training and using state-of-the-art Speech Synthesis models.
Github: https://github.com/DigitalPhonetics/IMS-Toucan
https://github.com/rballester/tntorch
Pre-Generated Audios: https://multilingualtoucan.github.io/
Cloning prosody across speakers: https://toucanprosodycloningdemo.github.io/
Interactive Demo: https://huggingface.co/spaces/Flux9665/IMS-Toucan
Paper: https://arxiv.org/abs/2206.12229v1
@computer_science_and_programming
👍130
Instance Shadow Detection with A Single-Stage Detector
Deep framework, and an evaluation metric to approach this new task.
Github: https://github.com/stevewongv/InstanceShadowDetection
Instance Shadow Detection: https://github.com/stevewongv/SSIS
Video: https://www.youtube.com/watch?v=p0b_2SsFypw
Colab: https://colab.research.google.com/drive/1y9UpS5uA1YuoMyvYVzcKL4ltA_FDu_x0?usp=sharing
Paper: https://arxiv.org/abs/2207.04614v1
Datasets: https://paperswithcode.com/dataset/soba
@computer_science_and_programming
Deep framework, and an evaluation metric to approach this new task.
Github: https://github.com/stevewongv/InstanceShadowDetection
Instance Shadow Detection: https://github.com/stevewongv/SSIS
Video: https://www.youtube.com/watch?v=p0b_2SsFypw
Colab: https://colab.research.google.com/drive/1y9UpS5uA1YuoMyvYVzcKL4ltA_FDu_x0?usp=sharing
Paper: https://arxiv.org/abs/2207.04614v1
Datasets: https://paperswithcode.com/dataset/soba
@computer_science_and_programming
👍168👎6
Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration
learnable parameter to dynamically adjust the semantic correlations and spatial context intensities for effective information propagation.
Github: https://github.com/164140757/scm
Paper: https://arxiv.org/abs/2207.10447v1
Dataset: https://paperswithcode.com/dataset/cub-200-2011
@computer_science_and_programming
learnable parameter to dynamically adjust the semantic correlations and spatial context intensities for effective information propagation.
Github: https://github.com/164140757/scm
Paper: https://arxiv.org/abs/2207.10447v1
Dataset: https://paperswithcode.com/dataset/cub-200-2011
@computer_science_and_programming
👍102👎6
This media is not supported in your browser
VIEW IN TELEGRAM
UFO: segmentation 140+ FPS
👉Unified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unified framework for co-segmentation
✅Co-segmentation, co-saliency, saliency
✅Block for long-range dependencies
✅Able to reach for 140 FPS in inference
✅The new SOTA on multiple datasets
Paper:
https://arxiv.org/pdf/2203.04708v2.pdf
Code:
https://github.com/suyukun666/UFO
@computer_science_and_programming
👉Unified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unified framework for co-segmentation
✅Co-segmentation, co-saliency, saliency
✅Block for long-range dependencies
✅Able to reach for 140 FPS in inference
✅The new SOTA on multiple datasets
Paper:
https://arxiv.org/pdf/2203.04708v2.pdf
Code:
https://github.com/suyukun666/UFO
@computer_science_and_programming
👍205👎3
Harvard CS109A #DataScience course materials — huge collection free & open!
1. Lecture notes
2. R code, #Python notebooks
3. Lab material
4. Advanced sections
and more ...
https://harvard-iacs.github.io/2019-CS109A/pages/materials.html
@computer_science_and_programming
1. Lecture notes
2. R code, #Python notebooks
3. Lab material
4. Advanced sections
and more ...
https://harvard-iacs.github.io/2019-CS109A/pages/materials.html
@computer_science_and_programming
👍333👎18
Resources for performing deep learning on satellite imagery:
- Techniques
- Datasets
- ML best Practice
- Courses
and more ...
@computer_science_and_programming
- Techniques
- Datasets
- ML best Practice
- Courses
and more ...
@computer_science_and_programming
👍301👎17
This media is not supported in your browser
VIEW IN TELEGRAM
VToonify: Controllable High-Resolution Portrait Video Style Transfer
@computer_science_and_programming
@computer_science_and_programming
👍91
VToonify: Controllable High-Resolution Portrait Video Style Transfer
Github:
https://github.com/williamyang1991/vtoonify
Colab code example
https://colab.research.google.com/github/williamyang1991/VToonify/blob/master/notebooks/inference_playground.ipynb
Paper:
https://arxiv.org/pdf/2209.11224.pdf
Dataset:
https://paperswithcode.com/dataset/faceforensics-1
Video explanation:
https://www.youtube.com/watch?v=0_OmVhDgYuY
@computer_science_and_programming
Github:
https://github.com/williamyang1991/vtoonify
Colab code example
https://colab.research.google.com/github/williamyang1991/VToonify/blob/master/notebooks/inference_playground.ipynb
Paper:
https://arxiv.org/pdf/2209.11224.pdf
Dataset:
https://paperswithcode.com/dataset/faceforensics-1
Video explanation:
https://www.youtube.com/watch?v=0_OmVhDgYuY
@computer_science_and_programming
👍183👎4
This media is not supported in your browser
VIEW IN TELEGRAM
Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild
Paper:
https://arxiv.org/pdf/2207.10660.pdf
Github:
https://github.com/facebookresearch/omni3d
Project page:
https://garrickbrazil.com/omni3d/
@computer_science_and_programming
Paper:
https://arxiv.org/pdf/2207.10660.pdf
Github:
https://github.com/facebookresearch/omni3d
Project page:
https://garrickbrazil.com/omni3d/
@computer_science_and_programming
👍156