Computer Science and Programming

A lightweight vision library for performing large scale object detection & instance segmentation

Github: https://github.com/obss/sahi

Paper: https://arxiv.org/abs/2202.06934v1

Kaggle notebook: https://www.kaggle.com/remekkinas/sahi-slicing-aided-hyper-inference-yv5-and-yx

Dataset: https://paperswithcode.com/dataset/xview

👉👉@computer_science_and_programming

👍152👎8

184K viewsedited 12:09

Computer Science and Programming

Reading suggestions to keep you up-to-date with the latest and classic breakthroughs in AI and Data Science.

https://towardsdatascience.com/ai-papers-to-read-in-2022-c6edd4302247

Medium

AI Papers to Read in 2022

Reading suggestions to keep you up-to-date with the latest and classic breakthroughs in AI and Data Science.

👍223👎9❤1

176K views10:24

Computer Science and Programming

💬 A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution

Github: https://github.com/mjq11302010044/tatt

Paper: https://arxiv.org/abs/2203.09388v2

Dataset: https://deepchecks.com/blog/

@computer_science_and_programming

👍127👎6❤1

156K viewsedited 10:46

Computer Science and Programming

This media is not supported in your browser

VIEW IN TELEGRAM

NAFSSR: Stereo Image Super-Resolution Using NAFNet

Github: https://github.com/megvii-research/NAFNet

Paper: https://arxiv.org/abs/2204.08714v1

Demo: https://colab.research.google.com/drive/1dkO5AyktmBoWwxBwoKFUurIDn0m4qDXT?usp=sharing

Dataset: https://paperswithcode.com/dataset/kitti

@computer_science_and_programming

👍273👎11

149K viewsedited 07:53

Computer Science and Programming

🧊 Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)

Github: https://github.com/dvlab-research/focalsconv

Paper: https://arxiv.org/abs/2204.12463

Dataset: https://paperswithcode.com/dataset/nuscenes

@computer_science_and_programming

👍116👎10❤1

129K viewsedited 06:30

Computer Science and Programming

RefineMask: Towards High-Quality Instance Segmentation
with Fine-Grained Features (CVPR 2021)

Paper:
https://openaccess.thecvf.com/content/CVPR2021/papers/Zhang_RefineMask_Towards_High-Quality_Instance_Segmentation_With_Fine-Grained_Features_CVPR_2021_paper.pdf

Source:
https://github.com/zhanggang001/RefineMask

@computer_science_and_programming

👍137👎10😁1

142K views09:34

Computer Science and Programming

0:08

This media is not supported in your browser

VIEW IN TELEGRAM

AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

Github: https://github.com/ShoufaChen/AdaptFormer

Paper: https://arxiv.org/abs/2205.13535v1

Dataset: https://paperswithcode.com/dataset/something-something-v2

@computer_science_and_programming

👍102👎1

137K viewsedited 11:52

Computer Science and Programming

Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Github: https://github.com/kssteven418/squeezeformer

Paper: https://arxiv.org/abs/2206.00888v1

Dataset: https://paperswithcode.com/dataset/librispeech

@computer_science_and_programming

👍127👎6

140K viewsedited 06:35

Computer Science and Programming

MIT, Introduction to Deep Learning, 2022 Lecture series

Website:
http://introtodeeplearning.com/

Lecture:
https://www.youtube.com/watch?v=7sB052Pz0sQ&list=PLtBw6njQRU-rwp5__7C0oIVt26ZgjG9NI

@computer_science_and_programming

👍273👎7

148K viewsedited 04:50

Computer Science and Programming

CVPR 2022 open access

All accepted papers list:
https://openaccess.thecvf.com/CVPR2022?day=2022-06-21

@computer_science_and_programming

👍85👎2

138K viewsedited 07:12

Computer Science and Programming

Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech

IMS Toucan is a toolkit for teaching, training and using state-of-the-art Speech Synthesis models.

Github: https://github.com/DigitalPhonetics/IMS-Toucan
https://github.com/rballester/tntorch

Pre-Generated Audios: https://multilingualtoucan.github.io/

Cloning prosody across speakers: https://toucanprosodycloningdemo.github.io/

Interactive Demo: https://huggingface.co/spaces/Flux9665/IMS-Toucan

Paper: https://arxiv.org/abs/2206.12229v1

@computer_science_and_programming

👍130

141K views11:43

Computer Science and Programming

Instance Shadow Detection with A Single-Stage Detector

Deep framework, and an evaluation metric to approach this new task.

Github: https://github.com/stevewongv/InstanceShadowDetection

Instance Shadow Detection: https://github.com/stevewongv/SSIS

Video: https://www.youtube.com/watch?v=p0b_2SsFypw

Colab: https://colab.research.google.com/drive/1y9UpS5uA1YuoMyvYVzcKL4ltA_FDu_x0?usp=sharing

Paper: https://arxiv.org/abs/2207.04614v1

Datasets: https://paperswithcode.com/dataset/soba

@computer_science_and_programming

👍168👎6

150K viewsedited 09:33

Computer Science and Programming

Weakly Supervised Object Localization via Transformer with Implicit Spatial Calibration

learnable parameter to dynamically adjust the semantic correlations and spatial context intensities for effective information propagation.

Github: https://github.com/164140757/scm

Paper: https://arxiv.org/abs/2207.10447v1

Dataset: https://paperswithcode.com/dataset/cub-200-2011

@computer_science_and_programming

👍102👎6

149K viewsedited 10:15

Computer Science and Programming

This media is not supported in your browser

VIEW IN TELEGRAM

UFO: segmentation 140+ FPS

👉Unified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unified framework for co-segmentation
✅Co-segmentation, co-saliency, saliency
✅Block for long-range dependencies
✅Able to reach for 140 FPS in inference
✅The new SOTA on multiple datasets

Paper:
https://arxiv.org/pdf/2203.04708v2.pdf

Code:
https://github.com/suyukun666/UFO

@computer_science_and_programming

👍205👎3❤1

176K viewsedited 09:33

Computer Science and Programming

Harvard CS109A #DataScience course materials — huge collection free & open!

1. Lecture notes
2. R code, #Python notebooks
3. Lab material
4. Advanced sections
and more ...

https://harvard-iacs.github.io/2019-CS109A/pages/materials.html

@computer_science_and_programming

👍333👎18

176K views04:53

Computer Science and Programming

Resources for performing deep learning on satellite imagery:
- Techniques
- Datasets
- ML best Practice
- Courses
and more ...

@computer_science_and_programming

👍301👎17

161K viewsedited 06:54

Computer Science and Programming