DeepLearning ru:
Clockwork Convnets for Video Semantic Segmentation.
Adaptive video processing by incorporating data-driven clocks.
We define a novel family of "clockwork" convnets driven by fixed or adaptive clock signals that schedule the processing of different layers at different update rates according to their semantic stability. We design a pipeline schedule to reduce latency for real-time recognition and a fixed-rate schedule to reduce overall computation. Finally, we extend clockwork scheduling to adaptive video processing by incorporating data-driven clocks that can be tuned on unlabeled video.
https://arxiv.org/pdf/1608.03609v1.pdf
https://github.com/shelhamer/clockwork-fcn
http://www.gitxiv.com/posts/89zR7ATtd729JEJAg/clockwork-convnets-for-video-semantic-segmentation
#dl #CV #Caffe #video #Segmentation
Clockwork Convnets for Video Semantic Segmentation.
Adaptive video processing by incorporating data-driven clocks.
We define a novel family of "clockwork" convnets driven by fixed or adaptive clock signals that schedule the processing of different layers at different update rates according to their semantic stability. We design a pipeline schedule to reduce latency for real-time recognition and a fixed-rate schedule to reduce overall computation. Finally, we extend clockwork scheduling to adaptive video processing by incorporating data-driven clocks that can be tuned on unlabeled video.
https://arxiv.org/pdf/1608.03609v1.pdf
https://github.com/shelhamer/clockwork-fcn
http://www.gitxiv.com/posts/89zR7ATtd729JEJAg/clockwork-convnets-for-video-semantic-segmentation
#dl #CV #Caffe #video #Segmentation
GitHub
shelhamer/clockwork-fcn
Clockwork Convnets for Video Semantic Segmenation. Contribute to shelhamer/clockwork-fcn development by creating an account on GitHub.
There is a new $1MM competition on Kaggle to use ML / AI to diagnose lung cancer from CT scans.
Not only it is the great breakthrough for Kaggle (it is the first competition with this huge prize fund), it is also a breakthrough for science, since top world researchers and enginners will compete to basically crowdsource and ease the lung cancer diagnostics.
Competition is available at: https://www.kaggle.com/c/data-science-bowl-2017
#kaggle #segmentation #deeplearning #cv
Not only it is the great breakthrough for Kaggle (it is the first competition with this huge prize fund), it is also a breakthrough for science, since top world researchers and enginners will compete to basically crowdsource and ease the lung cancer diagnostics.
Competition is available at: https://www.kaggle.com/c/data-science-bowl-2017
#kaggle #segmentation #deeplearning #cv
Kaggle
Data Science Bowl 2017
Can you improve lung cancer detection?
Google purchased scene segmentation technology.
https://techcrunch.com/2017/08/16/google-acquires-aimatter-maker-of-the-fabby-computer-vision-app/
#dl #segmentation #cv #google
https://techcrunch.com/2017/08/16/google-acquires-aimatter-maker-of-the-fabby-computer-vision-app/
#dl #segmentation #cv #google
TechCrunch
Google acquires AIMatter, maker of the Fabby computer vision app
Computer vision -- the branch of artificial intelligence that lets computers "see" and process images like humans do (and, actually, often better than
Semantic Segmentation Models for Autonomous Vehicles
https://www.kdnuggets.com/2018/03/semantic-segmentation-models-autonomous-vehicles.html
#deeplearning #segmentation
https://www.kdnuggets.com/2018/03/semantic-segmentation-models-autonomous-vehicles.html
#deeplearning #segmentation
ModaNet: A Large-Scale Street Fashion Dataset with Polygon Annotations
Latest segmentation and detection approaches (DeepLabV3+, FasterRCNN) applied to street fashion images. Arxiv paper contains information about both: net and dataset.
Arxiv link: https://arxiv.org/abs/1807.01394
Paperdoll dataset: http://vision.is.tohoku.ac.jp/~kyamagu/research/paperdoll/
#segmentation #dataset #fashion #sv
Latest segmentation and detection approaches (DeepLabV3+, FasterRCNN) applied to street fashion images. Arxiv paper contains information about both: net and dataset.
Arxiv link: https://arxiv.org/abs/1807.01394
Paperdoll dataset: http://vision.is.tohoku.ac.jp/~kyamagu/research/paperdoll/
#segmentation #dataset #fashion #sv
vision.is.tohoku.ac.jp
Kota Yamaguchi - PaperDoll Parsing
Kota Yamaguchi's website
Cancer metastasis detection with neural conditional random field (NCRF)
Github: https://github.com/baidu-research/NCRF?utm_source=telegram&utm_medium=opendatascience
#Baidu #Cancer #Segmentation #cv #DL
Github: https://github.com/baidu-research/NCRF?utm_source=telegram&utm_medium=opendatascience
#Baidu #Cancer #Segmentation #cv #DL
GitHub
GitHub - baidu-research/NCRF: Cancer metastasis detection with neural conditional random field (NCRF)
Cancer metastasis detection with neural conditional random field (NCRF) - baidu-research/NCRF
Paper Β«A Probabilistic U-Net for Segmentation of Ambiguous ImagesΒ» from #NIPS2018 spotlight presentation.
Github: https://github.com/SimonKohl/probabilistic_unet
Github: Arxiv: https://arxiv.org/abs/1806.05034
#DeepMind #segmentation #cv
Github: https://github.com/SimonKohl/probabilistic_unet
Github: Arxiv: https://arxiv.org/abs/1806.05034
#DeepMind #segmentation #cv
GitHub
GitHub - SimonKohl/probabilistic_unet: A U-Net combined with a variational auto-encoder that is able to learn conditional distributionsβ¦
A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations. - GitHub - SimonKohl/probabilistic_unet: A U-Net combined with a variat...
Faster R-CNN and Mask R-CNN in #PyTorch 1.0
Another release from #Facebook.
Mask R-CNN Benchmark: a fast and modular implementation for Faster R-CNN and Mask R-CNN written entirely in @PyTorch 1.0. It brings up to 30% speedup compared to mmdetection during training.
Webcam demo and ipynb file are available.
Github: https://github.com/facebookresearch/maskrcnn-benchmark
#CNN #CV #segmentation #detection
Another release from #Facebook.
Mask R-CNN Benchmark: a fast and modular implementation for Faster R-CNN and Mask R-CNN written entirely in @PyTorch 1.0. It brings up to 30% speedup compared to mmdetection during training.
Webcam demo and ipynb file are available.
Github: https://github.com/facebookresearch/maskrcnn-benchmark
#CNN #CV #segmentation #detection
GitHub
GitHub - facebookresearch/maskrcnn-benchmark: Fast, modular reference implementation of Instance Segmentation and Object Detectionβ¦
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch. - facebookresearch/maskrcnn-benchmark
ββPapers from #DeepMind panel at #NIPS2018
Work on radiotherapy planning: https://arxiv.org/abs/1809.04430
Triaging eye diseases: https://www.nature.com/articles/s41591-018-0107-6
Probabilistic U-net: https://arxiv.org/abs/1806.05034
#segmentation #CV #Unet
Work on radiotherapy planning: https://arxiv.org/abs/1809.04430
Triaging eye diseases: https://www.nature.com/articles/s41591-018-0107-6
Probabilistic U-net: https://arxiv.org/abs/1806.05034
#segmentation #CV #Unet
ββFast video object segmentation with Spatio-Temporal GANs
Spatio-Temporal GANs to the Video Object Segmentation task, allowing to run at 32 FPS without fine-tuning.
#FaSTGAN #GAN #Segmentation #videomining #CV #DL
Spatio-Temporal GANs to the Video Object Segmentation task, allowing to run at 32 FPS without fine-tuning.
#FaSTGAN #GAN #Segmentation #videomining #CV #DL
ββDeep Learning Image Segmentation for Ecommerce Catalogue Visual Search
Microsoftβs article on image segmentation
Link: https://www.microsoft.com/developerblog/2018/04/18/deep-learning-image-segmentation-for-ecommerce-catalogue-visual-search/
#CV #DL #Segmentation #Microsoft
Microsoftβs article on image segmentation
Link: https://www.microsoft.com/developerblog/2018/04/18/deep-learning-image-segmentation-for-ecommerce-catalogue-visual-search/
#CV #DL #Segmentation #Microsoft
ββGSCNN: video segmetation architecture
Semantic segmentation GSCNN significantly outperforms DeepLabV3+ on Cityscapes benchmark.
Paper: https://arxiv.org/abs/1907.05740
Github (Project): https://github.com/nv-tlabs/GSCNN
#DL #CV #NVidiaAI #Nvidia #autonomous #selfdriving #car #RL #segmentation
Semantic segmentation GSCNN significantly outperforms DeepLabV3+ on Cityscapes benchmark.
Paper: https://arxiv.org/abs/1907.05740
Github (Project): https://github.com/nv-tlabs/GSCNN
#DL #CV #NVidiaAI #Nvidia #autonomous #selfdriving #car #RL #segmentation
ββNew paper on training with pseudo-labels for semantic segmentation
Semi-Supervised Segmentation of Salt Bodies in Seismic Images:
SOTA (1st place) at TGS Salt Identification Challenge.
Github: https://github.com/ybabakhin/kaggle_salt_bes_phalanx
ArXiV: https://arxiv.org/abs/1904.04445
#GCPR2019 #Segmentation #CV
Semi-Supervised Segmentation of Salt Bodies in Seismic Images:
SOTA (1st place) at TGS Salt Identification Challenge.
Github: https://github.com/ybabakhin/kaggle_salt_bes_phalanx
ArXiV: https://arxiv.org/abs/1904.04445
#GCPR2019 #Segmentation #CV
Video on how Facebook continues to develop its #Portal device
How #Facebook used Mask R-CNN, #PyTorch, and custom hardware integrations like foveated processing to improve Portalβs Smart Camera system.
Link: https://ai.facebook.com/blog/smart-camera-portal-advances/
#CV #DL #Segmentation
How #Facebook used Mask R-CNN, #PyTorch, and custom hardware integrations like foveated processing to improve Portalβs Smart Camera system.
Link: https://ai.facebook.com/blog/smart-camera-portal-advances/
#CV #DL #Segmentation
Meta
How weβve advanced Smart Camera for new Portal video-calling devices
Weβve used Detectron2, Mask R-CNN, and custom hardware integrations like foveated processing in order to make additional speed and precision improvements in the computer vision models that power Smart Camera.
YOLACT_ Real-Time Instance Segmentation [ICCV Trailer].mp4
19.2 MB
YOLACT: Real-time Instance Segmentation
Fully-convolutional model for real-time instance segmentation that achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is significantly faster than any previous competitive approach. They obtain this result after training on only one GPU.
video: https://www.youtube.com/watch?v=0pMfmo8qfpQ
paper: https://arxiv.org/abs/1904.02689
code: https://github.com/dbolya/yolact
#yolo #instance_segmentation #segmentation #real_time
Fully-convolutional model for real-time instance segmentation that achieves 29.8 mAP on MS COCO at 33.5 fps evaluated on a single Titan Xp, which is significantly faster than any previous competitive approach. They obtain this result after training on only one GPU.
video: https://www.youtube.com/watch?v=0pMfmo8qfpQ
paper: https://arxiv.org/abs/1904.02689
code: https://github.com/dbolya/yolact
#yolo #instance_segmentation #segmentation #real_time
BodyPix: Real-time Person Segmentation in the Browser with TensorFlow.js
Released BodyPix 2.0 with #multiperson support and improved accuracy (based on ResNet50), a new API, weight quantization, and support for different image sizes with TensorFlow.js
It estimates and renders person and body-part segmentation at 25 fps on a 2018 15-inch MacBook Pro, and 21 fps on an iPhone X.
code: https://github.com/tensorflow/tfjs-models/tree/master/body-pix
demo: https://storage.googleapis.com/tfjs-models/demos/body-pix/index.html
blog: https://blog.tensorflow.org/2019/11/updated-bodypix-2.html
#dl #segmentation
Released BodyPix 2.0 with #multiperson support and improved accuracy (based on ResNet50), a new API, weight quantization, and support for different image sizes with TensorFlow.js
It estimates and renders person and body-part segmentation at 25 fps on a 2018 15-inch MacBook Pro, and 21 fps on an iPhone X.
code: https://github.com/tensorflow/tfjs-models/tree/master/body-pix
demo: https://storage.googleapis.com/tfjs-models/demos/body-pix/index.html
blog: https://blog.tensorflow.org/2019/11/updated-bodypix-2.html
#dl #segmentation
ββSAUNet: Shape Attentive U-Net for Interpretable Medical Image Segmentation
New approach for interpreting medical image segmentation models.
U-Net and other image segmentation models work quite well on medical data, but still aren't widely adopted. One of the reasons is the lack of reproducibility as well as robustness issues.
The key idea of the paper is using the additional stream in U-Net with shape features to increase robustness and use the output of this stream (attention map) that can be used or interpretability.
Modifications to the basic U-Net architecture:
- use dense blocks from DenseNet-121 as the encoder.
- use dual attention decoder block (with spatial and channel-wise attention paths)
- make the second stream using object shape (contour)
- dual-task loss function: cross-entropy + dice + edge loss (bce loss of the predicted shape boundaries)
Shape and spatial attention maps can be used for interpretation.
Paper: https://arxiv.org/abs/2001.07645
Code: https://github.com/sunjesse/shape-attentive-unet
#unet #imagesegmentation #interpretability #segmentation
New approach for interpreting medical image segmentation models.
U-Net and other image segmentation models work quite well on medical data, but still aren't widely adopted. One of the reasons is the lack of reproducibility as well as robustness issues.
The key idea of the paper is using the additional stream in U-Net with shape features to increase robustness and use the output of this stream (attention map) that can be used or interpretability.
Modifications to the basic U-Net architecture:
- use dense blocks from DenseNet-121 as the encoder.
- use dual attention decoder block (with spatial and channel-wise attention paths)
- make the second stream using object shape (contour)
- dual-task loss function: cross-entropy + dice + edge loss (bce loss of the predicted shape boundaries)
Shape and spatial attention maps can be used for interpretation.
Paper: https://arxiv.org/abs/2001.07645
Code: https://github.com/sunjesse/shape-attentive-unet
#unet #imagesegmentation #interpretability #segmentation