Data Science by ODS.ai 🦜

DeepLearning ru:
Clockwork Convnets for Video Semantic Segmentation.

Adaptive video processing by incorporating data-driven clocks.

We define a novel family of "clockwork" convnets driven by fixed or adaptive clock signals that schedule the processing of different layers at different update rates according to their semantic stability. We design a pipeline schedule to reduce latency for real-time recognition and a fixed-rate schedule to reduce overall computation. Finally, we extend clockwork scheduling to adaptive video processing by incorporating data-driven clocks that can be tuned on unlabeled video.
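To make the fixed-rate schedule concrete, here is a minimal sketch in plain Python. It is a toy illustration, not the authors' Caffe implementation: the class name `ClockworkPipeline`, the `rates` parameter, and the use of a plain sum to fuse stage outputs are all assumptions made for this example. Each stage is recomputed only when its clock fires and otherwise reuses its cached output, so deeper stages with slower clocks run less often.

```python
from typing import Callable, List, Sequence


class ClockworkPipeline:
    """Toy fixed-rate clockwork schedule (hypothetical helper, not the paper's code).

    Stage i is recomputed only on frames where t % rates[i] == 0;
    on all other frames its cached output is reused.
    """

    def __init__(self, stages: Sequence[Callable[[float], float]], rates: Sequence[int]):
        if len(stages) != len(rates):
            raise ValueError("need exactly one rate per stage")
        if any(r < 1 for r in rates):
            raise ValueError("rates must be positive integers")
        self.stages = list(stages)
        self.rates = list(rates)
        self.cache: List[float] = [0.0] * len(stages)
        self.calls: List[int] = [0] * len(stages)  # how often each stage actually ran
        self.t = 0

    def step(self, frame: float) -> float:
        x = frame
        for i, (stage, rate) in enumerate(zip(self.stages, self.rates)):
            if self.t % rate == 0:  # clock fires: recompute this stage
                self.cache[i] = stage(x)
                self.calls[i] += 1
            x = self.cache[i]  # otherwise pass the cached output downstream
        self.t += 1
        # Assumption: fuse the per-stage outputs with a plain sum,
        # standing in for the skip-connection fusion of a segmentation net.
        return sum(self.cache)


# Shallow stage runs every frame, middle every 2nd, deep every 4th.
pipeline = ClockworkPipeline(
    stages=[lambda x: x + 1, lambda x: x * 2, lambda x: x * 10],
    rates=[1, 2, 4],
)
outputs = [pipeline.step(float(frame)) for frame in range(4)]
print(outputs, pipeline.calls)
```

Over four frames the shallow stage runs four times, the middle stage twice, and the deep stage once, which is where the compute saving comes from.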

https://arxiv.org/pdf/1608.03609v1.pdf
https://github.com/shelhamer/clockwork-fcn

http://www.gitxiv.com/posts/89zR7ATtd729JEJAg/clockwork-convnets-for-video-semantic-segmentation

#dl #CV #Caffe #video #Segmentation

2.4K views 22:26

Data Science by ODS.ai 🦜

Deep Bilateral Learning for Real-Time Image Enhancement

A video about automatic image enhancement with neural networks.

https://www.youtube.com/watch?v=GAe0qKKQY_I

#cv #dl #autoenhance #mit #youtube #video

YouTube

Deep Bilateral Learning for Real-Time Image Enhancement

Performance is a critical challenge in mobile image processing. Given a reference imaging pipeline, or even human-adjusted pairs of images, we seek to reproduce the enhancements and enable real-time evaluation. For this, we introduce a new neural network…

5.7K views 14:42

Data Science by ODS.ai 🦜

LSTMVis - Visual Analysis for Recurrent Neural Networks

LSTMVis is a visual analysis tool for recurrent neural networks with a focus on understanding their hidden state dynamics. The tool allows a user to select a hypothesis input range to focus on local state changes, to match these state changes to similar patterns in a large data set, and to align these results with structural annotations from their domain. The authors provide data for the tool to analyze specific hidden state properties on datasets containing nesting, phrase structure, and chord progressions, and demonstrate how the tool can be used to isolate patterns for further statistical analysis.
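The select-then-match workflow described above can be sketched in a few lines of Python. This is a simplified illustration and not LSTMVis's own code: the function names `select_cells` and `find_matches`, the fixed activation threshold, and the tiny hand-written `states` matrix are all assumptions made for the example. The idea is to pick the hidden units that stay above a threshold across a chosen input range, then scan the sequence for other windows where the same units are all active.

```python
from typing import List, Sequence


def select_cells(states: Sequence[Sequence[float]], start: int, end: int,
                 threshold: float) -> List[int]:
    """Return hidden-unit indices that stay >= threshold on time steps [start, end)."""
    n_units = len(states[0])
    return [
        unit for unit in range(n_units)
        if all(states[t][unit] >= threshold for t in range(start, end))
    ]


def find_matches(states: Sequence[Sequence[float]], cells: Sequence[int],
                 length: int, threshold: float) -> List[int]:
    """Return start positions of windows where every selected unit stays active."""
    matches = []
    for start in range(len(states) - length + 1):
        window = range(start, start + length)
        if all(states[t][unit] >= threshold for t in window for unit in cells):
            matches.append(start)
    return matches


# Hypothetical hidden states: 5 time steps x 3 hidden units.
states = [
    [0.9, 0.1, 0.8],
    [0.8, 0.2, 0.7],
    [0.1, 0.9, 0.6],
    [0.7, 0.0, 0.9],
    [0.9, 0.1, 0.8],
]

cells = select_cells(states, start=0, end=2, threshold=0.5)
matches = find_matches(states, cells, length=2, threshold=0.5)
print(cells, matches)
```

Here the hypothesis range covers the first two time steps; the scan finds the range itself plus one more window with the same activation pattern.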

http://lstm.seas.harvard.edu/

#harvard #video #dl #rnn

6.2K views 10:37

Data Science by ODS.ai 🦜

Architecture for real-time scene annotation (BlitzNet)

http://thoth.inrialpes.fr/research/blitznet/

arXiv: https://arxiv.org/abs/1708.02813
GitHub: https://github.com/dvornikita/blitznet

#ICCV #github #dl #video

GitHub

Deep neural network for object detection and semantic segmentation in real time. Official code for the paper "BlitzNet: A Real-Time Deep Network for Scene Understanding".

10.0K views 12:21

Data Science by ODS.ai 🦜

A cool paper from Facebook AI (not from FAIR!) about detecting and reading text in images, at scale.

This is very useful for detecting inappropriate content on Facebook.

The system uses R-CNN/Detectron for detecting lines of text.

The OCR stage uses a ConvNet that is applied to a whole line of text and trained with CTC (connectionist temporal classification).

This concept of applying a ConvNet on a whole line of text, without prior segmentation, has roots in the early days of ConvNets, for example with this NIPS 1992 paper:
"Multi-Digit Recognition Using a Space Displacement Neural Network"
by Ofer Matan, Chris Burges, Yann LeCun and John Denker.
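As a rough illustration of how a CTC-trained line recognizer can be read out without segmenting characters first, here is a minimal greedy decoder in plain Python. The post does not say which decoder the Facebook system uses, so treat this as a generic sketch: the function name `ctc_greedy_decode`, the three-symbol alphabet, and the per-frame scores are assumptions invented for the example. Greedy decoding takes the best label per frame, collapses consecutive repeats, and drops the blank symbol.

```python
from typing import List, Sequence


def ctc_greedy_decode(frame_scores: Sequence[Sequence[float]],
                      alphabet: Sequence[str], blank: int = 0) -> str:
    """Greedy CTC decoding: argmax per frame, merge repeats, remove blanks."""
    # Best label index for each frame along the text line.
    best_path: List[int] = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    decoded = []
    previous = None
    for index in best_path:
        # Emit a symbol only when it differs from the previous frame
        # and is not the blank; a blank between repeats keeps both copies.
        if index != previous and index != blank:
            decoded.append(alphabet[index])
        previous = index
    return "".join(decoded)


# Hypothetical scores: index 0 is the CTC blank, 1 is "a", 2 is "b".
alphabet = ["-", "a", "b"]
frame_scores = [
    [0.10, 0.80, 0.10],  # a
    [0.20, 0.70, 0.10],  # a (repeat, merged)
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.80, 0.10],  # a (new symbol after blank)
    [0.10, 0.20, 0.70],  # b
    [0.10, 0.10, 0.80],  # b (repeat, merged)
    [0.90, 0.05, 0.05],  # blank
]
text = ctc_greedy_decode(frame_scores, alphabet)
print(text)
```

The blank symbol is what lets the model output a doubled letter: the two "a" frames separated by a blank decode to two characters, while adjacent repeats merge into one.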

Link: https://papers.nips.cc/paper/557-multi-digit-recognition-using-a-space-displacement-neural-network
YouTube video with a short explanation: https://youtu.be/yl3P2tYewVg

#ocr #cv #dl #rnn #facebook #yannlecun #video

papers.nips.cc

Multi-Digit Recognition Using a Space Displacement Neural Network

Electronic Proceedings of Neural Information Processing Systems

5.4K views 06:58

Data Science by ODS.ai 🦜

A recent #MIT release for video labeling

YouTube: https://www.youtube.com/watch?v=JBwSk6nJOyM&feature=youtu.be
GitHub: https://github.com/metalbubble/TRN-pytorch

#dl #video

YouTube

How a Temporal Relation Network understands what's going on there

The video shows a TRN predicting the ongoing activity. Yes, I am playing with my hands :)
The TRN model and code are available at https://github.com/metalbubble/TRN-pytorch
The model is trained on the Something-Something-V2 dataset.

5.0K views 13:52

Data Science by ODS.ai 🦜

🎓 Free «Advanced Deep Learning and Reinforcement Learning» course.

#DeepMind researchers have released video recordings of lectures from «Advanced Deep Learning and Reinforcement Learning», a course on deep RL taught at #UCL earlier this year.

YouTube Playlist: https://www.youtube.com/playlist?list=PLqYmG7hTraZDNJre23vqCGIVpfZ_K2RZs

#course #video #RL #DL

7.0K views 09:21

Data Science by ODS.ai 🦜

Large-Scale Object Mining for Object Discovery from Unlabeled Video

A paper about the process of discovering objects in unlabeled video.

Link: https://arxiv.org/abs/1903.00362

#Video #DL #CV

7.2K views 08:27

Data Science by ODS.ai 🦜

Google announced the updated YouTube-8M dataset

The updated set now includes a subset with verified 5-second segment-level labels. It was announced along with the 3rd Large-Scale Video Understanding Challenge and Workshop at #ICCV19.

Link: https://ai.googleblog.com/2019/06/announcing-youtube-8m-segments-dataset.html

#Google #YouTube #CV #DL #Video #dataset

11.1K views, edited 09:06
