Novel topic modelling techniques

Looks interesting.
If anyone knows about this - please ping me in PM.

Handy OpenCV snippet to turn a grayscale label mask (i.e. 0 for background, 1, 2, 3, etc. for objects) into a colour map:

import cv2
y_pred_coloured = cv2.applyColorMap((y_pred / y_pred.max() * 255).astype('uint8'), cv2.COLORMAP_JET)
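One caveat with the one-liner: it divides by y_pred.max(), so a mask with only background (all zeros) ends up as 0 / 0. Below is a minimal sketch of a safer scaling step, assuming y_pred is a non-negative integer label mask. The helper name labels_to_uint8 is hypothetical, and it uses integer arithmetic instead of the float division above.

```python
import numpy as np

def labels_to_uint8(y_pred):
    """Scale a non-negative integer label mask into the 0..255 range."""
    y_pred = np.asarray(y_pred)
    max_label = int(y_pred.max()) if y_pred.size else 0
    if max_label == 0:
        # Only background present: skip the division and return zeros.
        return np.zeros(y_pred.shape, dtype=np.uint8)
    # Integer arithmetic keeps the result exact (floor of label * 255 / max).
    return (y_pred.astype(np.int64) * 255 // max_label).astype(np.uint8)

mask = np.array([[0, 1], [2, 3]])
scaled = labels_to_uint8(mask)
```

The result is a single-channel uint8 array, so it can be passed to cv2.applyColorMap(scaled, cv2.COLORMAP_JET) in place of the inline expression.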

For some reason, when you use PyTorch multi-threaded data loaders, they stall if you use OpenCV and do not set

Nice to know this.

The trend for smaller / inadequate prizes and weird datasets continues:

- This year's DS Bowl on Kaggle features a small public train dataset (650 images) vs. a much larger delayed validation dataset (3000 images). Cheap annotation, anyone? =) Also remarkably, for a somewhat difficult task (instance segmentation), the prize is much lower than last year;

- The new autonomous driving contest on Kaggle, as well as other CVPR competitions, feature extremely large datasets, extremely low prizes (US$1-2k), and no travel costs to CVPR covered. Ofc you can win and fly there, but the prize will not even cover your GPU costs;

- The recent xView challenge, which I really wanted to participate in, requires a US tax number to be eligible for prizes. Of course they do not know about double taxation treaties and W-8 tax exemptions;

Alas =(

- Ben Evans
- About autonomous cars - autonomy will vary based on the route / conditions / situation / use case
- FB delays its speaker
- Foxconn buys Belkin
- Amazon Music > 10m subs
- The Economist about ML in business
- Apple to make its own chips

As you may know (for newer people on the channel), sometimes we publish small articles on the website.

This time it covers a recent Power Laws challenge on DrivenData, which at first seemed legit and cool, but in the end turned back into a pumpkin.

Here is an article:

A bit more on semantic segmentation, now 3D

V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

--> Link / authors: Fausto Milletari / Nassir Navab / Seyed-Ahmad Ahmadi
--> Essence:
(0) Essentially applies UNet to 3D with a custom DICE based loss
(1) Architecture - basically UNet with 3D convolutions. Upsampling / downsampling
(2) PReLU (no ablation test)
(3) Receptive fields of layers
(4) 3D DICE loss (without BCE?)
--> The paper does not squeeze out all the juice possible - hacky transfer learning (obvious idea - just stacking ImageNet filters), CLR, LinkNet architectures, etc.
--> Looks like a good baseline / reference
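For reference, a minimal NumPy sketch of a soft DICE loss on a 3D volume. It assumes the squared-sum form of the denominator, 2 * sum(p * g) / (sum(p^2) + sum(g^2)). The eps smoothing term and the function name soft_dice_loss are my own additions for illustration, not something taken from the paper's code.

```python
import numpy as np

def soft_dice_loss(pred, target, eps=1e-7):
    """1 - Dice coefficient between predicted probabilities and a binary mask."""
    pred = np.asarray(pred, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    intersection = np.sum(pred * target)
    # Squared sums in the denominator (assumed V-Net-style formulation).
    denominator = np.sum(pred ** 2) + np.sum(target ** 2)
    # eps is an illustrative smoothing term to avoid 0 / 0 on empty volumes.
    dice = (2.0 * intersection + eps) / (denominator + eps)
    return 1.0 - dice

# Toy 4x4x4 volume with a 2x2x2 foreground cube.
target = np.zeros((4, 4, 4))
target[1:3, 1:3, 1:3] = 1.0

perfect = soft_dice_loss(target, target)               # prediction equals the mask
empty = soft_dice_loss(np.zeros_like(target), target)  # predicts nothing
```

A perfect prediction gives a loss of about 0, and an all-background prediction gives a loss of about 1, regardless of how small the foreground is. That is the usual argument for DICE on heavily imbalanced volumes.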

An application of cascaded 3D fully convolutional networks for medical image segmentation

--> Authors - a group of Japanese researchers
--> Essence:
(0) 2 stage 3D UNet, ablation test against 2D FCNs
(1) Loss - 3D cross-entropy
(2) Transfer learning - it works for other datasets, gives a mild boost (1-3%)
(3) 80-90% DICE, varies by organ
(4) weights downloadable (Caffe...)
--> Essentially a 2 stage process is dictated by memory considerations:
(0) Pipeline
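To illustrate why a second stage helps with memory, here is a rough coarse-to-fine sketch. It assumes the first stage outputs a coarse binary mask and the second stage only sees the cropped region around it. This is a guess at the general idea, not the paper's actual pipeline; bounding_box and the margin parameter are hypothetical.

```python
import numpy as np

def bounding_box(mask, margin=2):
    """Return slices covering the non-zero region of a 3D mask, padded by margin."""
    coords = np.argwhere(mask)
    if coords.size == 0:
        # Nothing found by the first stage: fall back to the full volume.
        return tuple(slice(0, s) for s in mask.shape)
    lo = np.maximum(coords.min(axis=0) - margin, 0)
    hi = np.minimum(coords.max(axis=0) + 1 + margin, mask.shape)
    return tuple(slice(int(a), int(b)) for a, b in zip(lo, hi))

volume = np.random.rand(64, 64, 64)          # stand-in for a CT volume
coarse_mask = np.zeros(volume.shape, dtype=bool)
coarse_mask[20:30, 25:35, 10:40] = True      # stand-in for a stage 1 prediction

roi = bounding_box(coarse_mask)
crop = volume[roi]                           # much smaller input for stage 2
```

Here the second stage would process a 14 x 14 x 34 crop instead of the full 64 x 64 x 64 volume.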

In the long run transfer learning may rule, but here legal limitations may slow down this process.

YOLOv3 - best paper.
Not in terms of scientific contribution, but as a rebuttal of DS community BS.
Very funny read.

If you want a proper comparison of object detection algorithms - use this paper

Looks like SSD and YOLO are reasonably good and fast, and RCNN can be properly tuned to be 3-5x slower (not 100x) and more accurate.

DS Bowl 2018 stage 2 data was released.
It has a completely different distribution from stage 1 data.
How do you like them apples?

Looks like Kaggle admins really have no idea about dataset curation, or all of this is meant to misguide manual annotators.

Anyway - looks like random bs.

So I briefly dug into running a containerized GPU accelerated GUI app (I want to be able to run some apps I do not really want on my host).

Docker kind of works for this purpose, but I only found working guides for nvidia-docker, not nvidia-docker2.

Looks like if you want to run a Linux container on a Linux host, then LXD is a good option. It is high level and seems to have an easy-to-use API. I will report back if it works for me.

- Guide
- LXD vs Docker
- Extensive LXD tutorial

So, usually I try to stay away from such controversial topics, but I have to address an elephant in the room. You all know that I am originally from Russia and that I have quite liberal world views.

Seeing that many people have started to ride the hype and advertise expensive "solutions", I decided to do a post today about creating your own SOCKS5 proxy server via a droplet on Digital Ocean:
- Post - note that unlike my other posts, this one is a step-by-step explanation;
- It explains how to create your own SOCKS5 proxy server using Ubuntu and Digital Ocean with dante;
- The cheapest Digital Ocean droplet is US$5 per month (you can find such droplets for as low as US$2-3 with inferior service);
- If you use my referral link, you will get US$10 for free
- Also you can create credentials for your friends and family;
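As a side note on what those credentials look like on the wire, here is a small sketch of the first two client messages in a SOCKS5 session with username / password auth, following my reading of RFC 1928 and RFC 1929. The function names and the sample credentials are made up for illustration, and nothing here is specific to dante.

```python
def socks5_greeting():
    """Client greeting: version 5, one auth method offered, 0x02 = username/password."""
    return bytes([0x05, 0x01, 0x02])

def socks5_auth_request(username, password):
    """Username/password sub-negotiation: version 1, then length-prefixed fields."""
    user = username.encode("utf-8")
    pwd = password.encode("utf-8")
    if not (1 <= len(user) <= 255 and 1 <= len(pwd) <= 255):
        raise ValueError("username and password must be 1..255 bytes each")
    return bytes([0x01, len(user)]) + user + bytes([len(pwd)]) + pwd

greeting = socks5_greeting()
request = socks5_auth_request("alice", "secret")  # hypothetical credentials
```

Note that the login and password travel in plain text inside this message, so it makes sense to give each friend or family member a separate account rather than share one.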

So, I just found out that the Firefox rendering engine was rewritten. It now boasts the fastest speeds and support for ... SOCKS5 proxies, both on mobile and desktop.

Also, projects like Orbot + Orfox help in more extreme cases.
