DS/ML digest 28
Google open sources pre-trained BERT ... with 102 languages ...
https://spark-in.me/post/2018_ds_ml_digest_28
#digest
#deep_learning
#data_science
Fast-text trained on a random mix of Russian Wikipedia / Taiga / Common Crawl
On our benchmarks, it was marginally better than fast-text trained on Araneum from Rusvectors.
Download link
https://goo.gl/g6HmLU
Params
Standard params: character n-grams of length 3 to 6, vector dimensionality 300.
Usage:
import fastText as ft
And then just refer to
ft_model_big = ft.load_model('model')
https://github.com/facebookresearch/fastText/blob/master/python/fastText/FastText.py
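A minimal usage sketch for querying the loaded model (the model path follows the snippet above; the query word is just an example):
import fastText as ft

ft_model = ft.load_model('model')            # path to the downloaded .bin file
vec = ft_model.get_word_vector('привет')     # 300-dim vector; OOV words are assembled from the (3,6) char n-grams
print(ft_model.get_dimension())              # prints 300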
#nlp
Playing with Transformer
TL;DR - use it only pre-trained.
On classification tasks it performed the same as classic models.
On seq2seq it is much worse time- and memory-wise; inference is faster, though.
#nlp
Towards Data Science
Our article was accepted to their publication:
- https://towardsdatascience.com/building-client-routing-semantic-search-in-the-wild-14db04687c7e
Also, once you have published there, you can keep publishing your work on TDS on a recurring basis =)
I doubt that this will be properly distributed to all 130k of their subscribers, but nevertheless this is a milestone.
#data_science
A small saga about keeping GPUs cool
(1) 1-2 GPUs with blower fans (or turbo fans) in a full tower - idle 40-45C, full load 80-85C
(2) 3-4 GPUs with blower fans (or turbo fans) in a full tower - idle 45-55C, full load 85-95C
Also with 3-4+…
When it is colder, under full load GPUs run at 70C
An intro to RL
Though published by OpenAI with TF, this is simply amazing:
- https://spinningup.openai.com/en/latest/spinningup/rl_intro.html
#rl
Forwarded from Karim Iskakov - канал (karfly_bot)
"80 years of AI research. Epic battle between connectionist (~neural networks) and symbolic (~rule based) methods. Who will win?"
👤 @OriolVinyalsML (twitter)
📉 @loss_function_porn
Problems with GPUs in DL box
Cryptic messages like:
GPU is lost
Usually this is either:
- the PSU;
- bad PCIe contact;
- or too much load on the PCIe bus;
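A quick way to see which case you are dealing with (a sketch; assumes a standard NVIDIA driver install):
nvidia-smi                    # a lost GPU usually shows up as ERR! or disappears from the list
sudo dmesg | grep -i xid      # NVIDIA Xid errors in the kernel log often point at PCIe / power problems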
#deep_learning
Our victory in CFT-2018 competition
TL;DR:
- Multi-task learning + seq2seq models rule;
- The domain seems easy, but it is not;
- You can also build a pipeline based on manual features, but it will not be task-agnostic;
- Loss weighting is crucial for such tasks (see the sketch below);
- The Transformer trains 10x longer;
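A toy sketch of the loss-weighting idea for such a multi-task setup (the losses and weights below are made up purely for illustration, not our actual pipeline):
import torch

params = torch.randn(4, requires_grad=True)       # stand-in for shared model parameters
loss_seq2seq = (params ** 2).mean()                # stand-in for the seq2seq (spelling correction) loss
loss_clf = params.abs().mean()                     # stand-in for an auxiliary classification loss

# an unweighted sum lets one task dominate the gradients, so the weights have to be tuned
w_seq2seq, w_clf = 1.0, 0.5
loss = w_seq2seq * loss_seq2seq + w_clf * loss_clf
loss.backward()                                    # gradients now reflect the chosen task weights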
https://spark-in.me/post/cft-spelling-2018
#nlp
#deep_learning
#data_science
TDS article follow-up
TDS also accepted a reprint of the article
https://towardsdatascience.com/winning-a-cft-2018-spelling-correction-competition-b771d0c1b9f6
#nlp
Jupyter extensions
Looks like they are near the end of their support.
Alas.
On a fresh build you will need this to keep using them:
conda install notebook=5.6
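For reference, a typical sequence to get the extensions themselves installed on top of the pinned notebook (package names assume conda-forge; adjust as needed):
conda install -c conda-forge jupyter_contrib_nbextensions
jupyter contrib nbextension install --user
jupyter nbextension enable toc2/main      # enable a specific extension, e.g. the table of contents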
Will need to invest some time into making Jupyter Lab actually usable.
#data_science
Getting your public key from Github ... with wget!
I kind of saw it when installing Ubuntu 18 from scratch. But it is super awesome!
wget -O - https://github.com/snakers4.keys >> test
Just replace test with your authorized_keys file and profit!
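For example, with your own GitHub login and the standard key location (the path is the usual default, adjust if yours differs):
wget -O - https://github.com/YOUR_GITHUB_USER.keys >> ~/.ssh/authorized_keys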
#linux
Creating a new user
With the above hack, user creation can be done as easy as:
USER="YOUR_USER" && \#linux
GROUP="YOUR_GROUP" && \
sudo useradd $USER && \
sudo adduser $USER $GROUP && \
sudo mkdir -p /home/$USER/.ssh/ && \
sudo touch /home/$USER/.ssh/authorized_keys && \
sudo chown -R $USER:$USER /home/$USER/.ssh/ && \
sudo wget -O - https://github.com/$USER.keys | sudo tee -a /home/$USER/.ssh/authorized_keys
Article about the reality of CV in Russia / CIS
(RU)
http://cv-blog.ru/?p=253
Also a bit on how to handle the various types of "customers" who want to contract CV systems from you.
Warning - too much harsh reality)
#deep_learning
A cheeky ML/DS themed sticker pack for our channel
Thanks to @birdborn for his art.
You are welcome to use it:
https://t.me/addstickers/ML_spark_in_me_by_BB
If you would like to contribute / create your own stickers - please ask around in our channel chat.
#data_science