Data Science by ODS.ai 🦜

Should we create official chat for the channel to discuss links, answer common questions and to flood (during nighttime) ?

We count every opinion and listen to your feedback, so please vote.

We also preparing special event for the chat creation, so stay tuned for the announcement

9.0K views16:53

Data Science by ODS.ai 🦜

Microsoft open-sourced scripts and notebooks to pre-train and finetune BERT natural language model with domain-specific texts

Github: https://github.com/microsoft/AzureML-BERT
Earlier: https://t.me/opendatascience/837

#Bert #Microsoft #NLP #dl

GitHub

GitHub - microsoft/AzureML-BERT: End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service

End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service - microsoft/AzureML-BERT

9.6K viewsedited 07:58

🔥 35 😑 7

Data Science by ODS.ai 🦜

Forwarded from Karim Iskakov - канал (Vladimir Ivashkin)

This media is not supported in your browser

VIEW IN TELEGRAM

I'd like to present our new paper with Yandex.Weather! We are pioneers in using a combination of satellite images, radar shots and neural networks for real-time rain forecast. Check out our video for more details!
▶️ youtu.be/9zd3VR-prYU
🔎 yandex.com/weather/nowcast
📝 arxiv.org/abs/1905.09932
📉 @loss_function_porn

454 views11:45

Data Science by ODS.ai 🦜

Checkout our friends' recent publication.

8.9K views11:45

Data Science by ODS.ai 🦜

ODS breakfast in Paris! See you this Saturday at 10:30 at Malongo Café, 50 Rue Saint-André des Arts.

9.1K views12:43

Data Science by ODS.ai 🦜

TabNine showed deep learning code autocomplete tool based on GPT-2 architecture.

Video demonstrates the concept. Hopefully, it will allow us to write code with less bugs, not more.

Link: https://tabnine.com/blog/deep
Something relatively similar by Microsoft: https://visualstudio.microsoft.com/ru/services/intellicode

#GPT2 #TabNine #autocomplete #product #NLP #NLU #codegeneration

10.1K viewsedited 06:47

🔥 72 😑 5

Data Science by ODS.ai 🦜

Great collection of practical rules for routine DS engineering / research job.

Machine Learning in a company is 10% Data Science & 90% other challenges, this pdf provides a great deal of principals and solutions to deal with them.

We can only recommend saving this post to your Saved Messages by forwarding it to yourself.

Link: http://martin.zinkevich.org/rules_of_ml/rules_of_ml.pdf

#cheatsheet #advice #practical #common #shouldbesaved

11.7K views09:27

Data Science by ODS.ai 🦜

YouTokenToMe, new tool for text tokenisation from VK team

Meet new enhanced tokenisation tool on steroids. Works 7-10 times faster alphabetic languages and 40 to 50 times faster on logographic languages, than alternatives.

Under the hood (watch source) there is C++ implementation with python bindings, using Byte Pair Encoding (BPE) algorithm. YouTokenToMe beats #SentencePiece by Google and #fastBPE, created by a researcher from Facebook AI Research in terms of speed.

Github: https://github.com/vkcom/YouTokenToMe
Medium: https://medium.com/@vktech/youtokentome-a-tool-for-quick-text-tokenization-from-the-vk-team-aa6341215c5a
Byte Pair Encoding: https://arxiv.org/abs/1508.07909

9.7K viewsedited 09:14

⚡ 64 🚨 4

Data Science by ODS.ai 🦜

This improvement for everyday used toolset deserves minimum 50 claps at Medium and a Star on github!
Let’s give VK research team appreciation from the community they deserve 👏!!

9.0K views09:16

Data Science by ODS.ai 🦜

What’s wrong with transformer architecture: an overview

How the Transformers broke NLP leaderboards and why that can be bad for industry.

Link: https://hackingsemantics.xyz/2019/leaderboards/

#NLP #overview #transformer #BERT #XLNet

Hacking semantics

How the Transformers broke NLP leaderboards

With the huge Transformer-based models such as BERT, GPT-2, and XLNet, are we losing track of how the state-of-the-art performance is achieved?

9.2K views21:21

Data Science by ODS.ai 🦜

Simultaneous food and facial recognition at a Foxconn factory canteen, Shenzhen China

#video #foodlearning #facerecogniction #dl #cv #foxconn

0:13

10.2K viewsedited 21:30

👀 77 🥙 19

Data Science by ODS.ai 🦜

Deep Learning Image Segmentation for Ecommerce Catalogue Visual Search

Microsoft’s article on image segmentation

Link: https://www.microsoft.com/developerblog/2018/04/18/deep-learning-image-segmentation-for-ecommerce-catalogue-visual-search/

#CV #DL #Segmentation #Microsoft

9.7K viewsedited 07:26

✂️ 36 👗 7

Data Science by ODS.ai 🦜

Google AI research on learning better simulation methods for partial differential equations

New research shows how machine learning can improve high-performance computing for solving partial differential equations, with potential applications that range from modeling #climatechange to simulating fusion reactions. Learn all about it here

Link: https://ai.googleblog.com/2019/07/learning-better-simulation-methods-for.html

#PDE #DE #GoogleAI

8.5K views07:49

Data Science by ODS.ai 🦜

On the concept of 'intellectual debt'

There is technical debt — when you know you should rewrite some stuff, or implement some features, but they don't seem critical at the moment. So article introduces a concept of 'intellectual debt', which resies with more broad and common use of #MachineLearning and #DeepLearning (specially, the latter). What happens when AI gives us seemingly correct answers that we wouldn't have thought of ourselves, without any theory to explain them?

Link: https://www.newyorker.com/tech/annals-of-technology/the-hidden-costs-of-automated-thinking

#Meta #common #lyrics

The New Yorker

The Hidden Costs of Automated Thinking

Overreliance on artificial intelligence may put us in intellectual debt.

9.4K viewsedited 08:57

🥶 – 18 🥵 – 6

Data Science by ODS.ai 🦜

New dataset with adversarial examples

Natural Adversarial Examples are real-world and unmodified examples which cause classifiers to be consistently confused. The new dataset has 7,500 images, which we personally labeled over several months.

ArXiV: https://arxiv.org/abs/1907.07174
Dataset and code: https://github.com/hendrycks/natural-adv-examples

#Dataset #Adversarial

8.4K viewsedited 10:14

🤩 46 🤨 12

Data Science by ODS.ai 🦜

Release of 27 pretrained models for NLP / NLU for PyTorch

Hugging Face open sources a new library that contains up to 27 pretrained models to conduct state-of-the-art NLP/NLU tasks.

Link: https://medium.com/dair-ai/pytorch-transformers-for-state-of-the-art-nlp-3348911ffa5b

#SOTA #NLP #NLU #PyTorch #opensource

9.1K viewsedited 04:59

🔥 53 😑 7

Data Science by ODS.ai 🦜

ODS breakfast in Paris! See you this Saturday at 10:30 at Malongo Café, 50 Rue Saint-André des Arts.

8.0K views08:10

Data Science by ODS.ai 🦜

Filter autoselect in VSCO by Google

#VSCO used #TensorFlow Lite to develop the 'For This Photo' feature, which uses on-device ML to suggest photo filter presets from a curated list.

YouTube: https://www.youtube.com/watch?v=fHbjfeitIvE
Link: https://medium.com/tensorflow/suggesting-presets-for-images-building-for-this-photo-at-vsco-9b94041c4ba4

#mobile #device #cv #dl

YouTube

VSCO — For This Photo

8.5K viewsedited 09:27

🥴📸 21

Data Science by ODS.ai 🦜

Baidu's recent paper: Hubless Nearest Neighbor Search

Hubless Nearest Neighbor Search, a new method for Bilingual Lexicon Induction, improves retrieval accuracy significantly. Empirical results show HNN outperforms NN, ISF and other state-of-the-art.

Github: https://github.com/baidu-research/HNN
Paper: https://github.com/baidu-research/HNN/blob/master/doc/HNN.pdf

#ACL2019 #NLP #NLU

GitHub

baidu-research/HNN

Contribute to baidu-research/HNN development by creating an account on GitHub.

8.5K views10:14

Data Science by ODS.ai 🦜

Plato Research Dialogue System: A Flexible Conversational AI Platform

The Plato Research Dialogue System is a platform #Uber developed to enable experts and non-experts alike to quickly build, train, and deploy conversational AI agents.

Link: https://eng.uber.com/plato-research-dialogue-system/

#ConversationalAI #converstaion #NLP #NLU

8.6K viewsedited 04:59

💭 5 💬 12 🚕 11

About

Blog

Apps

Platform