Data Science by ODS.ai 🦜 – Telegram

Data Science by ODS.ai 🦜

@opendatascience

49.7K subscribers

395 photos

43 videos

7 files

1.54K links

First Telegram Data Science channel. Covering all technical and popular stuff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of the former. To reach the editors, contact: @haarrp

Most Scots NLP models trained on Wikipedia are flawed

One person who had made 200,000 edits and written 20,000 articles on Scots Wikipedia was not writing in the Scots language but faking it. Since Wikipedia texts are often used as a dataset for training #NLU / #NLP / #NMT neural nets, models trained on that text inherited the flaw.

Reddit thread: https://www.reddit.com/r/Scotland/comments/ig9jia/ive_discovered_that_almost_every_single_article/

#datasets #translation #scots #wikipedia

From the Scotland community on Reddit: I’ve discovered that almost every single article on the Scots version of Wikipedia is written…

13.8K views · 18:50

🤣 43 🌝 18 🦊 6