BERT Rediscovers the Classical NLP Pipeline
Tenney et al.: https://arxiv.org/abs/1905.05950
#artificialintelligence #bert #machinelearning #nlp
arXiv.org
BERT Rediscovers the Classical NLP Pipeline
Pre-trained text encoders have rapidly advanced the state of the art on many NLP tasks. We focus on one such model, BERT, and aim to quantify where linguistic information is captured within the...
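A rough sketch of the probing setup this kind of analysis builds on: read token representations from every BERT layer and learn a scalar mix over them, then train a small probe on the mixed vectors. This uses the Hugging Face transformers package rather than the authors' edge-probing code, and the probing classifier itself is omitted:

```python
# Minimal sketch: scalar mix over all BERT layers (probe classifier not shown).
import torch
import torch.nn as nn
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

class ScalarMix(nn.Module):
    """Learned softmax-weighted combination of all encoder layers."""
    def __init__(self, num_layers):
        super().__init__()
        self.weights = nn.Parameter(torch.zeros(num_layers))
        self.gamma = nn.Parameter(torch.ones(1))

    def forward(self, layers):  # layers: tuple of (batch, seq_len, hidden) tensors
        w = torch.softmax(self.weights, dim=0)
        return self.gamma * sum(wi * layer for wi, layer in zip(w, layers))

inputs = tokenizer("The keys to the cabinet are on the table", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)

# out.hidden_states: embedding layer plus one tensor per encoder layer (13 for bert-base)
mix = ScalarMix(num_layers=len(out.hidden_states))
token_reprs = mix(out.hidden_states)   # these would feed a small probing classifier
print(token_reprs.shape)               # (1, seq_len, 768)
```

The learned mixing weights are the quantity of interest: they indicate which layers a given task draws on most.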
Introducing FastBert — A simple Deep Learning library for BERT Models
Blog by Kaushal Trivedi: https://medium.com/huggingface/introducing-fastbert-a-simple-deep-learning-library-for-bert-models-89ff763ad384
#MachineLearning #ArtificialIntelligence #NLP #Bert #NaturalLanguageProcessing
Medium
Introducing FastBert — A simple Deep Learning library for BERT Models
A simple to use Deep Learning library to build and deploy BERT models
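FastBert wraps the standard BERT fine-tuning loop behind a few calls. As a point of reference, a minimal version of that loop written directly against the Hugging Face transformers API (not FastBert's own interface; the batch below is a toy placeholder) looks roughly like this:

```python
# Sketch of the fine-tuning step that libraries like FastBert wrap.
import torch
from transformers import BertTokenizer, BertForSequenceClassification

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

# Toy batch; in practice this comes from a DataLoader over the training set.
texts = ["a delightful, sharply written film", "two hours I will never get back"]
labels = torch.tensor([1, 0])
batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

model.train()
outputs = model(**batch, labels=labels)   # cross-entropy loss computed internally
outputs.loss.backward()
optimizer.step()
optimizer.zero_grad()
print(float(outputs.loss))
```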
Visualizing and Measuring the Geometry of BERT
Coenen, Reif, Yuan et al.: https://arxiv.org/pdf/1906.02715.pdf
#ArtificialIntelligence #DeepLearning #BERT #NLP
arXiv.org
Visualizing and Measuring the Geometry of BERT
Transformer architectures show significant promise for natural language processing. Given that a single pretrained model can be fine-tuned to perform well on many different tasks, these networks...
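A quick way to poke at the geometry the paper studies is to compare contextual vectors of one word used in different senses. A small sketch (illustrative sentences, last layer only, Hugging Face transformers assumed):

```python
# Compare BERT's contextual vectors for "bank" in river vs. finance contexts.
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

def word_vector(sentence, word):
    """Last-layer vector of the first occurrence of `word` in `sentence`."""
    inputs = tokenizer(sentence, return_tensors="pt")
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    idx = tokens.index(word)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state[0]
    return hidden[idx]

v_river = word_vector("he sat on the bank of the river .", "bank")
v_money = word_vector("she deposited cash at the bank .", "bank")
v_money2 = word_vector("the bank approved the loan .", "bank")

cos = torch.nn.functional.cosine_similarity
print("river vs money :", float(cos(v_river, v_money, dim=0)))
print("money vs money :", float(cos(v_money, v_money2, dim=0)))
```

Clustering or projecting such vectors (e.g. with PCA/UMAP) reproduces the kind of sense separation the paper visualizes.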
What Does BERT Look At? An Analysis of BERT's Attention
Clark et al.: https://arxiv.org/abs/1906.04341
Code: https://github.com/clarkkev/attention-analysis
#bert #naturallanguage #unsupervisedlearning
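The released code linked above does the full analysis; as a minimal sketch, the attention maps it studies can be pulled out of any BERT checkpoint like this (the layer/head indices below are arbitrary):

```python
# Extract per-head attention maps from BERT for inspection.
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

inputs = tokenizer("The cat sat on the mat because it was warm.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs, output_attentions=True)

# outputs.attentions: one tensor per layer, shape (batch, heads, seq_len, seq_len)
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
layer, head = 7, 10                      # arbitrary head to inspect
attn = outputs.attentions[layer][0, head]
for i, tok in enumerate(tokens):
    top = attn[i].argmax().item()        # which token this position attends to most
    print(f"{tok:>10} -> {tokens[top]}")
```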
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Liu et al.: https://arxiv.org/abs/1907.11692
#bert #naturallanguageprocessing #unsupervisedlearning
arXiv.org
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Language model pretraining has led to significant performance gains but careful comparison between different approaches is challenging. Training is computationally expensive, often done on private...
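One of RoBERTa's changes over BERT is dynamic masking: tokens are re-masked every time a sequence is served rather than once during preprocessing. A simplified sketch of that step (standard 15% / 80-10-10 masking recipe; Hugging Face tokenizer assumed, padding and batching omitted):

```python
# Dynamic masking sketch: fresh mask positions on every call.
import torch
from transformers import RobertaTokenizer

tokenizer = RobertaTokenizer.from_pretrained("roberta-base")

def dynamic_mask(input_ids, mask_prob=0.15):
    """Freshly mask a 1-D tensor of token ids with the 80/10/10 replacement scheme."""
    labels = input_ids.clone()
    special = torch.tensor(
        tokenizer.get_special_tokens_mask(input_ids.tolist(), already_has_special_tokens=True),
        dtype=torch.bool)
    probs = torch.full(input_ids.shape, mask_prob)
    probs.masked_fill_(special, 0.0)          # never mask <s>, </s>
    masked = torch.bernoulli(probs).bool()
    labels[~masked] = -100                    # ignored by the MLM loss
    # of the selected positions: 80% -> <mask>, 10% -> random token, 10% -> unchanged
    to_mask = torch.bernoulli(torch.full(input_ids.shape, 0.8)).bool() & masked
    input_ids[to_mask] = tokenizer.mask_token_id
    to_random = torch.bernoulli(torch.full(input_ids.shape, 0.5)).bool() & masked & ~to_mask
    input_ids[to_random] = torch.randint(len(tokenizer), input_ids.shape)[to_random]
    return input_ids, labels

ids = tokenizer("RoBERTa re-masks every sequence on the fly.", return_tensors="pt")["input_ids"][0]
print(dynamic_mask(ids.clone()))              # different positions masked on every call
```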
Smaller, faster, cheaper, lighter: Introducing DistilBERT, a distilled version of BERT
Blog by Victor Sanh: https://medium.com/huggingface/distilbert-8cf3380435b5
#MachineLearning #NLP #Bert #Distillation #Transformers
Medium
🏎 Smaller, faster, cheaper, lighter: Introducing DistilBERT, a distilled version of BERT
You can find the code to reproduce the training of DistilBERT along with pre-trained weights for DistilBERT here.
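The core of the recipe is knowledge distillation: the student is trained to match the teacher's temperature-softened output distribution alongside the regular supervised loss. A minimal, generic sketch of that objective (toy classification logits; hyperparameters illustrative, not DistilBERT's exact recipe, which also adds a cosine embedding loss and distills on masked language modelling):

```python
# Generic distillation loss: soft targets from the teacher plus hard-label loss.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # KL divergence between temperature-softened distributions, scaled by T^2
    soft = F.kl_div(F.log_softmax(student_logits / T, dim=-1),
                    F.softmax(teacher_logits / T, dim=-1),
                    reduction="batchmean") * (T * T)
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# toy example with random logits for a 3-class problem
student = torch.randn(4, 3, requires_grad=True)
teacher = torch.randn(4, 3)
labels = torch.tensor([0, 2, 1, 0])
loss = distillation_loss(student, teacher, labels)
loss.backward()
print(float(loss))
```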
Extreme Language Model Compression with Optimal Subwords and Shared Projections
Zhao et al.: https://arxiv.org/abs/1909.11687
#neuralnetwork #bert #nlp
arXiv.org
Extremely Small BERT Models from Mixed-Vocabulary Training
Pretrained language models like BERT have achieved good results on NLP tasks, but are impractical on resource-limited devices due to memory footprint. A large fraction of this footprint comes from...
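The abstract's point about the embedding table is easy to check with back-of-the-envelope numbers; a tiny sketch (standard BERT-base shapes, fp32 parameters assumed, the reduced vocabulary size is a hypothetical example, not the paper's setting):

```python
# Rough footprint arithmetic: how much of BERT-base is the subword embedding table.
hidden = 768
vocab_bert = 30522          # BERT-base WordPiece vocabulary
total_params = 110e6        # approximate BERT-base parameter count

emb_params = vocab_bert * hidden
print(f"embedding table: {emb_params/1e6:.1f}M params "
      f"({100*emb_params/total_params:.0f}% of ~110M)")

vocab_small = 5000          # hypothetical reduced vocabulary
saved = (vocab_bert - vocab_small) * hidden
print(f"shrinking vocab to {vocab_small}: saves about {saved*4/1e6:.0f} MB at fp32")
```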
The Illustrated GPT-2 (Visualizing Transformer Language Models)
Blog by Jay Alammar: https://jalammar.github.io/illustrated-gpt2/
#BERT #Transformer #ArtificialIntelligence
jalammar.github.io
The Illustrated GPT-2 (Visualizing Transformer Language Models)
Discussions:
Hacker News (64 points, 3 comments), Reddit r/MachineLearning (219 points, 18 comments)
Translations: Simplified Chinese, French, Korean, Russian, Turkish
This year, we saw a dazzling application of machine learning. The OpenAI GPT…
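To go with the visual walkthrough, here is the model in actual use: sampling a continuation from the small GPT-2 checkpoint via Hugging Face transformers (prompt and sampling settings are arbitrary):

```python
# Sample a continuation from GPT-2 (small checkpoint).
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

prompt = "The transformer architecture"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids
with torch.no_grad():
    output = model.generate(input_ids, max_length=40, do_sample=True,
                            top_k=40, pad_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```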
exBERT: A Visual Analysis Tool to Explore Learned Representations in Transformer Models
Benjamin Hoover, Hendrik Strobelt, Sebastian Gehrmann: http://exbert.net
#NLP #BERT #LanguageModel