ArtificialIntelligenceArticles
3.04K subscribers
1.64K photos
9 videos
5 files
3.86K links
for who have a passion for -
1. #ArtificialIntelligence
2. Machine Learning
3. Deep Learning
4. #DataScience
5. #Neuroscience

6. #ResearchPapers

7. Related Courses and Ebooks
Download Telegram
MegatronLM: Training Billion+ Parameter Language Models Using GPU Model Parallelism
"... training an 8.3 billion parameter transformer language model with 8-way model parallelism and 64-way data parallelism on 512 GPUs, making it the largest transformer based language model ever trained at 24x the size of BERT and 5.6x the size of GPT-2."
Blog by NVIDIA Applied Deep Learning Research : https://nv-adlr.github.io/MegatronLM
Code: https://github.com/nvidia/megatron-lm
#ArtificialIntelligence #DeepLearning #NLP #PyTorch #Transformer
Write With Transformer
See how a modern neural network auto-completes your text 🤗
With the brand new GPT-2 large!
Built by the phenomenal Hugging Face team : https://transformer.huggingface.co
H / T : Lysandre Debut
#GPT2 #NeuralNetwork #Transformer
Write With Transformer
Hugging Face released a new version of their Write With Transformer app, using a language model trained directly on Arxiv to generate Deep Learning and NLP completions!
In addition, they add state-of-the-art NLP models such as GPT, GPT-2 and XLNet completions:

https://transformer.huggingface.co/

H / T : Lysandre Debut
#Transformer #Pytorch #NLP

@ArtificialIntelligenceArticles
On Extractive and Abstractive Neural Document Summarization with Transformer Language Models"
Sandeep Subramanian, Raymond Li, Jonathan Pilault, Christopher Pal : https://arxiv.org/abs/1909.03186
#transformer #naturallanguageprocessing #machinelearning
Forecaster: A Graph Transformer for Forecasting Spatial and Time-Dependent Data
Yang Li, José M. F. Moura : https://arxiv.org/abs/1909.04019v3
#MachineLearning #ArtificialIntelligence #Transformer
Multi-Graph Transformer for Free-Hand Sketch Recognition
Xu et al.: https://arxiv.org/abs/1912.11258
#ArtificialIntelligence #DeepLearning #Transformer