Microsoft DeepSpeed: an open-source deep-learning training optimization library (blog, website, github), new version just released
#Deep_Learning #Optimization
@ml_nlp_cv
#Deep_Learning #Optimization
@ml_nlp_cv
Microsoft Research
DeepSpeed: Extreme-scale model training for everyone - Microsoft Research
DeepSpeed continues to innovate, making its tools more powerful while broadening its reach. Learn how it now powers 10x bigger model training on one GPU, 10x longer input sequences, 5x less communication volume, & scales to train trillion-parameter models.