#ml
google-research/tuning_playbook: A playbook for systematically maximizing the performance of deep learning models.
https://github.com/google-research/tuning_playbook
#ml
https://mlcontests.com/state-of-competitive-machine-learning-2022/
Quote from the report:
Successful competitors have mostly converged on a common set of tools — Python, PyData, PyTorch, and gradient-boosted decision trees.
Deep learning still has not replaced gradient-boosted decision trees when it comes to tabular data, though it does often seem to add value when ensembled with boosting methods.
Transformers continue to dominate in NLP, and start to compete with convolutional neural nets in computer vision.
Competitions cover a broad range of research areas including computer vision, NLP, tabular data, robotics, time-series analysis, and many others.
Large ensembles remain common among winners, though single-model solutions do win too.
There are several active machine learning competition platforms, as well as dozens of purpose-built websites for individual competitions.
Competitive machine learning continues to grow in popularity, including in academia.
Around 50% of winners are solo winners; 50% of winners are first-time winners; 30% have won more than once before.
Some competitors are able to invest significantly into hardware used to train their solutions, though others who use free hardware like Google Colab are also still able to win competitions.
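Not from the report: below is a minimal sketch of the ensembling point quoted above, blending a gradient-boosted model with a small neural net on synthetic tabular data. The library choice (scikit-learn) and the 50/50 weights are my own illustration, not the winners' recipes.
```python
# Minimal sketch (mine, not the report's): blend a gradient-boosted model with a
# small neural net on synthetic tabular data by averaging their predictions.
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPRegressor

X, y = make_regression(n_samples=2000, n_features=20, noise=0.1, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

gbdt = GradientBoostingRegressor(random_state=0).fit(X_train, y_train)
mlp = MLPRegressor(hidden_layer_sizes=(64, 64), max_iter=500, random_state=0).fit(X_train, y_train)

# Unweighted 50/50 blend; in practice the weights (or a stacking meta-model)
# would be tuned on a held-out validation fold.
blend = 0.5 * gbdt.predict(X_test) + 0.5 * mlp.predict(X_test)
for name, pred in [("gbdt", gbdt.predict(X_test)),
                   ("mlp", mlp.predict(X_test)),
                   ("blend", blend)]:
    print(f"{name:6} MSE: {mean_squared_error(y_test, pred):.3f}")
```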
#ml
Pérez J, Barceló P, Marinkovic J. Attention is Turing-Complete. J Mach Learn Res. 2021;22: 1–35. Available: https://jmlr.org/papers/v22/20-302.html
#ml
Yeh, Catherine, Yida Chen, Aoyu Wu, Cynthia Chen, Fernanda Viégas, and Martin Wattenberg. 2023. “AttentionViz: A Global View of Transformer Attention.” ArXiv [Cs.HC]. arXiv. http://arxiv.org/abs/2305.03210.
#ml
Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)
https://huggingface.co/blog/autoformer
#ml
A family tree shows how transformers are evolving.
(HTML is probably the worst name for a model.)
https://arxiv.org/abs/2302.07730
#ml
Hand-Crafted Transformers
HandCrafted.ipynb - Colaboratory
https://colab.research.google.com/github/newhouseb/handcrafted/blob/main/HandCrafted.ipynb
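Not the notebook's code, just a toy flavour of the idea in NumPy: one attention head whose weights are set by hand (no training) so that every position copies the previous token's embedding. The positional one-hot trick and the constants are my own illustration.
```python
# Toy flavour of hand-crafting attention weights: each position attends to,
# and therefore copies, the token before it.
import numpy as np

def softmax(s, axis=-1):
    e = np.exp(s - s.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

seq_len, d = 5, 4
x = np.random.default_rng(0).normal(size=(seq_len, d))   # token embeddings
h = np.concatenate([x, np.eye(seq_len)], axis=1)         # append one-hot positions

big = 100.0                                              # makes the softmax effectively hard
W_q = np.zeros((d + seq_len, seq_len))
W_k = np.zeros((d + seq_len, seq_len))
W_q[d:, :] = big * np.eye(seq_len, k=-1)                 # query at position i points to slot i-1
W_k[d:, :] = np.eye(seq_len)                             # key at position j occupies slot j

attn = softmax((h @ W_q) @ (h @ W_k).T)                  # row i is ~one-hot at i-1 (row 0 stays uniform)
out = attn @ x                                           # values are the embeddings themselves

print(np.allclose(out[1:], x[:-1], atol=1e-3))           # True: each token copied its predecessor
```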
#ml
An interesting idea: using Hydra to configure ML experiments.
https://github.com/ashleve/lightning-hydra-template
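A minimal sketch of the pattern (mine, not the template's code); `configs/train.yaml` and the override keys in the comment are hypothetical, use whatever your experiment needs:
```python
# train.py: minimal Hydra entry point. "configs/train.yaml" is a hypothetical config.
import hydra
from omegaconf import DictConfig, OmegaConf

@hydra.main(config_path="configs", config_name="train", version_base=None)
def main(cfg: DictConfig) -> None:
    # Hydra composes the yaml with command-line overrides,
    # e.g. `python train.py model.lr=3e-4 trainer.max_epochs=20`,
    # and writes each run into its own output directory.
    print(OmegaConf.to_yaml(cfg))
    # ... build the datamodule / model / trainer from cfg here ...

if __name__ == "__main__":
    main()
```
If I remember the template right, it goes further and instantiates datamodules, models, and callbacks straight from `_target_` entries in the config via hydra.utils.instantiate.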
#ml
Jelassi S, Brandfonbrener D, Kakade SM, Malach E. Repeat after me: Transformers are better than state space models at copying. arXiv [cs.LG]. 2024. Available: http://arxiv.org/abs/2402.01032
Not surprising at all when you have direct access to a long context. But hey, look at this title.
#ml
I got interested in satellite data last year and played with it a bit. It's fantastic. The spatiotemporal nature of it brings up a lot of interesting questions.
Then I saw this paper today:
Rolf, Esther, Konstantin Klemmer, Caleb Robinson, and Hannah Kerner. 2024. “Mission Critical -- Satellite Data Is a Distinct Modality in Machine Learning.” arXiv [Cs.LG], February. http://arxiv.org/abs/2402.01444.
#ml
Like a dictionary
Kunc, Vladimír, and Jiří Kléma. 2024. “Three Decades of Activations: A Comprehensive Survey of 400 Activation Functions for Neural Networks.” arXiv [Cs.LG], February. http://arxiv.org/abs/2402.09092.
#ml
Schmidhuber J. Deep Learning: Our Miraculous Year 1990-1991. In: arXiv.org [Internet]. 12 May 2020 [cited 7 Jul 2024]. Available: https://arxiv.org/abs/2005.05744
#ml
I was searching for a tool to visualize computational graphs and ran into this preprint. The hierarchical visualization idea is quite nice.
https://arxiv.org/abs/2212.10774
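Not the paper's tool, but for a quick look at a graph before reaching for a visualizer, tracing with torch.fx already gives you the node list. TinyNet below is a made-up example module.
```python
# Quick inspection of a model's computational graph via torch.fx.
import torch
import torch.nn as nn
from torch.fx import symbolic_trace

class TinyNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.fc1 = nn.Linear(8, 16)
        self.fc2 = nn.Linear(16, 4)

    def forward(self, x):
        return self.fc2(torch.relu(self.fc1(x)))

traced = symbolic_trace(TinyNet())
for node in traced.graph.nodes:
    # node.op is one of: placeholder, call_module, call_function, call_method, output
    print(f"{node.op:15} {node.name:12} -> {node.target}")
```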
#ml
Meta's second version of Segment Anything.
https://github.com/facebookresearch/segment-anything-2
They have a nice demo:
https://sam2.metademolab.com/
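A rough image-prediction sketch, written from memory of the repo's README, so treat the checkpoint and config names as placeholders that may differ between releases:
```python
# Sketch of SAM 2 image prediction; paths and file names below are placeholders,
# check the repo's README for the exact ones.
import numpy as np
import torch
from PIL import Image
from sam2.build_sam import build_sam2
from sam2.sam2_image_predictor import SAM2ImagePredictor

checkpoint = "./checkpoints/sam2_hiera_large.pt"   # placeholder checkpoint path
model_cfg = "sam2_hiera_l.yaml"                    # placeholder config name
predictor = SAM2ImagePredictor(build_sam2(model_cfg, checkpoint))

image = np.array(Image.open("example.jpg").convert("RGB"))  # any RGB image
with torch.inference_mode(), torch.autocast("cuda", dtype=torch.bfloat16):
    predictor.set_image(image)
    # One foreground point prompt at pixel (x, y); box and mask prompts also work.
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[500, 375]]),
        point_labels=np.array([1]),
    )
```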
#ml
What’s Really Going On in Machine Learning? Some Minimal Models—Stephen Wolfram Writings
https://writings.stephenwolfram.com/2024/08/whats-really-going-on-in-machine-learning-some-minimal-models/