Am Neumarkt 😱
Machine learning and other gibberish
Archives: https://datumorphism.leima.is/amneumarkt/
#data

Played with polars a bit. It's actually quite fast.

https://www.pola.rs/
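A minimal sketch of the kind of query I tried (the CSV file and column names are made up; recent polars versions spell it group_by, older ones groupby):

import polars as pl

# hypothetical sales.csv with columns: product, amount
df = pl.read_csv("sales.csv")

# lazy query: nothing runs until .collect()
result = (
    df.lazy()
    .filter(pl.col("amount") > 0)
    .group_by("product")
    .agg(pl.col("amount").sum().alias("total"))
    .sort("total", descending=True)
    .collect()
)
print(result)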
#ml

This is amazing

compiled_model = torch.compile(model)

https://pytorch.org/get-started/pytorch-2.0/
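A minimal sketch of what this looks like in practice (the toy model and shapes are just for illustration):

import torch
import torch.nn as nn

# any nn.Module works; this toy model is only for illustration
model = nn.Sequential(nn.Linear(128, 256), nn.ReLU(), nn.Linear(256, 10))

# requires PyTorch >= 2.0; the first call triggers compilation,
# later calls reuse the compiled graph
compiled_model = torch.compile(model)

x = torch.randn(32, 128)
out = compiled_model(x)  # same result as model(x), usually faster after warm-up
print(out.shape)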
#ml

In his MinT paper, Hyndman said he confused these two quantities in his previous paper. 😂

MinT is a simple method to make forecasts with hierarchical structure coherent. Here coherent means the sum of the lower level forecasts equals the higher level forecasts.

For example, our time series may have a structure like: sales of coca cola + sales of spirits = sales of beverages. If this relation holds for our forecasts, we have coherent forecasts.

This may sound trivial, but the problem is in fact hard. There are simple methods, such as forecasting only the lower levels (coca cola, spirits) and then using their sum as the higher level (sales of beverages), but these are usually too naive to be effective.

MinT is a reconciliation method that combines high level forecasts and the lower level forecasts to find an optimal combination/reconciliation.

https://robjhyndman.com/papers/MinT.pdf
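To make the idea concrete, here is a toy sketch of the reconciliation step in numpy. With W set to the identity this is really the OLS special case; MinT proper plugs in an estimate of the base forecast error covariance. The numbers are made up.

import numpy as np

# hierarchy: beverages = coca cola + spirits
# summing matrix S maps the bottom-level series to [beverages, coca cola, spirits]
S = np.array([
    [1.0, 1.0],  # beverages (total)
    [1.0, 0.0],  # coca cola
    [0.0, 1.0],  # spirits
])

# incoherent base forecasts for [beverages, coca cola, spirits]: 100 != 60 + 30
y_hat = np.array([100.0, 60.0, 30.0])

# W: covariance of the base forecast errors; identity -> OLS reconciliation,
# MinT would use an estimated covariance matrix here
W_inv = np.linalg.inv(np.eye(3))

# reconciled forecasts: y_tilde = S (S' W^-1 S)^-1 S' W^-1 y_hat
G = np.linalg.inv(S.T @ W_inv @ S) @ S.T @ W_inv
y_tilde = S @ G @ y_hat

print(y_tilde)  # coherent: the first entry now equals the sum of the other two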
#data

https://evidence.dev/

I like the idea. My last dashboarding tool for work was Streamlit. Streamlit is lightweight and fast, but it requires Python code and a Python server.

Evidence is mostly markdown and SQL. For many lightweight dashboarding tasks, this is just sweet.

Evidence is built on Node. I could run a server to provide live updates, but I can also just build a static website by running npm run build.

Played with it a bit. Nothing to complain about at this point.
#data

Just got my ticket.

I have been reviewing proposals for PyData this year. I saw some really cool proposals, so I finally decided to attend the conference.

https://2023.pycon.de/blog/pyconde-pydata-berlin-tickets/
#data

In physics, people claim that more is different. In the data world, more is very different. I'm no expert in big data, but I only learned about scaling problems when I started working in the corporate world.

I like the following from the author.

> data sizes increase much faster than compute sizes.

In deep learning, many models follow a scaling law relating performance to dataset size. Indeed, more data brings better performance, but the gains slow down dramatically. Business doesn't need a perfect model, and computation costs money. At some point, we simply have to cut the dataset, even if we have all the data in the world.
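A toy illustration of the diminishing returns (the power-law exponent is made up for illustration, not taken from any particular paper):

# error ~ n ** (-alpha): every 10x more data shaves off the same constant
# factor, so absolute gains keep shrinking while costs keep growing
alpha = 0.1

for n in [1e6, 1e7, 1e8, 1e9]:
    print(f"{n:.0e} samples -> relative error {n ** -alpha:.3f}")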

So ..., data hoarding is probably fine, but our models might not need that much.

https://motherduck.com/blog/big-data-is-dead/
#misc

Douban (豆瓣), but for research papers.

https://42papers.com/