Am Neumarkt 😱
Machine learning and other gibberish
Archives: https://datumorphism.leima.is/amneumarkt/
#smarthome #misc

I somehow have five different brands of smart home products in our little apartment.
I have no idea what is going on in the smart home industry. Every brand has its own app, hub, or even protocol, so I had to install five different apps just to initialize the devices. In principle, I could ditch these apps after setup and use Google/Alexa only, but that is still extremely inconvenient, as Google/Alexa doesn't support all the fancy functions of the devices.

Any solutions to this problem?
#fun

Every chemistry graduate will be in charge of a molecule. Someone has got to take care of “Titin” (189,819 characters), and she/he will have to recite the name first in every meeting: https://en.wiktionary.org/wiki/Appendix:Protologisms/Long_words/Titin#Noun

https://xkcd.com/2602/
#jobs

I vaguely feel there's a talent shortage in Germany. "Hiring is hard": I have heard this several times. Our team also needs more hires.

So the company came up with this: land a job at Zalando within 3 days of the final interviews!

https://jobs.zalando.com/en/jobs/4004181/?gh_src=%20f46af3281us
Anonymous poll: Which language(s) do you prefer for reading?
- English: 73%
- Chinese (中文): 46%
#ml #statistics

I read about conformal prediction a while ago and realized that I need to understand the theory behind hypothesis testing better. As someone from the natural sciences, I have mostly worked within the Neyman-Pearson framework.
So I explored the topic a bit and found two nice papers; see the list below. If you know of other papers on similar topics, I would appreciate some comments.

1. Perezgonzalez JD. Fisher, Neyman-Pearson or NHST? A tutorial for teaching data testing. Front Psychol. 2015;6: 223. doi:10.3389/fpsyg.2015.00223 https://www.frontiersin.org/articles/10.3389/fpsyg.2015.00223/full
2. Lehmann EL. The Fisher, Neyman-Pearson Theories of Testing Hypotheses: One Theory or Two? J Am Stat Assoc. 1993;88: 1242–1249. doi:10.2307/2291263
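
For context, here is a minimal sketch of split conformal prediction as I understand it (the function and variable names are my own illustration, not taken from the papers above):

```python
import numpy as np

def split_conformal_interval(cal_residuals, y_hat_test, alpha=0.1):
    """Prediction intervals via split conformal prediction."""
    # cal_residuals: |y - y_hat| on a held-out calibration set
    # alpha: miscoverage level (0.1 -> roughly 90% coverage)
    n = len(cal_residuals)
    # Finite-sample-corrected quantile of the calibration scores.
    level = min(1.0, np.ceil((n + 1) * (1 - alpha)) / n)
    q = np.quantile(cal_residuals, level, method="higher")
    return y_hat_test - q, y_hat_test + q

# Toy usage: a "model" that always predicts zero for N(0, 1) data.
rng = np.random.default_rng(0)
y_cal = rng.normal(size=1000)
lo, hi = split_conformal_interval(np.abs(y_cal), np.zeros(5))
# Each (lo, hi) interval covers a fresh draw about 90% of the time.
```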
#work

I realized something interesting about time management.

If I open my calendar now, I see “tiles” of meetings filling up most of my working hours. It looks bad, but it used to be even worse. The problem is that if meetings occupy my working hours, I have to work extra hours to do the actual thinking and analysis. It is rather cruel.

So what changed? I think I realized the power of Google Docs. Instead of many people talking and nobody listening, someone writes up a draft first and sends it out to colleagues. Then, once people get the link to the doc, everyone can add comments.

This doesn’t sound very different from meetings? Oh, it is very different. The workflow can be async. We are not forced to spend our precious focus time in meetings. We can read and comment on the document whenever we like: when we are commuting, when we are taking a dump, when we are on a phone/tablet, just, any, time.

Apart from the async workflow, I also like the "think, comment, and forget" idea. People deliver better ideas when we think first, comment next, and forget about it unless there are replies to our comments. No pressure, no useless debates.
#ml

I have heard about the information bottleneck so many times, but I never really went back and read the original papers.

I finally spent some time on it and found it quite interesting. It is philosophically rooted in ideas from Vapnik's The Nature of Statistical Learning Theory, where he discussed how generalization works by enforcing parsimony.
The most interesting thing in this information bottleneck paper is the quantified generalization gap and complexity gap. With these, we know where to go on the information plane.
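
For reference, the standard information bottleneck objective the paper builds on (my transcription): minimize, over the encoder $p(t \mid x)$,

$$\mathcal{L} = I(X;T) - \beta\, I(T;Y),$$

where $T$ is the compressed representation of $X$. The first term enforces compression (parsimony), and $\beta$ sets how much predictive information about $Y$ must be retained.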

It's a good read.

Tishby N, Zaslavsky N. Deep Learning and the Information Bottleneck Principle. arXiv [cs.LG]. 2015. Available: http://arxiv.org/abs/1503.02406
#ml

I came across this post this morning and realized that the reason I am not writing much in Julia is simply that I don't know how to write quality code in Julia.

When we build a model in Python, we know all the details that make it quality code. For a new language, I'm just terrified by the number of details I need to be aware of.

Ah, I'm getting older.

JAX vs Julia (vs PyTorch) · Patrick Kidger
https://kidger.site/thoughts/jax-vs-julia/
#ml

https://ts.gluon.ai/

Highly recommended! If you are working on deep learning for forecasting, gluonts is a great package.
It takes care of all the tedious data preprocessing, slicing, and backtesting work, so we can spend our time implementing the models themselves (and there are a lot of ready-to-use models).
What's even better, we can use PyTorch Lightning!
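
A minimal sketch of what a training run looks like (from memory; exact module paths and arguments vary across gluonts versions, so treat the names here as assumptions):

```python
from gluonts.dataset.repository.datasets import get_dataset
from gluonts.evaluation import make_evaluation_predictions
from gluonts.torch.model.deepar import DeepAREstimator

dataset = get_dataset("m4_hourly")  # a built-in benchmark dataset

estimator = DeepAREstimator(
    freq=dataset.metadata.freq,
    prediction_length=dataset.metadata.prediction_length,
    trainer_kwargs={"max_epochs": 5},  # forwarded to PyTorch Lightning's Trainer
)
predictor = estimator.train(dataset.train)

# Rolling predictions over the test set, ready for evaluation.
forecast_it, ts_it = make_evaluation_predictions(
    dataset.test, predictor=predictor, num_samples=100
)
```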

See this repository for a list of transformer-based forecasting models.
https://github.com/kashif/pytorch-transformer-ts
#python

This post is a retro on how I learned Python.

Disclaimer: I cannot claim to be a master of Python. This post is a retrospective of how I learned Python in different stages.

I started using Python back in 2012. Before this, I was mostly a Matlab/C user.

Python is easy to get started with, yet hard to master. People coming from other languages can easily make things work but will write some "disgusting" Python code. This is why Python people talk about being "pythonic" all the time: rather than an actual style guide, it is a philosophy of style.

When we get started, we are most likely not interested in [PEP8](https://peps.python.org/pep-0008/) and [PEP257](https://peps.python.org/pep-0257/); instead, we focus on making things work. After some lectures at university (or from whatever other sources), we start to get some sense of style. Then we write code and use Python in real projects, and we begin to realize that Python is strange and sometimes doesn't even make sense. So we start learning about the philosophy behind it. At some point, we get peer reviews and probably fight with each other over the philosophies we have accumulated throughout the years.
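
A toy example of the style shift I mean (my own illustration, not from the PEPs):

```python
values = [1, 2, 3, 4, 5, 6]

# Matlab/C habit: manual index bookkeeping.
squares = []
for i in range(len(values)):
    if values[i] % 2 == 0:
        squares.append(values[i] ** 2)

# "Pythonic": iterate directly and use a comprehension.
squares = [v ** 2 for v in values if v % 2 == 0]  # [4, 16, 36]
```

Both versions produce the same list; the second reads as a description of the result rather than a recipe of steps, which is roughly what "pythonic" is about.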

The attached drawing (in the comments) somehow captures the path I went through. It is not a monotonic path of any sort; it is most likely permutation invariant and cyclic. But the bottom line is that mastering Python requires a lot of struggle, fights, and relearning. And one of the most effective methods is peer review, just as with any other learning task in our lives.

Peer review makes us think, and it is very important to find good reviewers. Don't just stay in a silo and admire your own code. To me, this whole journey helped me build one of the most important philosophies of my life: embrace open source and collaborate.