Aspiring Data Science

#ml #econometrics #timeseries

Современная шпаргалка по эконометрике и временным рядам. Есть Байес.

https://www.linkedin.com/posts/michael-mike-erlihson-phd-8208616_time-series-econometrics-a-concise-course-activity-7078803397931352065-6WKz

Michael (Mike) Erlihson, PhD on LinkedIn: Time-Series Econometrics: A Concise Course

🧠📙 𝗧𝗶𝗺𝗲-𝗦𝗲𝗿𝗶𝗲𝘀 𝗘𝗰𝗼𝗻𝗼𝗺𝗲𝘁𝗿𝗶𝗰𝘀: 𝗔 𝗖𝗼𝗻𝗰𝗶𝘀𝗲 𝗖𝗼𝘂𝗿𝘀𝗲 🚀💥 - an awesome textbook by Francis X. Diebold

𝚃𝚊𝚋𝚕𝚎 𝚘𝚏…

143 viewsAnatoly Alekseev, edited 18:06

Aspiring Data Science

#ml #timeseries #transformers

↓ В моей практике был один случай, когда трансформер из торча очень элегантно решал соревнование по предсказанию продаж Сбермаркета. Автор скрипта - Павел Орлов. На сырых данных его решение давало почти такой же по МЛ качеству результат, как мой вдесятеро больший по коду скрипт со сложным FE. В том соревновании я, кстати, с изумлением узнал, что НЕЛЬЗЯ просто так взять и использовать натренированную модельку классификации, если в test пропорции классов отличаются от train. Нельзя. Вы скажете, но как же, ведь на то и обучение, чтобы по признакам научиться угадывать таргет, а вовсе не по какой-то там пропорции примеров в трейне. Нет. При достаточно отличающемся балансе в тест прогнозы будут смещены настолько, что их придётся пересчитывать по спецформуле. Этому, кстати, не учат нигде ни в книгах, ни на курсах. По кр мере, мне не попадалось. Уникальный контент для подписчиков моего блога )

Pastebin

transformer pipeline - Pastebin.com

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

189 viewsAnatoly Alekseev, edited 13:14

Aspiring Data Science

#ml #timeseries #robustness

Используете ли Вы в задачах классификации рядов какие-то метрики робастности предсказаний? Одно дело, скажем, высокая точность или бриер лосс, но интуитивно неприятно, если прогноз на последовательных точках ряда часто меняется, или вероятности резко скачут туда-сюда от точки к точке. Столкнулся с этим на финансовых рядах. Думаю трекать что-то вроде preds.diff().abs().mean() для предсказаний и самих верных меток, как дополнительную меру качества моделек для TS.

Сразу опять-таки приходит вопрос, а почему устойчивость прогнозов по изменениям входов вообще никак не учитывается в МЛ? Да, можно посчитать важности признаков, но они по большей части считаются вовсе не небольшим варьированием входов. А ведь эта дополнительная мера качества могла бы помочь сделать выбор, скажем, между 2 коррелированными фичами. Обе имеют примерно одинаковое влияние (или одиночные ML метрики на таргет), но по одной прогнозы скачут сильнее. Тогда оставляем вторую. Может, я просто не знаю, и где-то это уже учитывается?

99 viewsAnatoly Alekseev, edited 13:24

Aspiring Data Science

#timeseries

Классно оформленная презентация!

https://www.youtube.com/watch?v=zmfe2RaX-14

YouTube

Aileen Nielsen - Time Series Analysis - PyCon 2017

"Speaker: Aileen Nielsen

Time series analysis is more relevant than ever with the rise of big data, the internet of things, and the general availability of data that follows events through time. This tutorial will introduce participants to the many versatile…

113 viewsAnatoly Alekseev, edited 06:39

Aspiring Data Science

#timeseries #irregular #nonperiodic #lombscargle #gatspy

https://www.youtube.com/watch?v=E4NMZyfao2c

YouTube

Aileen Nielsen - Irregular time series and how to whip them

PyData London 2016

This talk will present best-practices and most commonly used methods for dealing with irregular time series. Though we'd all like data to come at regular and reliable intervals, the reality is that most time series data doesn't come this…

114 viewsAnatoly Alekseev, edited 03:25

Aspiring Data Science

#timeseries #features #otus

https://www.youtube.com/watch?v=miy0KusjPa4

YouTube

Извлечение признаков из временных рядов // Демо занятие курса «Machine Learning. Advanced»

Если на минутку представить, что временной ряд чем-то похож на аудиосигнал, то нам откроется чудесный мир новых способов генерации признаков из сферы обработки сигналов. Мы посмотрим, как дополнительные признаки улучшают качество моделей, научимся пользоваться…

106 viewsAnatoly Alekseev, edited 10:15

Aspiring Data Science

#timeseries #similarity #aeon #elasticdistance #dtw #distances

Корреляция Пирсона тихо плачет в сторонке

https://www.youtube.com/watch?v=O5cnKAUBKkg

YouTube

Lets do the time warp again: time series machine learning with distance functions | PDAMS 2023

Many algorithms for machine learning from time series are based on measuring the distance or similarity between series. The most popular distance measure is dynamic time warping, which attempts to optimally realign two series to compensate for offest. There…

107 viewsAnatoly Alekseev, edited 06:59

Aspiring Data Science

#timeseries #dsp #fourier #fft #psd

"After we have transformed a signal to the frequency-domain, we can extract features from each of these transformed signals and use these features as input in standard classifiers like Random Forest, Logistic Regression, Gradient Boosting or Support Vector Machines.

Which features can we extract from these transformations? A good first step is the value of the frequencies at which oscillations occur and the corresponding amplitudes. In other words; the x and y-position of the peaks in the frequency spectrum."

https://ataspinar.com/2018/04/04/machine-learning-with-signal-processing-techniques/

ML Fundamentals

Machine Learning with Signal Processing Techniques

[latexpage] Introduction Stochastic Signal Analysis is a field of science concerned with the processing, modification and analysis of (stochastic) signals. Anyone with a background in Physics or En…

104 viewsAnatoly Alekseev, edited 07:55

Aspiring Data Science

#timeseries #wavelets #scaleogram #dwt #cwt

В комментах сравнительная таблица точностей разных вейвлетов на нескольких ML-задачах.

https://ataspinar.com/2018/12/21/a-guide-for-using-the-wavelet-transform-in-machine-learning/

92 viewsAnatoly Alekseev, edited 09:16

About

Blog

Apps

Platform