Aspiring Data Science

#masters #featureselection #redundancy #multicollinearity

"In any application involving a large number of variables, it’s nice to be able to identify sets of variables that have significant redundancy. Of course, we may be unlucky and have a situation in which the small differences between largely redundant variables contain the useful information. However, this is the exception. In most applications,it is the redundant information that is most important; if some type of effect impacts multiple variables, it’s probably important. Because dealing with fewer variables is always better, if we can identify groups of variables that have great intra-group redundancy, we may be able to eliminate many variables from consideration, focusing on a weighted average of representatives from each group, or perhaps focusing on a single factor that is highly correlated with a redundant group. Or we might just be interested in the fact of redundancy, garnering useful insight from it."

О таком же подходе, когда несколько коллинеарных факторов заменяются одним, говорил и Эрни Чан. Тим для поиска групп связанных факторов использует PCA.

89 viewsAnatoly Alekseev, edited 09:22

#featureselection #redundancy #indicators #trading #hurst

https://www.youtube.com/watch?v=x_JcExwuu60

YouTube

David Aronson of Hood River on data mining & Hurst signals - at The Trading Show Chicago 2013

David Aronson, President of Hood River Research, gave a presentation at The Trading Show Chicago 2013 on the topic, 'Rapid identification of non-redundant predictors with data mining bias correction and its application to extreme Hurst signals.'

The Trading…

129 viewsAnatoly Alekseev, edited 07:48

About

Blog

Apps

Platform