Data Mining and Machine Learning Applications - Group of authors - Page 40
2.2.5.1 The Class Imbalance Issue
Learning algorithms are widely used during the pattern extraction phase of the data mining process. Because this process deals with "real-world" data, several problems have emerged in applying existing, well-established learning algorithms to real data. Among them, a relevant practical problem is learning in the presence of imbalanced class distributions. Many learning algorithms were designed assuming balanced class distributions, that is, no significant differences in class prior probabilities. However, this is not always the case in real data, where one class may be represented by a large number of examples while the others are represented by only a few. Typically, imbalanced data sets arise whenever one class represents a circumscribed concept while the other class represents the counterpart of that concept, so that examples from the counterpart class heavily outnumber examples from the positive concept class. In this situation, the inductive bias of learning algorithms that are not specifically designed to handle imbalanced class distributions tends to focus on the class represented by the largest number of examples [28].
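The effect described above can be illustrated with a minimal sketch. The data set here is hypothetical (990 "negative" and 10 "positive" labels, invented for illustration), and the always-predict-the-majority rule is an extreme caricature of the bias, not an algorithm taken from the text. The helper names `class_priors`, `majority_class`, `accuracy`, and `recall` are likewise assumptions made for this example.

```python
from collections import Counter


def class_priors(labels):
    """Estimate class prior probabilities as relative frequencies."""
    counts = Counter(labels)
    total = len(labels)
    return {cls: n / total for cls, n in counts.items()}


def majority_class(labels):
    """Return the label that occurs most often."""
    return Counter(labels).most_common(1)[0][0]


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true, y_pred, positive):
    """Fraction of truly positive examples that were predicted positive."""
    true_pos = sum(t == positive and p == positive
                   for t, p in zip(y_true, y_pred))
    actual_pos = sum(t == positive for t in y_true)
    return true_pos / actual_pos if actual_pos else 0.0


# Hypothetical imbalanced data set: the counterpart class heavily
# outnumbers the positive concept class.
labels = ["negative"] * 990 + ["positive"] * 10

priors = class_priors(labels)

# Caricature of a learner biased toward the largest class:
# it predicts the majority label for every example.
predictions = [majority_class(labels)] * len(labels)

acc = accuracy(labels, predictions)
rec = recall(labels, predictions, positive="positive")

print("class priors:", priors)
print("accuracy:", acc)
print("recall on positive class:", rec)
```

Under these assumptions the overall accuracy looks high even though no positive example is ever recognized, which is why accuracy alone can be misleading on imbalanced data.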