Imbalanced Predictions
The aim of the thesis is to evaluate solutions to the class imbalance problem using real world data sets with varying degrees of class imbalance. The analysis is limited to binary classification. Three large data sets relating to credit card fraud, vehicle insurance and heart disease are used for the analysis. Several methods are compared and evaluated. Logistic regression, SVC and decision trees